Gene Francci3_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1669 
Symbol 
ID3903056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2003424 
End bp2004497 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content68% 
IMG OID637879007 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_480774 
Protein GI86740374 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACG TCCGTCCACC GCCCAGCCGC ATGCGTCGCC GCGCAGGCGC ACCGATCGAA 
AGGCACACCA TGTCGACGCT CCTCGTGACC GGAGCCGCCG GGTTCATCGG ATCGAACTTC
GTCCGCTACT GGCGGACGCG GCATCCGGAG GACGCGGTCG TGGCGCTCGA CGCCCTGACC
TACGCCGGCT GCCGGGAGAA CCTCGCCGAC GTCGCGGACC GCGTCACGTT CGTCCACGGC
GACATCCGTG ACCAGGAGCT CGTCGAGTCG GTGCTGCGGG AGCACTCCGT GGACGTGGTG
GTGAACTTCG CCGCCGAGTC GCACAACAGC CTGGCGATCA TCCGCCCCGG CGAGTTCTTC
GCGACGAACG TGATGGGTAC CCAGACCCTG CTCGAGGCGG CACGCACGGT CGGGGTGGCC
CGCTTCCACC AGATCTCCAC CTGCGAGGTC TACGGCGACA TGGACCTCAA CGACCCCGGT
GCCTTCACCG AGGACTCCCC CTACCTCCCC CGCACGCCCT ACAACGCGGC GAAGGCCGGC
GGGGACCACG CCGTACGCGC CTACGGCTTC ACCTACAACC TCCCCGTGAC GATCACCAAC
TGCTCGAACA ACTACGGTCC CTACCAGTTC CCGGAGAAGG TCATCCCCCT GTTCGTAACC
CGGGCGCTGC AGGGCGAGTC GCTCCCGATG TACGCCTCCA CCACGAACCG GCGCGAGTGG
CTGCACGTGA TGGACCACTG CCGGGCGATC GACGCGGTTC TGGACCGTGG GCGGCTCGGC
GAGACCTACC ACGTCGGATC CGGCGTCGAG GCGGACATCG AGACGATCGC CGACACGGTG
CTCGCCGAGC TCGGTCTGCC CGCGTCGTTG AAGACGATCG TGCCCGACCG CCCCTCGCAC
GACCGCCGCT ACCTGCTGGA CTCCACCAAG CTGCGGACCG AGCTGGGCTG GACGCCGTTG
ATCGACTTCG CCGAGGGCAT GCGGTCGACC ATCGCCTGGT ACAAGGAGAA CGAAGCCTGG
TGGCGTCCGC TGCTCGGTCG CTCCCCGGTC TCCGAAACGG CCTGGACGAG CTGA
 
Protein sequence
MGDVRPPPSR MRRRAGAPIE RHTMSTLLVT GAAGFIGSNF VRYWRTRHPE DAVVALDALT 
YAGCRENLAD VADRVTFVHG DIRDQELVES VLREHSVDVV VNFAAESHNS LAIIRPGEFF
ATNVMGTQTL LEAARTVGVA RFHQISTCEV YGDMDLNDPG AFTEDSPYLP RTPYNAAKAG
GDHAVRAYGF TYNLPVTITN CSNNYGPYQF PEKVIPLFVT RALQGESLPM YASTTNRREW
LHVMDHCRAI DAVLDRGRLG ETYHVGSGVE ADIETIADTV LAELGLPASL KTIVPDRPSH
DRRYLLDSTK LRTELGWTPL IDFAEGMRST IAWYKENEAW WRPLLGRSPV SETAWTS