Gene EcolC_3904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3904 
Symbol 
ID6064380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4285723 
End bp4287063 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID641603318 
Productanaerobic C4-dicarboxylate transporter 
Protein accessionYP_001726833 
Protein GI170021879 
COG category[R] General function prediction only 
COG ID[COG2704] Anaerobic C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases
[TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTA CTATCCAACT TATCATAATA CTGATATGTC TGTTTTATGG TGCCCGAAAG 
GGTGGTATCG CGCTGGGTTT ATTAGGCGGT ATCGGTCTGG TCATTCTGGT CTTCGTCTTC
CACCTTCAGC CAGGTAAACC ACCGGTTGAT GTCATGCTGG TTATCATTGC GGTGGTTGCG
GCATCGGCGA CCTTGCAAGC TTCGGGCGGT CTTGATGTCA TGCTGCAAAT TGCCGAGAAG
CTGCTGCGCC GCAACCCGAA ATATGTCTCA ATTGTCGCGC CGTTTGTGAC CTGTACACTG
ACCATTCTTT GCGGTACGGG TCATGTGGTT TACACCATTC TGCCGATCAT CTACGACGTC
GCCATTAAGA ACAACATCCG TCCGGAACGT CCGATGGCGG CAAGTTCTAT CGGTGCACAG
ATGGGGATTA TCGCCAGTCC GGTGTCGGTT GCGGTCGTGT CTCTGGTTGC AATGCTGGGT
AATGTCACCT TTGATGGTCG CCATCTTGAG TTCCTCGACC TGCTGGCAAT CACCATTCCA
TCGACGTTAA TCGGTATCCT GGCGATCGGT ATCTTCAGCT GGTTCCGCGG TAAAGATCTG
GATAAAGACG AAGAGTTCCA GAAATTCATC TCCGTACCGG AAAACCGTGA GTATGTTTAC
GGTGATACCG CGACGCTGCT CGATAAAAAA CTGCCGAAAA GCAACTGGCT GGCAATGTGG
ATTTTCCTCG GGGCAATCGC TGTAGTCGCA CTTCTTGGTG CTGATTCGGA CCTGCGTCCA
TCCTTCGGCG GCAAACCGCT GTCGATGGTA CTGGTTATTC AGATGTTTAT GCTGCTGACC
GGGGCGCTGA TTATTATCCT GACCAAAACC AATCCCGCGT CTATCTCAAA AAACGAAGTC
TTCCGTTCCG GTATGATCGC CATCGTGGCG GTGTACGGTA TCGCATGGAT GGCAGAAACC
ATGTTCGGTG CGCATATGTC TGAAATTCAG GGCGTACTGG GTGAAATGGT GAAAGAGTAT
CCGTGGGCCT ATGCCATTGT TCTGCTGCTG GTTTCCAAGT TTGTAAACTC TCAGGCTGCG
GCGCTGGCGG CGATTGTTCC GGTCGCGCTG GCGATCGGCG TTGATCCGGC ATACATCGTG
GCTTCAGCAC CGGCTTGCTA CGGTTATTAC ATCCTGCCGA CTTATCCGAG CGATCTGGCA
GCGATTCAGT TTGACCGTTC CGGCACCACC CACATCGGTC GCTTCGTCAT CAACCACAGC
TTTATTCTGC CGGGGTTGAT TGGTGTGAGC GTATCGTGCG TCTTCGGCTG GATCTTCGCC
GCGATGTACG GGTTCTTATA A
 
Protein sequence
MLFTIQLIII LICLFYGARK GGIALGLLGG IGLVILVFVF HLQPGKPPVD VMLVIIAVVA 
ASATLQASGG LDVMLQIAEK LLRRNPKYVS IVAPFVTCTL TILCGTGHVV YTILPIIYDV
AIKNNIRPER PMAASSIGAQ MGIIASPVSV AVVSLVAMLG NVTFDGRHLE FLDLLAITIP
STLIGILAIG IFSWFRGKDL DKDEEFQKFI SVPENREYVY GDTATLLDKK LPKSNWLAMW
IFLGAIAVVA LLGADSDLRP SFGGKPLSMV LVIQMFMLLT GALIIILTKT NPASISKNEV
FRSGMIAIVA VYGIAWMAET MFGAHMSEIQ GVLGEMVKEY PWAYAIVLLL VSKFVNSQAA
ALAAIVPVAL AIGVDPAYIV ASAPACYGYY ILPTYPSDLA AIQFDRSGTT HIGRFVINHS
FILPGLIGVS VSCVFGWIFA AMYGFL