Gene EcolC_2115 details


Gene Information

Locus tag: EcolC_2115
Symbol:
ID: 6067107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2311582
End bp: 2312865
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 47%
IMG OID: 641601523
Product: major facilitator superfamily metabolite/H(+) symporter
Protein accession: YP_001725082
Protein GI: 170020128
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00883] metabolite-proton symporter
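
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 2311582
END_BP = 2312865


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1284
print(protein_length(length_bp))  # 427
```

Both results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields listed in the record.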


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0135318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTCC AGTTATATTC GCTCGGCGCA GCGTTAGTGT TTCATGAAAT ATTTTTTCCT 
GAATCATCAA CGGCAATGGC GTTAATTCTG GCAATGGGAA CCTACGGTGC AGGTTATGTG
GCGCGTATTG TCGGAGCATT TATTTTCGGC AAAATGGGCG ACAGAATAGG GCGTAAAAAA
GTGCTCTTTA TTACCATCAC CATGATGGGG ATCTGTACCA CCTTAATTGG TGTGTTACCG
ACCTATGCAC AGATTGGTGT TTTTGCCCCC ATCTTGCTGG TGACGCTGCG TATTATTCAG
GGATTGGGTG CAGGTGCGGA AATTTCCGGT GCCGGTACGA TGCTGGCGGA ATATGCGCCA
AAAGGTAAGC GCGGAATTAT CTCCTCATTT GTGGCTATGG GAACTAACTG CGGAACTTTA
AGCGCAACGG CAATCTGGGC CTTTATGTTC TTCATTCTCA GTAAAGAGGA ACTGCTGGCG
TGGGGATGGC GTATACCGTT CCTGGCGAGC GTTGTCGTGA TGGTCTTTGC TATCTGGTTG
CGTATGAATC TGAAAGAAAG CCCGGTTTTT GAGAAGGTTA ACGACAGCAA CCAACCGACA
GCAAAACCTG CACCTGCTGG TAGCATGTTC CAGAGCAAAT CCTTCTGGCT GGCAACAGGG
CTGCGTTTTG GTCAGGCGGG TAACTCAGGT TTAATTCAGA CTTTCCTTGC GGGCTATTTA
GTGCAGACGT TATTGTTTAA CAAAGCAATT CCAACAGATG CATTGATGAT CAGTTCGATT
CTCGGCTTTA TGACCATTCC GTTCCTTGGT TGGTTATCCG ATAAAATTGG TCGCCGGATC
CCGTATATTA TTATGAATAC CTCCGCGATT GTGCTGGCAT GGCCAATGCT TTCTATCATC
GTAGATAAAA GCTATGCCCC GAGCACCATT ATGGTTGCAC TGATTGTGAT TCATAACTGT
GCGGTGCTGG GATTATTTGC TCTGGAAAAT ATTACCATGG CAGAAATGTT CGGCTGTAAA
AACCGCTTTA CCCGGATGGC TATTTCTAAA GAAATTGGTG GTCTTATCGC TTCCGGTTTT
GGTCCTATCC TGGCGGGTAT TTTCTGCACC ATGACGGAAT CCTGGTATCC GATCGCCATT
ATGATCATGG CATATTCAGT GATTGGTTTA ATCTCTGCGC TGAAAATGCC AGAAGTGAAA
GACCGTGATT TAAGTGCGCT GGAAGACGCC GCGGAAGATC AACCGCGTGT TGTAAGAGCT
GCGCAACCTT CCAGAAGTCT GTAA
 
Protein sequence
MDFQLYSLGA ALVFHEIFFP ESSTAMALIL AMGTYGAGYV ARIVGAFIFG KMGDRIGRKK 
VLFITITMMG ICTTLIGVLP TYAQIGVFAP ILLVTLRIIQ GLGAGAEISG AGTMLAEYAP
KGKRGIISSF VAMGTNCGTL SATAIWAFMF FILSKEELLA WGWRIPFLAS VVVMVFAIWL
RMNLKESPVF EKVNDSNQPT AKPAPAGSMF QSKSFWLATG LRFGQAGNSG LIQTFLAGYL
VQTLLFNKAI PTDALMISSI LGFMTIPFLG WLSDKIGRRI PYIIMNTSAI VLAWPMLSII
VDKSYAPSTI MVALIVIHNC AVLGLFALEN ITMAEMFGCK NRFTRMAISK EIGGLIASGF
GPILAGIFCT MTESWYPIAI MIMAYSVIGL ISALKMPEVK DRDLSALEDA AEDQPRVVRA
AQPSRSL
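
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the tool the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Whitespace in the 10-base blocks is stripped before processing.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino-acid assignments
# shared by the standard code and bacterial translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGATTTCC AGTTATATTC GCTCGGCGCA"))  # MDFQLYSLGA
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_percent` can be used to compare against the GC content field.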