Gene EcolC_0012 details


Gene Information

Locus tag: EcolC_0012
Symbol:
ID: 6068536
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 12047
End bp: 13339
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 54%
IMG OID: 641599417
Product: d-galactonate transporter
Protein accession: YP_001723027
Protein GI: 170018073
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00881] phosphoglycerate transporter family protein
            [TIGR00893] d-galactonate transporter
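
The gene length and protein length listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 12047
end_bp = 13339


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1293 430
```

Both values agree with the Gene Length and Protein Length fields of the record.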


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG 
ATCTTTATTA CGGTAGTCAT TTGTTATGTC GACCGCGCCA ACCTGGCCGT GGCTTCCGCC
CATATTCAGG AAGAGTTCGG CATTACCAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC
GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCT
CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACTTTATT CCAGGGCTTT
GCCACGGGCT TAATGTCATT AATTGGTCTG CGCGCGATAA CCGGTATTTT CGAAGCGCCT
GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT
GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC GCCGCTGCTG
ATCTGGATCC AGGAGCTGTT GAGCTGGCAC TGGGTGTTCA TTGTCACCGG TGGTATCGGC
ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGCCTGAC CAAAGGCATC
AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG CGATGCACCG
GTGAAGAAAG AGGCGCGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT
AAACTGATCG GCGTCTATCT TGGGCAATTT GCGGTGGCTT CTACACTGTG GTTTTTCTTA
ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGGATCA CGGCGCTGAA AGCGGGCTTT
ATGACCACGG TACCGTTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTCGCG
GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC
GGCTTGCTGA TCTCCACCTG CATTATGGGT GCTAACTACA CTAACGATCC GATGATGATT
ATGTGCCTGA TGGCGCTGGC ATTCTTCGGC AACGGTTTTG CTTCGATTAC CTGGTCGCTG
GTCTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GCGGCGTGTT TAACTTCGCC
GATGGTCTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT
TTCGCACCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTACATC
CTGCTGGTGG GCGATGTGAA GCGCGTTGGC TAA
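
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any calculation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and only the first two blocks are used as input; the record's 54% figure for the whole gene is not recomputed here.

```python
def clean_sequence(raw):
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as laid out in the record.
raw = "ATGGATATTC CCGTTAATGC"
seq = clean_sequence(raw)
print(len(seq), gc_percent(seq))
```

Passing the full sequence block through `clean_sequence` first gives a value that can be compared against the GC content field.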
 
Protein sequence
MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVASA HIQEEFGITK AEMGYVFSAF 
AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP
AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQELLSWH WVFIVTGGIG
IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR
KLIGVYLGQF AVASTLWFFL TWFPNYLTQE KGITALKAGF MTTVPFLAAF VGVLLSGWVA
DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL
VSSLAPMRLI GLTGGVFNFA DGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI
LLVGDVKRVG
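
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal translator, assuming the amino acid assignments of table 11 are the same as the standard code (the two differ in permitted start codons, which this sketch does not handle). `translate` is an illustrative helper, not a database tool.

```python
# Codon table built from the NCBI-style compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    cds = "".join(cds.split()).upper()  # remove block spacing and line breaks
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
print(translate("ATGGATATTC CCGTTAATGC AGCAAAGCCG"))  # MDIPVNAAKP
```

The output matches the first 10 residues of the protein sequence, and the last four codons of the gene (CGC GTT GGC TAA) give the final residues RVG followed by the stop.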