Gene EcolC_2143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2143 
Symbol 
ID6064726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2338828 
End bp2339820 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID641601551 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001725110 
Protein GI170020156 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTC GCTACGGTTG GGAACTGGCT CTTGCCGCAC TGCTCGTTAT TGAGATTGTC 
GCATTTGGTG CAATTAACCC GCGGATGTTA GATCTCAATA TGTTGCTGTT CAGCACCAGT
GACTTTATCT GCATTGGCAT TGTCGCCCTA CCGCTGACGA TGGTGATTGT CAGTGGCGGG
ATCGATATTT CGTTTGGTTC GACCATCGGC CTCTGCGCCA TTGCATTGGG CGTACTGTTT
CAAAGTGGTG TGCCGATGCC GCTGGCGATA CTCCTGACCT TACTGCTCGG CGCATTGTGC
GGGCTGATCA ACGCCGGATT AATTATCTAT ACCAAAGTTA ACCCGCTGGT GATTACGCTT
GGCACGCTGT ATCTGTTTGC CGGAAGCGCT CTGCTGCTTT CCGGTATGGC CGGAGCGACG
GGGTACGAAG GTATTGGTGG ATTCCCGATG GCGTTTACAG ATTTCGCTAA CCTGGATGTG
CTGGGACTCC CCGTTCCGCT GATTATCTTC CTGATATGTC TCCTCGTTTT CTGGCTCTGG
CTGCATAAAA CCCATGCCGG ACGTAATGTG TTTTTGATTG GGCAAAGCCC GCGCGTGGCG
CTTTATAGCG CGATTCCAGT TAACCGCACC TTATGTGCGC TCTATGCCAT GACGGGGCTG
GCGTCTGCGG TCGCCGCTGT GCTGCTGGTA TCGTATTTTG GTTCAGCACG TTCCGATCTC
GGTGCGTCGT TTCTGATGCC CGCCATCACC GCCGTGGTGC TTGGCGGTGC CAATATTTAT
GGTGGTTCCG GTTCCATTAT CGGCACCGCC ATTGCGGTTT TATTAGTGGG ATATTTGCAA
CAAGGTTTGC AAATGGCAGG AGTGCCAAAT CAGGTGTCCA GCGCCCTTTC CGGTGCGCTA
CTTATCGTCG TTGTCGTAGG TCGTTCCGTT AGCCTGCATC GCCAGCAAAT TAAAGAGTGG
CTGGCGCGTC GGGCCAATAA CCCATTGCCA TAA
 
Protein sequence
MRIRYGWELA LAALLVIEIV AFGAINPRML DLNMLLFSTS DFICIGIVAL PLTMVIVSGG 
IDISFGSTIG LCAIALGVLF QSGVPMPLAI LLTLLLGALC GLINAGLIIY TKVNPLVITL
GTLYLFAGSA LLLSGMAGAT GYEGIGGFPM AFTDFANLDV LGLPVPLIIF LICLLVFWLW
LHKTHAGRNV FLIGQSPRVA LYSAIPVNRT LCALYAMTGL ASAVAAVLLV SYFGSARSDL
GASFLMPAIT AVVLGGANIY GGSGSIIGTA IAVLLVGYLQ QGLQMAGVPN QVSSALSGAL
LIVVVVGRSV SLHRQQIKEW LARRANNPLP