Gene ECH74115_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3297 
Symbol 
ID6972024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3029185 
End bp3030435 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID643387109 
Productnucleoside transporter, NupC family 
Protein accessionYP_002271573 
Protein GI209396884 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0458092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.103773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTACTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GCATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGTGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCT
CTGGTCGGAC CGAAAATGGA CACCTTATTT GATGGCGCAG GATTTATCTT TGGTTTCAGG
GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG
ATTGAGTCAT TCGTCGCGGT CACTACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATT
GTGAAGCCCT TTATCGATCG GCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGCGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTATGCCG CACTGGGCGT GCCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCCGGGGGGA TCTTGTTTGC CCGCCTGCTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCT TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCCGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCA
GGTGTGGCGA CAGTAGTGAT GGCATTCGTC GCCATCATTG CGTTAATTAA CGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGAA CATGCCTCGC TGGAGTCCAT TGTAGGTTAT
CTGCTGGCCC CACTGGCGTG GGTAATGGGT GTTGACTGGA GTGATGCGAA TCTTGCCGGG
AGTTTGATTG GACAGAAACT GGCAATAAAT GAATTTGTCG CTTATCTCAA TTTCTCACCC
TATCTGCAAA CGGCTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC CTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG
CCACACCGTG CGCCGGAAAT CGCCCAGCTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCCAC CATTGCCGGG TTCTTTATTG GTTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFE HASLESIVGY
LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTAGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA