Gene ECH74115_3300 details


Gene Information

Locus tag: ECH74115_3300
Symbol:
ID: 6969084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3032375
End bp: 3033625
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 51%
IMG OID: 643387112
Product: nucleoside transporter, NupC family
Protein accession: YP_002271576
Protein GI: 209397477
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
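
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not count toward the protein length. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3032375
end_bp = 3033625
stated_gene_length = 1251     # bp
stated_protein_length = 416   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == stated_gene_length)
print(protein_length, protein_length == stated_protein_length)
```

Under these assumptions both computed values agree with the stated Gene Length and Protein Length.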


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00807637
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.0252399
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGT ACGGTTGCAG CCGCACTACT GCTGCAAATT
GCTCTTGGTG GCATCATGCT CTATTTTCCG CCGGGAAAGT GGGCTGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCTGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTCGGGC CGAAAATGGA TGTCCTCTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTTCCGG CGATTATCTT CGTCACTGCG CTCATTAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCCCTCAA CATCAGCAAA
ATCGAATCTT TTGTCGCAGT CACCACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGA AACGAGTTGT TTACGGCTAT TTGTAGCGGG
ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTGG CGGCATCGCT GATGGCGATC CCCGGCGGTA TTTTGTTTGC ACGTATTCTT
AGTCCAGCAA CCGAGCTTTC GCAGGTCACG TTTGAAAATC TGTCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC AGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT
GGTGTGGCGA CGGTGGTAAT GGCATTTGTG GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GCGGCTGGTT TGGCTTCGCC AATGCTTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA CCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GGGCATTTTC GGCTATTTCG
CCAAAACGCG CACCGGAAAT CGCTCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGCGCGAC CATTGCCGGG TTCTTTATTG GTCTGGCGTA A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVAAALLLQI ALGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATELSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
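
The two listings above are related through the record's translation table (11). The following is a minimal sketch, not the database's own method, of how the grouped gene sequence can be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The sketch relies on the fact that NCBI translation table 11 assigns codons to amino acids exactly as the standard code does and differs only in which codons may initiate translation; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Codon assignments in NCBI order (bases T, C, A, G). Table 11 shares these
# assignments with the standard code; only the permitted start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C (dna must be non-empty)."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence listing. The last line starts at
# base 1201, so it is in frame with the start codon.
first_line = clean_sequence(
    "ATGGACATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG"
)
last_line = clean_sequence(
    "TCCAACCTGA TGAGCGCGAC CATTGCCGGG TTCTTTATTG GTCTGGCGTA A"
)

print(translate(first_line))  # MDIMRSVVGMVVLLAIAFLL
print(translate(last_line))   # SNLMSATIAGFFIGLA
```

These two outputs match the first 20 and last 16 residues of the protein sequence listing. Passing the complete gene sequence through `clean_sequence` and then `gc_fraction` or `translate` allows the same comparison against the GC content and Protein sequence fields of the record.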