Gene EcSMS35_2308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2308 
Symbol 
ID6142796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2338555 
End bp2339805 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID641617182 
ProductNupC family nucleoside transporter 
Protein accessionYP_001744355 
Protein GI170684054 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.903484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.000933793 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGGCAATTGC ATTTTTGCTG 
TCAGTAAACA AGAAGAGGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GCATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGCGTCC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGTGCAG GATTTATCTT TGGTTTCAGA
GTATTACCGG CAATTATCTT CGTCACCGCG CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATCT TAATTCGCAT TCTCGGCGGT ATCTTCCAGA AAGCATTAAA TATCAGCAAG
ATCGAGTCAT TCGTCGCGGT CACTACCATT TTCCTCGGGC AAAACGAAAT TCCGGCAATC
GTCAAACCCT TTATCGATCG GCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CACTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCT TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGTA TTATTGAGGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCTGCG
GGTGTGGCAA CAGTGGTGAT GGCATTTGTT GCCATCATTG CGTTAATTAA CGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGAG CAAGCCTCGC TGGAGTCCAT TTTAGGTTAT
CTGCTGGCCC CGCTGGCGTG GGTGATGGGG GTTGACTGGA GTGATGCAAA TCTTGCCGGG
AGTTTGATTG GGCAGAAGTT GGCGATCAAT GAATTTGTCG CTTACCTCAA TTTCTCACCT
TATCTGCAAA CAGCTGGCAC GCTCGATGCT AAAACCGTGG CGATTATTTC CTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTAGTGGTTG GGGCATTTTC TGCGGTTGCG
CCACACCGCG CGCCGGAAAT CGCCCAGCTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCCAC CATTGCAGGA TTCTTTATTG GCTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLAIAFLL SVNKKRISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFE QASLESILGY
LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTAGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA