Gene EcSMS35_2311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2311 
Symbol 
ID6143241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2341742 
End bp2342992 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID641617185 
ProductNupC family nucleoside transporter 
Protein accessionYP_001744358 
Protein GI170681919 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0705452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.000255025 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTTTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGC ACGGTTGGAG CCGCGTTGCT GCTACAAATC
GCCATTGGCG GAATCATGCT CTATTTTCCA CCAGGAAAAT GGGCGGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTTGGGC CGAAAATGGA TGTCCTCTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTCCCGG CGATTATCTT CGTCACTGCG CTCATTAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCCCTCAA CATCAGCAAA
ATCGAATCTT TTGTTGCGGT TACCACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG
ATGGCGTCCA TTGCGGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTAG CGGCATCGCT GATGGCGATC CCTGGCGGGA TTTTGTTTGC ACGTATTCTT
AGCCCGGCAA CCGAGCCTTC GCAGGTCACA TTTGAAAATT TATCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT
GGTGTGGCGA CGGTGGTAAT GGTATTTGTG GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GCGGCTGGTT TGGCTTCGCC AATGCTTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA CCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCAATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GGGCATTTTC GGCTATTTCG
CCAAAACGCG CACCGGAAAT CGCTCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGCGCGAC CATTGCCGGG TTCTTTATTG GTTTAGCTTG A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMVFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA