Gene EcHS_A2297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2297 
Symbol 
ID5591814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2295071 
End bp2296321 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID640921425 
ProductNupC family nucleoside transporter 
Protein accessionYP_001458961 
Protein GI157161643 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GGATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG
GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG
ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC
GTCAAACCCT TTATCGATCG TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC
GGTGTAGCGA CAGTTGTTAT GGCATTTGTC GCCATCATTG CGTTAATTAA TGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGCA CATGCCTCGC TGGAGTCCAT TTTAGGTTAC
CTGTTGGCCC CATTGGCGTG GGTGATGGGG GTTGACTGGA GTGATGCAAA TCTTGCCGGG
AGTTTGATTG GGCAGAAGCT GGCGATCAAT GAATTTGTCG CTTATCTCAA TTTCTCGCCA
TATCTGCAAA CGGGTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC TTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG
CCACACCGTG CGCCGGAAAT CGCCCAACTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCTAC TATTGCAGGA TTCTTTATTG GTTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFA HASLESILGY
LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTGGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA