Gene EcE24377A_2458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_2458 
Symbol 
ID5588385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp2441183 
End bp2442433 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content50% 
IMG OID640926118 
ProductNupC family nucleoside transporter 
Protein accessionYP_001463513 
Protein GI157158606 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGGCAATTGC ATTTTTGCTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTT
GTGATTGGCG GCATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGA
GTATTACCGG CAATTATCTT CGTTACAGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATCTTCCAGA AAGCGTTAAA TATCAGCAAG
ATCGAGTCAT TCGTCGCGGT CACTACCATT TTCCTCGGGC AAAACGAAAT TCCGGCAATC
GTCAAACCCT TTATCGATCG TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC
GGTGTAGCGA CAGTTGTTAT GGCATTTGTC GCCATCATTG CGTTAATTAA TGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGCA CATGCCTCGC TGGAGTCCAT TTTAGGTTAC
CTGTTGGCCC CATTGGCGTG GGTGATGGGG GTTGACTGGA GTGATGCAAA TCTTGCCGGG
AGTTTGATTG GGCAGAAGCT GGCGATCAAT GAATTTGTCG CTTATCTCAA TTTCTCGCCA
TATCTGCAAA CGGGTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC TTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG
CCACACCGTG CGCCGGAAAT CGCCCAACTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCTAC TATTGCAGGA TTCTTTATTG GTTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLAIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GVATVVMAFV AIIALINGII GGVGGWFGFA HASLESILGY
LLAPLAWVMG VDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTGGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA