Gene EcolC_1484 details


Gene Information

Locus tag: EcolC_1484
Symbol:
ID: 6067149
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1637840
End bp: 1639090
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 51%
IMG OID: 641600904
Product: nucleoside transporter
Protein accession: YP_001724474
Protein GI: 170019520
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
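
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single terminal stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1637840, 1639090)
print(length)                     # 1251
print(protein_length_aa(length))  # 416
```

Under these assumptions the computed values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.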


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.125327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000367127
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGT ACGGTTGGAG CCGCACTGCT GCTGCAAATC
GCTATTGGTG GCATCATGCT CTACTTCCCA CCGGGAAAAT GGGCAGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTTGGGC CGAAAATGGA TGTCCTGTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTTCCGG CGATTATTTT CGTTACTGCG CTCATCAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCTCTTAA CATCAGCAAA
ATCGAATCTT TTGTCGCAGT CACAACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG
ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTAG CGGCATCGCT GATGGCGATC CCAGGTGGTA TTTTGTTTGC ACGTATTCTT
AGCCCGGCCA CCGAGCCTTC GCAGGTCACA TTTGAAAATC TGTCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTGAA AATCGCCGCT
GGTGTAGCGA CGGTGGTAAT GGCGTTTGTC GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GTGGCTGGTT TGGTTTCGCC AATGCCTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA TCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GCGCATTTTC GGCTATTTCG
CCAAAACGTG CTCCAGAAAT CGCCCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGTGCGAC TATTGCCGGG TTCTTTATTG GTCTGGCGTA A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
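
The protein sequence can be re-derived from the gene sequence, and the GC content recomputed. The following is a minimal sketch, assuming the whitespace-grouped sequence is pasted in as a plain string; `clean`, `translate`, and `gc_percent` are hypothetical helper names. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the standard codon table is used here.

```python
from itertools import product

# Standard codon table in TCAG order; the amino-acid assignments are the
# same for translation table 11.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Strip the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGACATAA TGAGAAGTGT TGTGGGGATG"))  # MDIMRSVVGM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) is a quick way to confirm that the two blocks belong together.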