Gene EcDH1_1494 details


Gene Information

Locus tag: EcDH1_1494
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1619524
End bp: 1620774
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: nucleoside transporter
Protein accession: ACX39164
Protein GI: 260448742
COG category:
COG ID:
TIGRFAM ID:
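
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, not stated in the record) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1619524
end_bp = 1620774
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(span % 3 == 0)                                 # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the span is 1251 bp, which is 417 codons, giving 416 residues plus one stop codon, in agreement with the listed Gene Length and Protein Length.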


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.432303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGC ACGGTTGGAG CCGCACTGCT GCTGCAAATC
GCTATTGGTG GCATCATGCT CTACTTCCCA CCGGGAAAAT GGGCAGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTTGGGC CGAAAATGGA TGTCCTGTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTTCCGG CGATTATTTT CGTTACTGCG CTCATCAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCCCTCAA CATCAGCAAA
ATCGAATCTT TTGTTGCGGT TACTACTATT TTCCTCGGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG
ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTAG CGGCATCGCT GATGGCGATC CCTGGCGGGA TTTTGTTTGC ACGTATTCTT
AGCCCGGCAA CCGAGCCTTC GCAGGTCACA TTTGAAAATC TGTCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CCGGGCTAAA AATCGCCGCT
GGTGTGGCGA CGGTGGTAAT GGCGTTTGTC GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GCGGCTGGTT TGGTTTCGCC AATGCCTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA TCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GCGCATTTTC GGCTATTTCG
CCAAAACGCG CGCCGGAAAT CGCCCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGTGCGAC TATTGCCGGG TTCTTTATTG GTCTGGCGTA A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA
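
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the snippet below strips the block spacing and translates codons until the first stop codon. The helper names `clean` and `translate` are hypothetical. The codon table is built from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code for amino acid assignments (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest (NCBI layout).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGATATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG"
print(translate(clean(first_line)))  # MDIMRSVVGMVVLLAIAFLL
```

The printed residues correspond to the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `translate` applies the same procedure to the whole CDS.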