Gene EcDH1_3133 details


Gene Information

Locus tag: EcDH1_3133
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 3369526
End bp: 3370830
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: Inosine kinase
Protein accession: ACX40759
Protein GI: 260450337
COG category:
COG ID:
TIGRFAM ID:
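
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although the listed gene length agrees with it) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 3369526
END_BP = 3370830


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded, assuming one terminal stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Both results match the Gene Length and Protein Length fields: 1305 bp is 435 codons, of which the last is the stop codon.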


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.838114
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTC CCGGTAAACG TAAATCCAAA CATTACTTCC CCGTAAACGC ACGCGATCCG 
CTGCTTCAGC AATTCCAGCC AGAAAACGAA ACCAGCGCTG CCTGGGTAGT GGGTATCGAT
CAAACGCTGG TCGATATTGA AGCGAAAGTG GATGATGAAT TTATTGAGCG TTATGGATTA
AGCGCCGGGC ATTCACTGGT GATTGAGGAT GATGTAGCCG AAGCGCTTTA TCAGGAACTA
AAACAGAAAA ACCTGATTAC CCATCAGTTT GCGGGTGGCA CCATTGGTAA CACCATGCAC
AACTACTCGG TGCTCGCGGA CGACCGTTCG GTGCTGCTGG GCGTCATGTG CAGCAATATT
GAAATTGGCA GTTATGCCTA TCGTTACCTG TGTAACACTT CCAGCCGTAC CGATCTTAAC
TATCTACAAG GCGTGGATGG CCCGATTGGT CGTTGCTTTA CGCTGATTGG CGAGTCCGGG
GAACGTACCT TTGCTATCAG TCCAGGCCAC ATGAACCAGC TGCGGGCTGA AAGCATTCCG
GAAGATGTGA TTGCCGGAGC CTCGGCACTG GTTCTCACCT CATATCTGGT GCGTTGCAAG
CCGGGTGAAC CCATGCCGGA AGCAACCATG AAAGCCATTG AGTACGCGAA GAAATATAAC
GTACCGGTGG TGCTGACGCT GGGCACCAAG TTTGTCATTG CCGAGAATCC GCAGTGGTGG
CAGCAATTCC TCAAAGATCA CGTCTCTATC CTTGCGATGA ACGAAGATGA AGCCGAAGCG
TTGACCGGAG AAAGCGATCC GTTGTTGGCA TCTGACAAGG CGCTGGACTG GGTAGATCTG
GTGCTGTGCA CCGCCGGGCC AATCGGCTTG TATATGGCGG GCTTTACCGA AGACGAAGCG
AAACGTAAAA CCCAGCATCC GCTGCTGCCG GGCGCTATAG CGGAATTCAA CCAGTATGAG
TTTAGCCGCG CCATGCGCCA CAAGGATTGC CAGAATCCGC TGCGTGTATA TTCGCACATT
GCGCCGTACA TGGGCGGGCC GGAAAAAATC ATGAACACTA ATGGAGCGGG GGATGGCGCA
TTGGCAGCGT TGCTGCATGA CATTACCGCC AACAGCTACC ATCGTAGCAA CGTACCAAAC
TCCAGCAAAC ATAAATTCAC CTGGTTAACT TATTCATCGT TAGCGCAGGT GTGTAAATAT
GCTAACCGTG TGAGCTATCA GGTACTGAAC CAGCATTCAC CTCGTTTAAC GCGCGGCTTG
CCGGAGCGTG AAGACAGCCT GGAAGAGTCT TACTGGGATC GTTAA
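
The gene sequence is displayed in blocks of 10 bases separated by spaces, so whitespace has to be stripped before anything is computed from it. A minimal sketch of a GC-content calculation follows. The names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of the database, and the rounding used for the GC content figure in the record is not stated, so the sketch returns an unrounded percentage.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, as displayed.
example = "ATGAAATTTC CCGGTAAACG TAAATCCAAA"
print(len(clean_sequence(example)))  # 30
print(gc_percent(example))
```

Pasting the full sequence into `gc_percent` in place of `example` gives the GC content of the whole gene.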
 
Protein sequence
MKFPGKRKSK HYFPVNARDP LLQQFQPENE TSAAWVVGID QTLVDIEAKV DDEFIERYGL 
SAGHSLVIED DVAEALYQEL KQKNLITHQF AGGTIGNTMH NYSVLADDRS VLLGVMCSNI
EIGSYAYRYL CNTSSRTDLN YLQGVDGPIG RCFTLIGESG ERTFAISPGH MNQLRAESIP
EDVIAGASAL VLTSYLVRCK PGEPMPEATM KAIEYAKKYN VPVVLTLGTK FVIAENPQWW
QQFLKDHVSI LAMNEDEAEA LTGESDPLLA SDKALDWVDL VLCTAGPIGL YMAGFTEDEA
KRKTQHPLLP GAIAEFNQYE FSRAMRHKDC QNPLRVYSHI APYMGGPEKI MNTNGAGDGA
LAALLHDITA NSYHRSNVPN SSKHKFTWLT YSSLAQVCKY ANRVSYQVLN QHSPRLTRGL
PEREDSLEES YWDR
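
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the NCBI amino-acid string for translation table 11 (which assigns the same amino acids as the standard code and differs only in permitted start codons). The helper name `translate` is hypothetical, and the sketch handles only unambiguous A/C/G/T codons read in frame 1 of the sequence as given.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G at each position.
BASES = "TCAG"
# Amino acids for translation table 11, in NCBI codon order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate frame 1 of a DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence.
print(translate("ATGAAATTTC CCGGTAAACG TAAATCCAAA"))  # MKFPGKRKSK
print(translate("TACTGGGATC GTTAA"))                  # YWDR
```

The two printed fragments match the first and last blocks of the protein sequence, and the final codon TAA is the stop codon that ends the 434-residue product.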