Gene EcDH1_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4005 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4322602 
End bp4323735 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content55% 
IMG OID 
Productthiazole biosynthesis protein ThiH 
Protein accessionACX41605 
Protein GI260451183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCT TCAGCGATCG CTGGCGACAA CTGGACTGGG ACGACATCCG CCTGCGTATC 
AACGGCAAAA CGGCTGCTGA CGTAGAGCGG GCGCTAAATG CCTCGCAACT CACCCGCGAC
GACATGATGG CGCTGTTATC GCCTGCCGCC AGTGGCTATC TGGAACAACT GGCCCAACGG
GCGCAGCGTC TGACCCGTCA GCGATTTGGC AACACAGTTA GTTTCTACGT CCCGCTTTAT
CTTTCCAATC TTTGCGCTAA CGACTGCACG TACTGTGGAT TTTCCATGAG TAATCGCATC
AAGCGCAAAA CGCTGGATGA AGCGGATATT GCCAGGGAAA GTGCCGCTAT ACGGGAGATG
GGCTTTGAAC ATCTGCTGTT AGTCACTGGT GAACATCAGG CGAAAGTGGG GATGGATTAC
TTTCGTCGTC ATCTCCCTGC CCTTCGTGAA CAGTTCTCTT CACTACAGAT GGAAGTGCAA
CCGCTGGCGG AGACGGAATA CGCCGAGTTA AAGCAACTTG GTCTGGATGG CGTGATGGTT
TATCAGGAGA CATATCACGA GGCGACTTAT GCCCGCCATC ATCTGAAAGG CAAAAAACAG
GACTTCTTCT GGCGGCTGGA AACGCCGGAT CGGCTGGGGC GTGCGGGGAT TGATAAGATA
GGCCTCGGCG CGCTAATTGG CCTTTCCGAC AACTGGCGCG TTGACAGCTA TATGGTTGCC
GAACATTTGC TATGGCTGCA ACAGCATTAC TGGCAAAGCC GTTACTCTGT CTCCTTTCCG
CGCCTGCGCC CGTGTACTGG CGGCATTGAG CCTGCGTCGA TTATGGATGA ACGCCAGTTA
GTGCAAACCA TCTGCGCCTT CCGACTGCTT GCACCGGAGA TTGAACTGTC ACTCTCCACG
CGGGAATCAC CGTGGTTTCG CGATCGCGTT ATTCCGCTGG CGATCAATAA CGTCAGCGCC
TTCTCGAAAA CGCAGCCAGG TGGCTATGCC GATAATCACC CCGAGTTGGA ACAGTTCTCA
CCGCACGACG ATCGCAGACC GGAAGCGGTT GCTGCCGCGT TAACCGCTCA GGGTTTGCAG
CCGGTATGGA AAGACTGGGA CAGCTATCTG GGACGCGCCT CGCAAAGACT ATGA
 
Protein sequence
MKTFSDRWRQ LDWDDIRLRI NGKTAADVER ALNASQLTRD DMMALLSPAA SGYLEQLAQR 
AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEADI ARESAAIREM
GFEHLLLVTG EHQAKVGMDY FRRHLPALRE QFSSLQMEVQ PLAETEYAEL KQLGLDGVMV
YQETYHEATY ARHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDSYMVA
EHLLWLQQHY WQSRYSVSFP RLRPCTGGIE PASIMDERQL VQTICAFRLL APEIELSLST
RESPWFRDRV IPLAINNVSA FSKTQPGGYA DNHPELEQFS PHDDRRPEAV AAALTAQGLQ
PVWKDWDSYL GRASQRL