Gene EcDH1_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3987 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4306938 
End bp4308527 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content56% 
IMG OID 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionACX41587 
Protein GI260451165 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000011742 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACAAC GTCGTCCAGT CCGCCGCGCT CTGCTCAGTG TTTCTGACAA AGCCGGTATC 
GTCGAATTCG CCCAGGCACT TTCCGCACGC GGTGTGGAGC TGCTGTCTAC AGGGGGCACT
GCCCGTCTGT TAGCAGAAAA AGGTCTGCCG GTAACCGAAG TTTCCGATTA CACCGGTTTC
CCGGAGATGA TGGATGGACG CGTGAAGACC CTGCATCCGA AAGTACATGG TGGCATTCTG
GGCCGTCGCG GCCAGGACGA TGCCATTATG GAAGAACATC AGATCCAGCC TATCGATATG
GTGGTTGTTA ACCTGTATCC GTTCGCCCAG ACCGTGGCCC GTGAAGGTTG CTCGCTGGAA
GATGCGGTTG AGAACATCGA TATCGGCGGC CCAACGATGG TGCGCTCCGC CGCCAAGAAC
CATAAAGATG TCGCAATCGT GGTGAAGAGC AGCGACTATG ACGCCATTAT TAAAGAGATG
GATGACAACG AAGGATCGCT GACGCTTGCA ACCCGTTTCG ACCTCGCCAT CAAAGCCTTC
GAACACACTG CCGCCTACGA CAGCATGATT GCCAACTACT TCGGCAGCAT GGTTCCGGCT
TACCACGGTG AAAGCAAAGA AGCCGCCGGT CGCTTCCCAC GCACGCTGAA CCTGAACTTC
ATTAAGAAGC TGGATATGCG TTACGGCGAG AACAGCCACC AGCAGGCTGC CTTCTATATA
GAAGAGAATG TGAAAGAAGC CTCCGTTGCT ACCGCAACCC AGGTTCAGGG TAAAGCCCTC
TCTTATAACA ACATCGCCGA TACCGATGCG GCGCTGGAGT GCGTGAAAGA GTTCGCCGAG
CCGGCATGTG TGATTGTGAA GCACGCCAAC CCTTGCGGCG TGGCTATCGG CAATTCCATT
CTTGATGCTT ACGATCGCGC GTACAAAACC GACCCAACCT CCGCATTCGG CGGCATCATT
GCCTTTAACC GCGAGCTGGA TGCGGAAACC GCACAGGCCA TCATTTCTCG TCAGTTTGTC
GAAGTGATTA TTGCGCCGTC CGCCAGCGAA GAAGCCCTGA AAATCACCGC CGCCAAACAG
AACGTACGCG TTCTGACCTG CGGTCAGTGG GGCGAGCGTG TTCCGGGCCT CGATTTCAAA
CGCGTGAACG GCGGTCTGCT GGTTCAGGAT CGTGACCTGG GGATGGTCGG TGCGGAAGAA
CTGCGCGTGG TGACCAAACG TCAGCCGAGC GAACAGGAAC TGCGTGATGC GCTGTTCTGC
TGGAAGGTGG CGAAGTTTGT GAAATCCAAC GCTATCGTCT ATGCCAAAAA CAATATGACT
ATCGGCATTG GCGCGGGCCA GATGAGCCGC GTGTACTCCG CAAAAATCGC CGGTATTAAA
GCGGCCGATG AAGGCCTGGA AGTGAAAGGT TCCTCGATGG CTTCTGACGC GTTCTTCCCG
TTCCGCGACG GTATTGATGC CGCCGCCGCT GCGGGCGTGA CCTGCGTAAT CCAGCCTGGC
GGTTCTATCC GTGATGACGA AGTGATTGCC GCCGCCGACG AGCACGGTAT TGCGATGCTC
TTCACCGACA TGCGCCACTT CCGCCATTAA
 
Protein sequence
MQQRRPVRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 
PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAQ TVAREGCSLE
DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF
EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKLDMRYGE NSHQQAAFYI
EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGNSI
LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ
NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTKRQPS EQELRDALFC
WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP
FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH