Gene Dhaf_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_1478 
Symbol 
ID7258447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp1573661 
End bp1575217 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content53% 
IMG OID643561386 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_002457966 
Protein GI219667531 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones53 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCGTC GAGCTGTGCT CAGTGTCTCC AATAAAACAG GTCTTGTGGA GCTTGCCCGA 
GGACTTGTGG AATTGGGCTT TGACTTGATT TCTACCGGCG GCACCTTTAA AACGTTGACC
GAAGCGGGGC TGCCTGTTCG CTATGTTACC GAGGTCACGG GATTTCCGGA GATTCTGGAT
GGGCGGGTCA AGACCCTTCA TCCCAGGATT CATGGGGGTA TTTTGGCCAG GGCTACGGCA
GAGCATTTGC AGCAGCTGGA GGACAATGGC ATCGGGTTGA TCGATCTTGT GGTGGTCAAT
CTCTATCCCT TTAAGGAGAC CATTGCCAGG CCGGGGGTTT CGTTCCAGGA GGCTATTGAA
AATATCGATA TTGGCGGTCC TTCCATGGTT CGTGCGGCGG CAAAGAATCA GGAGCGGGTG
AGTATCGTCG TCAATCCGGA GCGGTACCCG GAGGTGCTTC AGGCCCTGCG TGAGCAAGGG
GAAATCTCTT ATGATATGCG TAAACGTTTG GCGGCAGAGG CCTTTGCCCA TACAGCCGAA
TATGATCAAT GCATTGCCGG GTATTTGACT GCCGCACTTG CTGAGGAATC CGTTTCCTCT
TCTTCTTCAC CTTTCCCTGC AACCATAACA CTTGGGGGCC AAAAGGCTCA GGATCTTCGC
TATGGGGAAA ACCCTGCTCA GAAGGCGGCC TTTTACCGGG GGGCGGATGC AGCGGGCACC
TTGGCCTATG GTGAACAGAT TCAGGGTAAA GAATTATCCT ATAACAATTG GATGGATATG
GACGCGGCCT GGGGGATTGT TCAGGATTTC AGTGAGCCGG CCTGTGCTAT TATTAAGCAT
ACCAATCCCT GCGGTACAGC CTTGGGGAAA ACTGCTTTGG AAGCTTATGA AAAGGCCCTG
GCAGCGGACC CGGTCTCGGC CTTTGGCGGA ATTATTGCCT TTAACCGGAC CGTCGATGCT
GAATGTGCCG CCTCACTTAA GGCTCACTTC TATGAAGTTA TCGTTGCCCA TGAGTTCAGC
TCTGACGCCA GGGCAATACT ACAGGAAAAG AAAAACCTTC GTCTCGTCAA AGTAGCACAG
GACGGGAAGC CAGCCCATAC GCCCTGGAAA GTTCGTTCCA TTCAAGGAGG ATTTCTAATT
CAGGAAGAGG ATGAGGGGAC TACGCCGATC TCCGCATGGG AAGTCGTCAG CAAGCGCCAA
CCTGAACCTG AAGAACTTCG TGAACTGGAC TTTGCCTGGC GGGTGGTAAA GCATGTTAAA
TCCAATGCCA TTGTACTGGC CAAAGCCGGT CAAACCCTTG GCGTGGGAGC GGGACAGATG
AATCGGGTTG GCTCAGTTAA GATTGCTTTA GAACAGGCGG GGGATAAAGC CCAAGGGGCT
TATCTGGCCT CCGATGCTTT TTTCCCATTC CCCGATTCCC TGGAGGAGGC GGCTAAGGCA
GGAGTGCGGG CTGTGGTTCA ACCGGGGGGC TCCGTCAGAG ATGCTGAGGT TATCGAAGCG
GCTGACCGTT TGAATTTGAT TATGGTGTTT ACGAACCGCC GTCACTTTAA GCACTGA
 
Protein sequence
MNRRAVLSVS NKTGLVELAR GLVELGFDLI STGGTFKTLT EAGLPVRYVT EVTGFPEILD 
GRVKTLHPRI HGGILARATA EHLQQLEDNG IGLIDLVVVN LYPFKETIAR PGVSFQEAIE
NIDIGGPSMV RAAAKNQERV SIVVNPERYP EVLQALREQG EISYDMRKRL AAEAFAHTAE
YDQCIAGYLT AALAEESVSS SSSPFPATIT LGGQKAQDLR YGENPAQKAA FYRGADAAGT
LAYGEQIQGK ELSYNNWMDM DAAWGIVQDF SEPACAIIKH TNPCGTALGK TALEAYEKAL
AADPVSAFGG IIAFNRTVDA ECAASLKAHF YEVIVAHEFS SDARAILQEK KNLRLVKVAQ
DGKPAHTPWK VRSIQGGFLI QEEDEGTTPI SAWEVVSKRQ PEPEELRELD FAWRVVKHVK
SNAIVLAKAG QTLGVGAGQM NRVGSVKIAL EQAGDKAQGA YLASDAFFPF PDSLEEAAKA
GVRAVVQPGG SVRDAEVIEA ADRLNLIMVF TNRRHFKH