Gene DET1417 details

Gene Information

Locus tag: DET1417
Symbol: purH
ID: 3229270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dehalococcoides ethenogenes 195
Kingdom: Bacteria
Replicon accession: NC_002936
Strand:
Start bp: 1287504
End bp: 1289045
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 54%
IMG OID: 637120977
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_182125
Protein GI: 57233807
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
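
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical helpers for illustration and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(1287504, 1289045)
residues = protein_length_aa(length)
print(length, residues)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1542 bp) and Protein Length (513 aa).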


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGCTA TCCTGAGCGT CTCAGATAAA ACCGGTCTTA TCGAATTCGC CAAAGGCTTG 
TCAGAACTGG GTTTTGATAT ATACAGCACC GGCGGAACCA AGAAATCACT CCAGCAGGCA
AATGTAACCG TTCACGGCAT TTCGGACATG ACCGGTTCGC CTGAAATACT GGACGGACGG
GTCAAAACTT TGCATCCCAA GGTACACGGC GGCATACTGG CCCGGCGTGA CCTGCCCGAA
CATATGGCCG AACTGGAAGA ACACCATATC CAACCCATTG ACATGGTGGT AGTCAATCTC
TACCCCTTTG TCAAGACTGT TTCCCGGCCG GATGTAAGCC TGACTGATGC ACTGGAGAAT
ATTGATATCG GCGGACCTAC CATGATACGG GCTTCCGCCA AGAACTTCCC CAGTGTGATT
GTGGTGGTAG ACCCTCAGGA TTACTCCCGT GTACTGGAAC ACCTTCAGGC AGGCACTCTG
AGCCTTGACG AACGCAAGAA ACTGGCCCAA AAGGCCTTCC AGCACGTAGC CATGTATGAT
ACGGCCATCT CCCAGTACCT CTGGCAGGGA GAAGAGGGTT TCCCCGAAAA TATGACCATA
GCCCTTTCCA AACGCTATGA CCTGCGTTAC GGTGAAAACC CCCACCAGCC GGCTGTTTTC
TATGCTGAAA ACAGGGTTGG ACAAGGGCAG GACAGCGGCA TTACCTGGGC GCAGCAGGTC
TGGGGCAAAC AGCTTTCCTT TAACAATATT CTGGACGCAG ACGCCGCCTG GGGAGCCGCC
ACTGACTTTG CGGCTGCCAC AGTAGCCATA GTCAAGCATA CCAATACCTG CGGTCTGGCC
AGTGACGAAA ACATTGCCGA AGCCTACAAG AAGGCCTTTT CGGGTGACCC CGTTTCGGCT
TACGGCGGTA TAGTAGCCTC CAACCGCAAA GTGACACTGT CCATGGCCGA AGCCATGAAG
GGTGTCTTTT ATGAAATCAT CATTGCCCCC GAATACGAAC CGGAGGCACT GGAATTCCTT
AAAACCCGCA AGGATTTGCG TATACTCATA GCCGAACTGC CCAAACATGC GGAAAACAAG
GCCGCTTCGC TGGATTACCG CCGGGTAAAA GGCGGGCTGC TGGTGCAGGC GGCTGATGAA
CTGGCCGAAG AGGCCCTTCA GACCAAGGTA GCCACCAACC GGGCACCCAC CGCTGAGGAA
ATGGCAGATT TGAAATTCGC CTGGCGGGCA GTCAAGCATA TTAAATCAAA CGCCATTGTC
CTGGCTAAAA ATAAAGTCCT GCTGGGAATG GGCGCAGGGC AACCCAACCG AGTAGTCAGC
GTAGACATTG CCAAGAGCAA GGCCGGTGAG GCCTCAAAGG GCAGTGTCAT GGCCTCAGAT
GCCATGTTTC CCTTCCCTGA CAGCGTTGAA CAGGCGGCTG CCGCCGGAGT AACCGCCATT
ATCCAGCCGG GCGGTTCTAT CCGTGACCAG GAATCTATTG ACGCTGCCAA CAAGTACAAT
ATAGCTATGG TATTTACCGG TACCCGCCAC TTCCGCCATT AG
 
Protein sequence
MRAILSVSDK TGLIEFAKGL SELGFDIYST GGTKKSLQQA NVTVHGISDM TGSPEILDGR 
VKTLHPKVHG GILARRDLPE HMAELEEHHI QPIDMVVVNL YPFVKTVSRP DVSLTDALEN
IDIGGPTMIR ASAKNFPSVI VVVDPQDYSR VLEHLQAGTL SLDERKKLAQ KAFQHVAMYD
TAISQYLWQG EEGFPENMTI ALSKRYDLRY GENPHQPAVF YAENRVGQGQ DSGITWAQQV
WGKQLSFNNI LDADAAWGAA TDFAAATVAI VKHTNTCGLA SDENIAEAYK KAFSGDPVSA
YGGIVASNRK VTLSMAEAMK GVFYEIIIAP EYEPEALEFL KTRKDLRILI AELPKHAENK
AASLDYRRVK GGLLVQAADE LAEEALQTKV ATNRAPTAEE MADLKFAWRA VKHIKSNAIV
LAKNKVLLGM GAGQPNRVVS VDIAKSKAGE ASKGSVMASD AMFPFPDSVE QAAAAGVTAI
IQPGGSIRDQ ESIDAANKYN IAMVFTGTRH FRH
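
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, assuming the 10-base blocks are separated only by whitespace and that the reading frame starts at the first base. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases or residues."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGGGCTA TCCTGAGCGT CTCAGATAAA"))
```

Applied to the first three blocks of the gene sequence, this yields MRAILSVSDK, the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the listed GC content.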