Gene Jann_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_4042 
SymbolpurH 
ID3936530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp4145097 
End bp4146695 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content64% 
IMG OID637906427 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_511984 
Protein GI89056533 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.809349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC CTGCCCCCCT CACCCGCGCG CTTCTGTCTG TCTCTGACAA GACCGGATTG 
ATTGAGTTCG CCACCGACCT GTCGTCCCGT GGGGTGGAGC TTCTGTCCAC CGGTGGCACC
GCGAAGGCCT TGCGCGAAGC CGGTCTGGAT GTGCGCGATG TGAGCGAGGT CACGGGCTTC
CCGGAGATGA TGGACGGACG GGTGAAGACA TTGCATCCGA TGATCCACGG CGGCCTGCTG
GCCCTGCGCG ACAACGACGC CCATGTCGCG GCGATGAAAG AGCACGGGAT CGGCGCGATC
GATCTGCTGG TGGTGAACCT TTACCCGTTT GAGGCGACGG TGGCCGCAGG CGCGGATTAT
GACACCTGTA TCGAGAATAT CGATATCGGC GGCCCCGCGA TGATCCGGGC CGCCGCGAAA
AACCATGGCG CGGTGACGGT GCTGACGGAC TCCTCCCAAT ACGCCGGGTT GTTGAGCGAG
CTGGATGCTA ATGCAAACAA GGGCGGTGGC ACGTCGTTCC AGTTCCGCCA GCGGATGGCG
CAGGCGGCTT ACGGGCGCAC GGCGGCCTAT GACGCGGCGG TCTCCAGTTG GATGGCAGGT
GCGGCTGAGA TCAAAACTCC GCCACACCGC GCGTTCGCGG GCAGCCTCGC ACAGGAAATG
CGCTACGGCG AGAACCCACA CCAGAAAGCC GCATTCTATC TTGATGGATC GTCGCGCCCG
GGCGTGGCCA CAGCGCAGCA ACATCAGGGC AAAGCGCTGA GCTACAACAA CATCAATGAC
ACGGACGCCG CGTTTGAGCT GGTCTCGGAA TTCGCGCCCG ACGATGGCCC GGCGGTTGCG
ATCATCAAGC ACGCCAACCC GTCGGGCGTG GCCCGCGGCA ACAGCCTTGC AGAGGCCTAC
AAAGCCGCGT TCGATTGCGA CCGCACCAGC GCGTTTGGTG GCATCGTCGC ATTGAACCAG
ACGTTGGACG CTGCCACGGC AGAAGAGATC GTGCAGATCT TCACAGAGGT GGTGATCGCG
CCCGACGCTG ACGAGGATGC CAAGGCGATC TTCGCCGCCA AGAAAAACCT GCGCCTGCTG
ACAACCGGCG GCCTGCCGGA CCCGCGCGCG CCAATGGTCG CCTACAAGCA GGTCGCGGGC
GGATTGTTGG TGCAGGACAA GGACACCGGC CATGTGGACC CGGAGTTGCT GGAGGTCGTG
ACCAAACGCG CGCCGTCAGC CCAGGAATTG GCCGATCTGC GCTTTGCCTG GACCGTGGCG
AAACACACGA AATCCAACGC GATCATCTAC GCCAAGGGCG GTGCAACCGT GGGCATCGGC
GCGGGGCAGA TGAGCCGCGT GGACAGCTCC ACCATCGCAG CGTTGAAGGC GGCACGGATG
GGCACGGAAT GCGGAATGGC TGACACGCCC GCTAAGGGAT CGGTCGTGGC GTCGGACGCG
TTCTTCCCGT TTGCGGACGG CTTGCTGGCG GCGGCAGAGG CAGGTGCCAC GGCGGTGATC
CAGCCCGGCG GCTCCATGCG TGACGCAGAT GTGATCGCTG CCGCCGACGA GGCAGGCCTG
GCCATGGTCT TCACCGGCAT GCGCCATTTC CGGCACTGA
 
Protein sequence
MTDPAPLTRA LLSVSDKTGL IEFATDLSSR GVELLSTGGT AKALREAGLD VRDVSEVTGF 
PEMMDGRVKT LHPMIHGGLL ALRDNDAHVA AMKEHGIGAI DLLVVNLYPF EATVAAGADY
DTCIENIDIG GPAMIRAAAK NHGAVTVLTD SSQYAGLLSE LDANANKGGG TSFQFRQRMA
QAAYGRTAAY DAAVSSWMAG AAEIKTPPHR AFAGSLAQEM RYGENPHQKA AFYLDGSSRP
GVATAQQHQG KALSYNNIND TDAAFELVSE FAPDDGPAVA IIKHANPSGV ARGNSLAEAY
KAAFDCDRTS AFGGIVALNQ TLDAATAEEI VQIFTEVVIA PDADEDAKAI FAAKKNLRLL
TTGGLPDPRA PMVAYKQVAG GLLVQDKDTG HVDPELLEVV TKRAPSAQEL ADLRFAWTVA
KHTKSNAIIY AKGGATVGIG AGQMSRVDSS TIAALKAARM GTECGMADTP AKGSVVASDA
FFPFADGLLA AAEAGATAVI QPGGSMRDAD VIAAADEAGL AMVFTGMRHF RH