Gene Ccel_2181 details


Gene Information

Locus tag: Ccel_2181
Symbol: purH
ID: 7310871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2553672
End bp: 2555216
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 41%
IMG OID: 643609113
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002506503
Protein GI: 220929594
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
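
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2553672
end_bp = 2555216

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1545
print(protein_length)  # 514
```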


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAGC GTGCATTAAT AAGTGTTTCA GACAAAACAG GTATTGTTGA GTTTGCATCT 
GCTCTGGCTT CCAAGGGTAT AGAGATAATT TCCACAGGAG GAACTGCAAA AGCTCTTTCA
GCTGCCGGGC TAAAGGTTAT AAACATATCT GACATAACAG GTTTCCCGGA ATGTCTTGAT
GGAAGGGTAA AAACTCTTCA CCCAAAAGTA CATGCAGGAC TTCTTGCAAT AAGAAGCAAC
GAGGAACACA TGAAGCAGAT AAAGGAACTG GGTGTTGAAA CAATTGACAT GGTAATAATA
AACCTTTATC CCTTCAAACA GACAATTTTA AAAGGCAATG TAGAACTGGA AGAAGCTATA
GAGAACATTG ACATAGGCGG TCCTACGATG CTTAGGGCAG CTGCTAAGAA CTATCAGGAT
GTTGCAGTTA TTGTTGATCC TGCGGATTAT AAAAATGTAC TGAATGAAAT GAACGAATCT
GGAGATGTCA GCGTTAAGAC CAAATTCAGA CTGGCCTACA AGGTTTTTGA ACATACAAGT
CATTATGATA CATTGATTGC AAAATATCTG AGAGACACTC TTGGAGATAT AGATTTCCCT
GAAACACTTT CACTAACATA TGAAAAGGCT CAGGATATGC GTTACGGCGA AAACCCACAT
CAAAAAGCAG TATTCTATAA GGAAGTCGGA GCAAACACAG GACTTCTGCC AAGTGCAGTA
CAACTTCACG GTAAAGAACT TTCCTTCAAT AATATAAATG ATACTAACGG TGCTATAGAG
CTTGTCAAGG AATTTGACGA GCCGACAGTT GTTGCTGTAA AACATACTAA TCCTTGCGGC
GTCGGCAGTG CAGACAATAT ATATGACGCT TATATGAGAG CATATGAATC TGATCCTGTA
TCAATATTCG GCGGAATTAT TGCTGCAAAC AGAGAAATTG ACGCTAAGAC AGCTGAAGAA
ATCAACAAGA TATTTGTAGA AATAGTTGTT GCACCTTCTT TTACGGAAGA TGCACTTGCC
GTTTTGACGC AAAAGAAGAA TGTCAGACTT CTTAAACTGG AGAATATCAC TGACGAGATT
TCACCTGATG CATATGACAT GAAAAAGGTT GCAGGAGGTC TGCTGGTACA GAAGTACAAC
AGCCAGCTGT TTAATCAGGA AGACCTGAAA TGTGTAACAG ATGTACAGCC TACAAAGGAA
CAGATGGAAG ACCTTGTTTT TGCAATGAAG GTTGTTAAGC ACACCAAATC AAATGCAATT
ACTCTTGCAA AGGGCAAGAT GACTATTGGT GTGGGCCCGG GTCAGACAAA CAGAATAGTT
CCCACGAAGG TATCCATTGA GTATGCAGGA GAGAGATCAC AGGGAGCTGT AATGGCATCA
GATGCTTACT TTCCGTTCTC AGATTGCGTT GAAGCTGCTG CTGCTGCGGG TATTAAGGCT
ATTATACAAC CCGGCGGTTC AATAAGAGAT CAGGAATCAA TAGATGCATG CAATAAATAC
GGAATCGCAA TGGTGTTTAC AGGAATGAGA CACTTTAAGC ACTAA
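
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used; pasting the full sequence would allow a comparison with the GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence only, for brevity.
first_row = "ATGATTAAGC GTGCATTAAT AAGTGTTTCA GACAAAACAG GTATTGTTGA GTTTGCATCT"
seq = clean_sequence(first_row)

print(len(seq))                        # 60
print(round(gc_fraction(seq) * 100))   # GC percentage of this row only
```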
 
Protein sequence
MIKRALISVS DKTGIVEFAS ALASKGIEII STGGTAKALS AAGLKVINIS DITGFPECLD 
GRVKTLHPKV HAGLLAIRSN EEHMKQIKEL GVETIDMVII NLYPFKQTIL KGNVELEEAI
ENIDIGGPTM LRAAAKNYQD VAVIVDPADY KNVLNEMNES GDVSVKTKFR LAYKVFEHTS
HYDTLIAKYL RDTLGDIDFP ETLSLTYEKA QDMRYGENPH QKAVFYKEVG ANTGLLPSAV
QLHGKELSFN NINDTNGAIE LVKEFDEPTV VAVKHTNPCG VGSADNIYDA YMRAYESDPV
SIFGGIIAAN REIDAKTAEE INKIFVEIVV APSFTEDALA VLTQKKNVRL LKLENITDEI
SPDAYDMKKV AGGLLVQKYN SQLFNQEDLK CVTDVQPTKE QMEDLVFAMK VVKHTKSNAI
TLAKGKMTIG VGPGQTNRIV PTKVSIEYAG ERSQGAVMAS DAYFPFSDCV EAAAAAGIKA
IIQPGGSIRD QESIDACNKY GIAMVFTGMR HFKH
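
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, the alternative starts are not handled. The function name `translate` is illustrative.

```python
# Codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATTAAGC GTGCATTAAT AAGTGTTTCA"))  # MIKRALISVS
```

Translating the last 15 bases of the gene (`CACTTTAAGC ACTAA`) in the same way yields `HFKH`, matching the end of the protein sequence.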