Gene Veis_3025 details


Gene Information

Locus tag: Veis_3025
Symbol: purH
ID: 4691935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 3379058
End bp: 3380662
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 68%
IMG OID: 639850783
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_997776
Protein GI: 121609969
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
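
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3379058
end_bp = 3380662

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under those assumptions the two computed values agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields.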


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.687258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCAC TCCTGTCCGT CTCCGACAAG ACCGGCATCG TCGAATTCGC GCAAGCCCTG 
CACGCGCTGG GCATAGGGCT GCTGTCCACC GGCGGCACGG CCAAGCTGCT GGCTGGCCAG
GGTCTGCCGG TGACCGAGGT GGCCGAACTG ACGCAATGGC CCGAAATGCT CGACGGCCGC
GTCAAGACGC TGCACCCCAA GGTGCACGCC GGCCTGCTCG CCCGCCGTGA ACGGCCCGGG
CATATGGCGG CCCTGAAGGA GCATGGCATA GCCACCATCG ACCTGCTGGT GGTCAACCTG
TACCCGTTCG AAGCCACCGT GGCCCAGGCC GCTTGCACGC TGGCCGAGGC CGTGGAGAAC
ATCGACATCG GCGGCCCGGC GATGGTGCGC AGCGCGGCCA AGAACTGGCA GCATGTGGGC
GTGCTGACCG ACGCCGCGCA GTACCCGTCC GTGCTGGCCG AGCTCCGCGC CAACGGCACA
TTGTCCGACC CGCTGCGCTT TGCGCTGTCG GTGGCGGCGT TCAACCGCAT TGCGCAGTAC
GACGCTGCGA TCAGCCACTA CCTGTCGTCG CTGCGGTTCG AGGCCGACCG TTGCATCGAC
GACAGCGCGG TGCCGGCGCG CATGCAGTTC CCCGGCCAGA GCAACGCCAT CTTCAGCAAG
GTGCAAGACC TGCGCTATGG CGAGAACGCG CACCAGCAGG CCGCGCTGTA TCGCGAACTG
CACCCGGCCC CCGGCTCCCT GGTCACGGCC GAGCAATTGC AGGGCAAGGA GCTGTCCTAC
AACAACCTGG CCGATGCCGA TGCCGCCTGG GAATGCGTCA AGAGCTTCGA CGCTGCGGCC
TGCGTGATCG TCAAGCACGC CAACCCCTGC GGCGTGGCGC TGGGCCTGGA CGCAGCGAGC
GCCTACCGCA AGGCTTTGCG GACCGACCCG ACCAGCGCCT TTGGCGGCAT CATCGCCTTC
AACTGCGTGG TCGACGACGC GGCCGCCCGG CAGCTCGGCC AGCAGTTCGC CGAGGTGCTG
CTGGCCCCTG ACTTCAGCGC GCAGGCGCTG GAGATCTTCA AAGCCAAGGC CAATCTGCGC
CTGCTCAGGA TTGCGCTGCC CGTCCAGACC GGCCAGGAGG GCAAAGAGCG CGGCCGCAAC
GCGCTCGATG CCCGGCGCAT CGGCTCCGGG CTGCTGCTGC AAACGGCAGA CAACCAGGAG
CTGTCGCCGA GCGCGCTGCG GGTCGTGACG CACAAGCGGC CCGGCCCCGA AGCGCTGCAA
GACCTGCTGT TCGCCTGGAA GGTCGCCAAA TACGTCAAGA GCAATGCCAT CGTGTTCTGC
AAGGACGGCA TGACCATGGG CGTCGGCGCT GGCCAGATGA GCCGCCTGGA TTCGGCACGC
ATCGCCAGCA TCAAGGCGCA GCAGGCCGGG CTGACGCTAC AGGGCACGGC CGTGGCCAGC
GACGCCTTCT TCCCCTTCCG TGACGGCCTG GATGTGGTGC TCGACGCCGG CGCCAGTTGC
GTGATCCAGC CCGGCGGCTC GGTGCGTGAC CAAGAGGTCA TCGATGCGGC CAACGAGCGC
GGCGTGGCCA TGGTGTTCAG CGGCCTGCGG CATTTCCGCC ACTGA
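
A GC content figure like the one reported in the record can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming GC content means the fraction of G and C bases among all bases once the spaces and line breaks of the 10-base blocks are removed. The function name `gc_content` is hypothetical, and only the first row of the gene sequence is used here as sample input.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and newlines used to group bases into
    blocks of ten) is ignored, and case is normalized.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAACGCAC TCCTGTCCGT CTCCGACAAG ACCGGCATCG TCGAATTCGC GCAAGCCCTG"
print(f"{gc_content(first_row):.0%}")
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the GC content field.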
 
Protein sequence
MNALLSVSDK TGIVEFAQAL HALGIGLLST GGTAKLLAGQ GLPVTEVAEL TQWPEMLDGR 
VKTLHPKVHA GLLARRERPG HMAALKEHGI ATIDLLVVNL YPFEATVAQA ACTLAEAVEN
IDIGGPAMVR SAAKNWQHVG VLTDAAQYPS VLAELRANGT LSDPLRFALS VAAFNRIAQY
DAAISHYLSS LRFEADRCID DSAVPARMQF PGQSNAIFSK VQDLRYGENA HQQAALYREL
HPAPGSLVTA EQLQGKELSY NNLADADAAW ECVKSFDAAA CVIVKHANPC GVALGLDAAS
AYRKALRTDP TSAFGGIIAF NCVVDDAAAR QLGQQFAEVL LAPDFSAQAL EIFKAKANLR
LLRIALPVQT GQEGKERGRN ALDARRIGSG LLLQTADNQE LSPSALRVVT HKRPGPEALQ
DLLFAWKVAK YVKSNAIVFC KDGMTMGVGA GQMSRLDSAR IASIKAQQAG LTLQGTAVAS
DAFFPFRDGL DVVLDAGASC VIQPGGSVRD QEVIDAANER GVAMVFSGLR HFRH