Gene Arth_1090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1090 
SymbolpurH 
ID4446428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1176993 
End bp1178672 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content69% 
IMG OID639688896 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_830584 
Protein GI116669651 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.01223 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCTTTA CGCAGCTAGA CCGTGTTCCC ATCCGCCGAG CCCTGATCTC GGTCTACGAC 
AAGACCGGTC TGGAGGAGCT CGCGAAGGGC CTGCACGAAG CAGGCGTCAA GATCGTCTCC
ACCGGCTCCA CCGCGAAGAA GATCGCGGCT GCAGGCATCC CCGTCCAGGA GGTCGAGGAA
GTCACCGGTT CGCCGGAGAT GCTGGACGGC CGCGTCAAGA CGCTCCACCC GCGCGTGCAC
GGCGGCATCC TGGCCGACCG CCGCGTCCCC GCCCACATGG AAACCCTGGC CGGCATGGAG
ATCGAGGCGT TCGACCTCGT CGTCGTGAAC CTCTACCCGT TCGTGGAGAC CGTCAAGTCC
GGTGCCGCGC AGGATGACGT CGTGGAGCAG ATCGACATCG GCGGCCCCGC CATGGTGCGC
TCCGCCGCGA AGAACCACGC CGCCGTCGCG ATCGTTACCG ACCCGAATTT CTACGGCGAC
GTTGTCCGCG CTGCCGCTGA AGGCGGCTTC GACCTGAAGA CCCGCCAGCG CCTGGCCGCG
AAGGCCTTCG CCCACACTGC CAGCTACGAC ACCGCAGTGG CCACGTGGAC GGCCAGCCAG
TTCCTGGACG AGGACGGCGA CGGCGTGATC GACTGGCCGG CCTACGCCGG CCTGGCGCTG
GAACGCTCCG AGGTCCTCCG CTACGGCGAA AACCCGCACC AGCAGGCCGC CCTCTACGTG
GACAAGGCCG CTCCCGCCGG CATCGCGCAG GCTGACCAGA TCCACGGCAA GGCCATGAGC
TACAACAACT TCGTGGACGC CGACGCCGCC CTCCGTGCAG CGTTCGACTT CGCTGAGCCC
GCCGTGGCCA TCATCAAGCA CGCCAACCCC TGCGGCGTGG CAGTCGGTTC CGCCGACGCC
GCGGACCCCA TCGCCGACGC CCACGCCAAG GCCCACGCCT GCGACCCCGT GTCCGCATTC
GGCGGCGTTA TCGCAGCCAA CCGCACGGTC ACCGCCGGAA TGGCGCGCAC CGTTGCCGGC
ATCTTCACCG AGGTCGTCAT CGCGCCGGGC TTCGAGGACG AGGCCGTGGA GATCCTGTCC
AAGAAGAAGA ACATCCGCCT CCTGGCCCTG CCGGAAGGCT ACGGCCGCTA CCCGACCGAG
TTCCGCCAGG TCTCCGGCGG CATGCTGGTG CAGGCTGCTG ACAAGGTCGA CGCCGAAGGC
GACAACCCCG CCAACTGGAC CCTCGCAGCC GGCGAGGCAG CGGATGCAGC CACGCTGGCC
GACCTCGCGT TCGCCTGGAC CGCCTGCCGT GCTGCCAAGT CCAACGCCAT CCTGCTCGCA
GACCACGGCG CTGCCGTCGG CATCGGCATG GGCCAGGTCA ACCGGCTCGA CTCCTGCAAG
CTGGCCGTGG AACGCGCCAA CACCCTGGGT GTGCAGGTCG AGTCCGACGT CGAGGGCGCC
GGGGGTGCAG CCGGTCCGTC GACGACAGAG GCCAGCGCAG CCCCGCAGCG TGCCCGCGGT
GCCGTGGCAG CCTCGGACGC GTTCTTCCCG TTCGCCGACG GACTGCAGAT CCTGATCGAC
GCCGGCGTCC GCGCCGTGGT CCAGCCCGGC GGTTCCGTCC GGGATGACGA AGTGATTGCA
GCGGCGAACG CGGCCGGCAT CACCATGTAC TTCACGGGTG CGCGCCACTT CTTCCACTAG
 
Protein sequence
MSFTQLDRVP IRRALISVYD KTGLEELAKG LHEAGVKIVS TGSTAKKIAA AGIPVQEVEE 
VTGSPEMLDG RVKTLHPRVH GGILADRRVP AHMETLAGME IEAFDLVVVN LYPFVETVKS
GAAQDDVVEQ IDIGGPAMVR SAAKNHAAVA IVTDPNFYGD VVRAAAEGGF DLKTRQRLAA
KAFAHTASYD TAVATWTASQ FLDEDGDGVI DWPAYAGLAL ERSEVLRYGE NPHQQAALYV
DKAAPAGIAQ ADQIHGKAMS YNNFVDADAA LRAAFDFAEP AVAIIKHANP CGVAVGSADA
ADPIADAHAK AHACDPVSAF GGVIAANRTV TAGMARTVAG IFTEVVIAPG FEDEAVEILS
KKKNIRLLAL PEGYGRYPTE FRQVSGGMLV QAADKVDAEG DNPANWTLAA GEAADAATLA
DLAFAWTACR AAKSNAILLA DHGAAVGIGM GQVNRLDSCK LAVERANTLG VQVESDVEGA
GGAAGPSTTE ASAAPQRARG AVAASDAFFP FADGLQILID AGVRAVVQPG GSVRDDEVIA
AANAAGITMY FTGARHFFH