Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1090 |
Symbol | purH |
ID | 4446428 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1176993 |
End bp | 1178672 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639688896 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_830584 |
Protein GI | 116669651 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.01223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCTTTA CGCAGCTAGA CCGTGTTCCC ATCCGCCGAG CCCTGATCTC GGTCTACGAC AAGACCGGTC TGGAGGAGCT CGCGAAGGGC CTGCACGAAG CAGGCGTCAA GATCGTCTCC ACCGGCTCCA CCGCGAAGAA GATCGCGGCT GCAGGCATCC CCGTCCAGGA GGTCGAGGAA GTCACCGGTT CGCCGGAGAT GCTGGACGGC CGCGTCAAGA CGCTCCACCC GCGCGTGCAC GGCGGCATCC TGGCCGACCG CCGCGTCCCC GCCCACATGG AAACCCTGGC CGGCATGGAG ATCGAGGCGT TCGACCTCGT CGTCGTGAAC CTCTACCCGT TCGTGGAGAC CGTCAAGTCC GGTGCCGCGC AGGATGACGT CGTGGAGCAG ATCGACATCG GCGGCCCCGC CATGGTGCGC TCCGCCGCGA AGAACCACGC CGCCGTCGCG ATCGTTACCG ACCCGAATTT CTACGGCGAC GTTGTCCGCG CTGCCGCTGA AGGCGGCTTC GACCTGAAGA CCCGCCAGCG CCTGGCCGCG AAGGCCTTCG CCCACACTGC CAGCTACGAC ACCGCAGTGG CCACGTGGAC GGCCAGCCAG TTCCTGGACG AGGACGGCGA CGGCGTGATC GACTGGCCGG CCTACGCCGG CCTGGCGCTG GAACGCTCCG AGGTCCTCCG CTACGGCGAA AACCCGCACC AGCAGGCCGC CCTCTACGTG GACAAGGCCG CTCCCGCCGG CATCGCGCAG GCTGACCAGA TCCACGGCAA GGCCATGAGC TACAACAACT TCGTGGACGC CGACGCCGCC CTCCGTGCAG CGTTCGACTT CGCTGAGCCC GCCGTGGCCA TCATCAAGCA CGCCAACCCC TGCGGCGTGG CAGTCGGTTC CGCCGACGCC GCGGACCCCA TCGCCGACGC CCACGCCAAG GCCCACGCCT GCGACCCCGT GTCCGCATTC GGCGGCGTTA TCGCAGCCAA CCGCACGGTC ACCGCCGGAA TGGCGCGCAC CGTTGCCGGC ATCTTCACCG AGGTCGTCAT CGCGCCGGGC TTCGAGGACG AGGCCGTGGA GATCCTGTCC AAGAAGAAGA ACATCCGCCT CCTGGCCCTG CCGGAAGGCT ACGGCCGCTA CCCGACCGAG TTCCGCCAGG TCTCCGGCGG CATGCTGGTG CAGGCTGCTG ACAAGGTCGA CGCCGAAGGC GACAACCCCG CCAACTGGAC CCTCGCAGCC GGCGAGGCAG CGGATGCAGC CACGCTGGCC GACCTCGCGT TCGCCTGGAC CGCCTGCCGT GCTGCCAAGT CCAACGCCAT CCTGCTCGCA GACCACGGCG CTGCCGTCGG CATCGGCATG GGCCAGGTCA ACCGGCTCGA CTCCTGCAAG CTGGCCGTGG AACGCGCCAA CACCCTGGGT GTGCAGGTCG AGTCCGACGT CGAGGGCGCC GGGGGTGCAG CCGGTCCGTC GACGACAGAG GCCAGCGCAG CCCCGCAGCG TGCCCGCGGT GCCGTGGCAG CCTCGGACGC GTTCTTCCCG TTCGCCGACG GACTGCAGAT CCTGATCGAC GCCGGCGTCC GCGCCGTGGT CCAGCCCGGC GGTTCCGTCC GGGATGACGA AGTGATTGCA GCGGCGAACG CGGCCGGCAT CACCATGTAC TTCACGGGTG CGCGCCACTT CTTCCACTAG
|
Protein sequence | MSFTQLDRVP IRRALISVYD KTGLEELAKG LHEAGVKIVS TGSTAKKIAA AGIPVQEVEE VTGSPEMLDG RVKTLHPRVH GGILADRRVP AHMETLAGME IEAFDLVVVN LYPFVETVKS GAAQDDVVEQ IDIGGPAMVR SAAKNHAAVA IVTDPNFYGD VVRAAAEGGF DLKTRQRLAA KAFAHTASYD TAVATWTASQ FLDEDGDGVI DWPAYAGLAL ERSEVLRYGE NPHQQAALYV DKAAPAGIAQ ADQIHGKAMS YNNFVDADAA LRAAFDFAEP AVAIIKHANP CGVAVGSADA ADPIADAHAK AHACDPVSAF GGVIAANRTV TAGMARTVAG IFTEVVIAPG FEDEAVEILS KKKNIRLLAL PEGYGRYPTE FRQVSGGMLV QAADKVDAEG DNPANWTLAA GEAADAATLA DLAFAWTACR AAKSNAILLA DHGAAVGIGM GQVNRLDSCK LAVERANTLG VQVESDVEGA GGAAGPSTTE ASAAPQRARG AVAASDAFFP FADGLQILID AGVRAVVQPG GSVRDDEVIA AANAAGITMY FTGARHFFH
|
| |