Gene Information |
Locus tag | MCA1747 |
Symbol | purH |
ID | 3103599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 1862857 |
End bp | 1864419 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637170908 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_114186 |
Protein GI | 53803917 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
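The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative and are not part of the record.

```python
# Values copied from the record; the names are illustrative only.
START_BP = 1862857
END_BP = 1864419
GENE_LENGTH_BP = 1563
PROTEIN_LENGTH_AA = 520


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1563
    print(expected_protein_length(GENE_LENGTH_BP))  # 520
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length rows.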
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.731302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATC CTATCGCCCG CGCCCTCGTC AGCGTTTCCG ACAAGACCGG CTGCGTGGAG TTCTGCCGGG GCCTCGCCGG CATCGGCGTC GAAATCATCT CGTCCGGCGG CACTGCCAGA CTGTTGGCCG AACACGGCGT CCCGACCATC GAAGTCAGCG ACTACACCGG CTTTCCGGAG ATGATGGACG GCCGGGTCAA GACGCTGCAC CCCAAGGTTC ATGGCGGCAT CCTCGGCCGG CGCGGCATCG ACGAGGCGGT CATGGCAGAA CACGGCATCC GCCCGATCGA CCTGGTCGCG GTCAATCTCT ATCCGTTCGA ACAGACCGTC GCCCGGCCCG ATTGCGACAT GGAGACCGCC ATCGAGAACA TCGACATCGG CGGCCCGGCC CTGATCCGCG CCGCGTCGAA GAATCACGCT TCCGTCGCGG TGGTGGTCGA CCCTGCGGAC TACGCCGCGG TGCTGGCTGA GCTGGAAGCC TCCGGCGGCG GCCTGTCACA CGCCACCCGC TTCGCTCTGG CGGCCAAGGC CTTCCGTCAT ACGGCCTGGT ACGATGCAGC GATCGCCGAC TATCTCGATC GCCGCCAAGG GGCTGACGGC TTCGCCGATC CGCTGCTGCT GCGCTTTCGC CGTATCCAAT CGATGCGCTA CGGCGAGAAT CCACACCAGC GCGCGGCGTT TTACCTCGAA CCCGGCGCCC CCCCCGGCTG CATCGCATCG GCCCGCCAGT TGCAGGGCAA GGAGCTGTCT TACAACAATA TCGCCGATGC CGATGCCGCG CTCGAATGCG TCAAGGGTTT CTCGGATCTC CCCGCCTGCG TGATCGTCAA ACACGCCAAT CCGTGCGGCG TAGCCGAAAG CACCACCCTG TCCCAGGCCT ACGACCTGGC CTATGCCACC GATCCCACCT CCGCCTTCGG CGGCATCATC GCGTTCAACC GGCCGCTGGA CGCCGAAACC GCCCGTACCA TCGTCGAACG GCAGTTCGTC GAGGTCGTCA TCGCGCCCGC GATCGCCGAC GATGCCCTGC CCGTTCTGGC CGCCAAGCCC AACGTGCGCG TGCTGAGCAC CGGCCCCTGG CCCGCCGAGC CGGCAGCTGA GCTGGATTTC AAGCGCGTCG GCGGCGGCTT GCTGGTGCAG GACAAAGACA TCGAACGGGT GACCGGCGGG CGTTTCCGGG TCGTGAGCCG GCGCTCGCCG ACGGAACAGG AGCTGATCGA CCTCCAGTTC GCCTGGCGGG TGGCCAAATT CGTCAAGTCC AACGCCATCG TCTATTGCAG GGACCGCCGC ACGGTCGGCA TCGGCGCCGG CCAGATGAGC CGCGTCTACT CCGCCCGCAT CGCCGCGCTC AAGGCGCAGG ACGAAGGCTT GAGCGTGGCG GGTTCGGTCG TCGCCTCCGA CGCGTACTTC CCGTTCCGCG ACGGTATCGA CGCCGCCGCC GAAGCCGGGG TCACGGCGGT GATCCAACCG GGCGGTTCGG TCAGGGACCC CGAGGTGATC GCTGCGGCGG ACGAACACGG CATGGCCATG GTCTTCACCG GCATCCGTCA CTTCCGCCAT TAG
|
Protein sequence | MSNPIARALV SVSDKTGCVE FCRGLAGIGV EIISSGGTAR LLAEHGVPTI EVSDYTGFPE MMDGRVKTLH PKVHGGILGR RGIDEAVMAE HGIRPIDLVA VNLYPFEQTV ARPDCDMETA IENIDIGGPA LIRAASKNHA SVAVVVDPAD YAAVLAELEA SGGGLSHATR FALAAKAFRH TAWYDAAIAD YLDRRQGADG FADPLLLRFR RIQSMRYGEN PHQRAAFYLE PGAPPGCIAS ARQLQGKELS YNNIADADAA LECVKGFSDL PACVIVKHAN PCGVAESTTL SQAYDLAYAT DPTSAFGGII AFNRPLDAET ARTIVERQFV EVVIAPAIAD DALPVLAAKP NVRVLSTGPW PAEPAAELDF KRVGGGLLVQ DKDIERVTGG RFRVVSRRSP TEQELIDLQF AWRVAKFVKS NAIVYCRDRR TVGIGAGQMS RVYSARIAAL KAQDEGLSVA GSVVASDAYF PFRDGIDAAA EAGVTAVIQP GGSVRDPEVI AAADEHGMAM VFTGIRHFRH
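The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; since this gene starts with ATG, no special start handling is attempted. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and standard-library only, not the tool used to produce this record.

```python
# Codon table built from the conventional TCAG ordering. Translation
# table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumes only A, C, G, T are present; other symbols raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    prefix = "ATGAGCAATC CTATCGCCCG CGCCCTCGTC"
    print(translate(prefix))  # MSNPIARALV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content rows.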