Gene MCA1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA1747 
SymbolpurH 
ID3103599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp1862857 
End bp1864419 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content67% 
IMG OID637170908 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_114186 
Protein GI53803917 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.731302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATC CTATCGCCCG CGCCCTCGTC AGCGTTTCCG ACAAGACCGG CTGCGTGGAG 
TTCTGCCGGG GCCTCGCCGG CATCGGCGTC GAAATCATCT CGTCCGGCGG CACTGCCAGA
CTGTTGGCCG AACACGGCGT CCCGACCATC GAAGTCAGCG ACTACACCGG CTTTCCGGAG
ATGATGGACG GCCGGGTCAA GACGCTGCAC CCCAAGGTTC ATGGCGGCAT CCTCGGCCGG
CGCGGCATCG ACGAGGCGGT CATGGCAGAA CACGGCATCC GCCCGATCGA CCTGGTCGCG
GTCAATCTCT ATCCGTTCGA ACAGACCGTC GCCCGGCCCG ATTGCGACAT GGAGACCGCC
ATCGAGAACA TCGACATCGG CGGCCCGGCC CTGATCCGCG CCGCGTCGAA GAATCACGCT
TCCGTCGCGG TGGTGGTCGA CCCTGCGGAC TACGCCGCGG TGCTGGCTGA GCTGGAAGCC
TCCGGCGGCG GCCTGTCACA CGCCACCCGC TTCGCTCTGG CGGCCAAGGC CTTCCGTCAT
ACGGCCTGGT ACGATGCAGC GATCGCCGAC TATCTCGATC GCCGCCAAGG GGCTGACGGC
TTCGCCGATC CGCTGCTGCT GCGCTTTCGC CGTATCCAAT CGATGCGCTA CGGCGAGAAT
CCACACCAGC GCGCGGCGTT TTACCTCGAA CCCGGCGCCC CCCCCGGCTG CATCGCATCG
GCCCGCCAGT TGCAGGGCAA GGAGCTGTCT TACAACAATA TCGCCGATGC CGATGCCGCG
CTCGAATGCG TCAAGGGTTT CTCGGATCTC CCCGCCTGCG TGATCGTCAA ACACGCCAAT
CCGTGCGGCG TAGCCGAAAG CACCACCCTG TCCCAGGCCT ACGACCTGGC CTATGCCACC
GATCCCACCT CCGCCTTCGG CGGCATCATC GCGTTCAACC GGCCGCTGGA CGCCGAAACC
GCCCGTACCA TCGTCGAACG GCAGTTCGTC GAGGTCGTCA TCGCGCCCGC GATCGCCGAC
GATGCCCTGC CCGTTCTGGC CGCCAAGCCC AACGTGCGCG TGCTGAGCAC CGGCCCCTGG
CCCGCCGAGC CGGCAGCTGA GCTGGATTTC AAGCGCGTCG GCGGCGGCTT GCTGGTGCAG
GACAAAGACA TCGAACGGGT GACCGGCGGG CGTTTCCGGG TCGTGAGCCG GCGCTCGCCG
ACGGAACAGG AGCTGATCGA CCTCCAGTTC GCCTGGCGGG TGGCCAAATT CGTCAAGTCC
AACGCCATCG TCTATTGCAG GGACCGCCGC ACGGTCGGCA TCGGCGCCGG CCAGATGAGC
CGCGTCTACT CCGCCCGCAT CGCCGCGCTC AAGGCGCAGG ACGAAGGCTT GAGCGTGGCG
GGTTCGGTCG TCGCCTCCGA CGCGTACTTC CCGTTCCGCG ACGGTATCGA CGCCGCCGCC
GAAGCCGGGG TCACGGCGGT GATCCAACCG GGCGGTTCGG TCAGGGACCC CGAGGTGATC
GCTGCGGCGG ACGAACACGG CATGGCCATG GTCTTCACCG GCATCCGTCA CTTCCGCCAT
TAG
 
Protein sequence
MSNPIARALV SVSDKTGCVE FCRGLAGIGV EIISSGGTAR LLAEHGVPTI EVSDYTGFPE 
MMDGRVKTLH PKVHGGILGR RGIDEAVMAE HGIRPIDLVA VNLYPFEQTV ARPDCDMETA
IENIDIGGPA LIRAASKNHA SVAVVVDPAD YAAVLAELEA SGGGLSHATR FALAAKAFRH
TAWYDAAIAD YLDRRQGADG FADPLLLRFR RIQSMRYGEN PHQRAAFYLE PGAPPGCIAS
ARQLQGKELS YNNIADADAA LECVKGFSDL PACVIVKHAN PCGVAESTTL SQAYDLAYAT
DPTSAFGGII AFNRPLDAET ARTIVERQFV EVVIAPAIAD DALPVLAAKP NVRVLSTGPW
PAEPAAELDF KRVGGGLLVQ DKDIERVTGG RFRVVSRRSP TEQELIDLQF AWRVAKFVKS
NAIVYCRDRR TVGIGAGQMS RVYSARIAAL KAQDEGLSVA GSVVASDAYF PFRDGIDAAA
EAGVTAVIQP GGSVRDPEVI AAADEHGMAM VFTGIRHFRH