Gene GYMC61_1139 details

Gene Information

Locus tag: GYMC61_1139
Symbol: purH
ID: 8524978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 1150614
End bp: 1152152
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_003252277
Protein GI: 261418595
COG category:
COG ID:
TIGRFAM ID:
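
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions, not statements from the record. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 1150614
end_bp = 1152152
gene_length_bp = 1539
protein_length_aa = 512

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon, which encodes no residue.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)              # coordinate span vs. stated gene length
print(computed_length % 3 == 0)                       # CDS should be a whole number of codons
print(computed_protein_length == protein_length_aa)   # codons minus stop vs. stated protein length
```

Under these assumptions all three checks agree with the values listed in the record.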


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTGA AACGAGCATT GATCAGCGTG TCCAATAAGG AAGGCATCAT TCCGTTTGCG 
AAGCAGCTGG CTGAACTTGG CATTGATATC ATTTCGACCG GTGGGACAAA ACGAGCGCTT
GAAGACGCCG GCGTTCCCGT CATTTCGATT TCCGATGTCA CCGGCTTTCC GGAAATTTTG
GACGGGCGCG TCAAAACATT GCATCCGGCC ATTCACGGCG GCATTTTGGC GGTGCGCAGC
GATGAGCGCC ACCAAGCAGC GCTTAAAGAG CACGGCATTC GCCCGATCGA TTTGGTCGTC
GTCAACTTGT ATCCGTTCCA ACAAACGATC GCCAAACCGG ATGTGACGCT CGCCGAGGCG
ATTGAAAACA TCGATATCGG CGGCCCGACG ATGGTGCGGG CGGCGGCGAA AAACTATGCT
GATGTCGCGA TTGTCGTCGA TCCAGCCGAC TATCCGATAG TGATTGAAGA ACTGAAAATG
ACCGGTTCGA TCCAAGCAAA AACGCGGCAA CAACTGGCGG CGAAAGCGTT CCGCCATACG
GCGGCGTATG ACGCGATGAT TGCGGAGTAT TTGACAAACC TCACCGGAGA GAACTATCCG
GAAACGCTCA CGGTCACGTA TACGAAAAAA CAATCATTGC GCTATGGCGA GAATCCGCAT
CAATCGGCAG CGTTTTATGC CAAGCCGCTC GGTGCGGCGT TCTCGATTGC CAACGCGACA
CAGCTGCATG GCAAAGAGTT GTCGTACAAC AACATTAACG ACGCCAATGC GGCGATCAAC
CTCATTCGCG AATTTCAAGA GCCGGCTGTG GCCGCCATCA AACATATGAA CCCATGCGGC
GTCGGCGTCG GCGCGACGCT TCTTGAGGCG TTTACGAAAG CGTATGAAGC GGATCCAGTC
TCGATTTTCG GCGGCATTAT TGCGGTCAAC CGTGAAGTGG ACAAAGAAAC AGCCGAACGG
ATGCACGACA TCTTTTTGGA AATCGTCATC GCTCCGTCAT TCAGCGACGA GGCGCTTGCC
ATTTTGACGA AAAAGAAAAA CATCCGTCTG TTGACGCTTG ATTTTACCGG GCCGGACGTC
AAGGAAAACA TGCTCGTTTC CGTCAATGGC GGCTTGCTCG TCCAAGAGGC CGATACGTTC
ACGCTTGAAG ACGCCGAATG GAATGTCGTA ACGAAGCGCG AGCCGACCGA GGCTGAGCGC
GAACAGCTTC GGTTTGCTTG GAAGGTTGTC AAACATGTGA AATCGAATGC GATTGTACTG
GCCAAAAACG GGATGACGGT CGGCGTTGGC ACCGGGCAAA TGAACCGGGT CGGCGCGGCC
AAAATTGCGA TTGAACAGGC TGGGGAACAG GCGGTTGGCG CCGTGTTGGC GTCCGATGCG
TTCTTCCCGA TGGACGATAC GGTCGAGGCG GCGGCGAAAG CCGGCATTAC CGCGATCATT
CAGCCGGGCG GCTCGATTCG CGACGCCGAC TCGATCCGCA AAGCGGATGA ATACGGCATC
GCCATGGTCT TCACCGGCGT GCGCCACTTT AAACATTAA
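
A GC content figure such as the one in the record is typically the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the method the database itself uses; the function name `gc_percent` is hypothetical. It ignores the spaces and line breaks used to group the sequence into 10-base blocks.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (block separators, line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage on the first 10-base block of the gene sequence (4 G + 1 C out of 10 bases).
first_block = "ATGGCAGTGA"
print(gc_percent(first_block))
```

Passing the full gene sequence, with its block spacing left in place, would give the whole-gene value.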
 
Protein sequence
MAVKRALISV SNKEGIIPFA KQLAELGIDI ISTGGTKRAL EDAGVPVISI SDVTGFPEIL 
DGRVKTLHPA IHGGILAVRS DERHQAALKE HGIRPIDLVV VNLYPFQQTI AKPDVTLAEA
IENIDIGGPT MVRAAAKNYA DVAIVVDPAD YPIVIEELKM TGSIQAKTRQ QLAAKAFRHT
AAYDAMIAEY LTNLTGENYP ETLTVTYTKK QSLRYGENPH QSAAFYAKPL GAAFSIANAT
QLHGKELSYN NINDANAAIN LIREFQEPAV AAIKHMNPCG VGVGATLLEA FTKAYEADPV
SIFGGIIAVN REVDKETAER MHDIFLEIVI APSFSDEALA ILTKKKNIRL LTLDFTGPDV
KENMLVSVNG GLLVQEADTF TLEDAEWNVV TKREPTEAER EQLRFAWKVV KHVKSNAIVL
AKNGMTVGVG TGQMNRVGAA KIAIEQAGEQ AVGAVLASDA FFPMDDTVEA AAKAGITAII
QPGGSIRDAD SIRKADEYGI AMVFTGVRHF KH
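
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first three 10-base blocks of the gene. It builds the standard codon table, which translation table 11 shares for internal codons; the handling of alternative start codons in table 11 is deliberately not modeled, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGGCAGTGA AACGAGCATT GATCAGCGTG"))  # MAVKRALISV
```

The result matches the first 10-residue block of the protein sequence, and the final codons of the gene (AAA CAT TAA) give the closing residues KH followed by a stop.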