Gene GWCH70_1279 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1279 
Symbol 
ID7976060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1327851 
End bp1328828 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content51% 
IMG OID644798223 
Productformiminoglutamase 
Protein accessionYP_002949396 
Protein GI239826772 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAC AGCCGGACAA GGAGAAATGG ACGGGACGGA TCGACAGCGA AAGCGATGAG 
AAAAGTTTTC GCGTCCATCA AAAAATTCGT CTGCTTGATA TAGGACAAAT ACAGACGCAG
GCAGAAAATG CATTTGCTTT ATTAGGCTTT CAATGTGATG AAGGAGTCCG CCGCAATCAA
GGACGGCAAG GAGCGTATCA CGCGCCGGTG GAAGTGAAAA AAGCGCTGGC GAACCTGCCA
TGGCACCTGC CGTCTCACAC AATACTTTAC GATGTGGGCG AAATTACTTG TGAAGGGGGA
GAGTTAGAAA ACAGCCAGAA ACATTTGGGT CAGGCGGTAG AGCGCCTTAT CTGCCATAAC
ATCACGCCGG TTGTCATCGG CGGCGGACAT GAAACCGCGT ACGGGCATTA TCTCGGTGTT
CGTCAGGCGG TCGGTTCGGA AACGAAGCTT GGCATTATCA ATATTGACGC TCATTTTGAC
ATGCGCCCAT ATGAACAAGG GCCGTCGTCG GGGACAATGT TTCGGCAAAT ATTAGATGAA
GATGGAAACG TGGGATACTG CTGCCTCGGC ATTCAACCGC TAGGCAACAC GGCGGCGTTA
TTTGAAACCG CTAATCGATA TGGATGCACG TACGTGCTTG AGGAAGAATT GACGTTGGCA
ACGCTAGAGC GCGCGTATGA GATCATTGAC GATTTTATCC AAAACTATGA TGTACTGATG
CTGACGCTTT GCATGGATGT GTTGAGTGCA AGCGCGGCAC CGGGAGTGAG CGCGCCTTCG
CCGTTCGGGC TTGATCCGAA AATCGTCCGC GCCTTGCTTC GTTATATTAT TTCCAAGCCA
CAAACGATCA GTTTCGATAT TTGTGAAGTG AATCCGTTGG TCGATGAAAA TCGAAAAACG
ATTGCGTTAG CGGCCGCCTT CTGCATGGAA GCGCTCGTTC ATTTCCACCG CCGCCAGCGG
GCGGCGACAG GTCGGTGA
 
Protein sequence
MYKQPDKEKW TGRIDSESDE KSFRVHQKIR LLDIGQIQTQ AENAFALLGF QCDEGVRRNQ 
GRQGAYHAPV EVKKALANLP WHLPSHTILY DVGEITCEGG ELENSQKHLG QAVERLICHN
ITPVVIGGGH ETAYGHYLGV RQAVGSETKL GIINIDAHFD MRPYEQGPSS GTMFRQILDE
DGNVGYCCLG IQPLGNTAAL FETANRYGCT YVLEEELTLA TLERAYEIID DFIQNYDVLM
LTLCMDVLSA SAAPGVSAPS PFGLDPKIVR ALLRYIISKP QTISFDICEV NPLVDENRKT
IALAAAFCME ALVHFHRRQR AATGR