Gene GWCH70_0082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0082 
Symbol 
ID7978533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp106707 
End bp107795 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content39% 
IMG OID644797056 
ProductATP:guanido phosphotransferase 
Protein accessionYP_002948288 
Protein GI239825664 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3869] Arginine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000334735 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTTTG AGAAGTTTTT TAATACGGCG GTCAGTTCTT GGATGAGTCA AGAGGGGCCT 
GATTCTGATA TCGTGTTAAG CAGCCGTATC CGTTTAGCAA GAAACATTGT TGATTTTCAG
TTTCCAACAG TATTTAACAA TGAGGAAGCA CAGCAAATTG TTTCATTGTT TGAGCAAACA
TTTGCTCATC GTTTTTACCC GTCTGTCGGT CGTTTTGAAT TGTTAAAAAT GTCAGAGCTT
CAACCGATTG AAAAAAGGGT ATTGGTAGAA AAGCATTTAA TTAGCCCGCA TTTGGCAGAA
GATTCTCCTT TTGGGGCGTG CTTGCTTTCA GAAAATGAAG AAATAAGCAT TATGATTAAT
GAAGAGGATC ACATTCGTAT TCAATGTTTA TTTCCTGGTC TTCAATTAAC AGAAGCGTTA
AAAGTGGCTA ATGAGCTTGA TGATTGGATT GAGGAACATG TCAATTATGC GTTTGATGAA
AAACTCGGAT ATTTAACAAG CTGTCCGACA AACGTTGGAA CAGGGATGCG CGCTTCTGTT
ATGATGCATC TCCCGGCTCT CGTTTTAACA CAGCAAATAA ACCGCATTAT TCCAGCAATC
AACCAACTAG GATTAGTAGT ACGCGGAACA TATGGAGAAG GCAGTGAGGC GTTAGGTAAC
ATTTTCCAAA TTTCAAATCA AATTACATTA GGAAAGTCGG AAGAGGATAT TGTGGAAGAT
TTGAAAAGCG TTGTTCAACA ATTAATTGCC CAGGAAAGAA TGGCGAGGGA GACATTAGTC
AAAACTTTAA ACATACAATT AGAAGACAGA GTATTCCGTT CTTATGGGAT ATTAGCAAAT
AGCCGTGTTA TTGAATCTAA AGAAGCAGCG CAATGTTTGT CTGATGTACG TTTAGGAATT
GACTTAGGAT ATATTAAAAA TATTTCGCGC AATATTTTAA ATGAGCTGAT GATTTTAACT
CAACCTGGAT TTTTACAACA GTATGCAGGC GGCGTGCTAA GACCGGAAGA ACGGGATGTT
CGACGGGCGG CACTAATCCG CGAACGTCTA AAAATGGAAG AAAGAAAAGC GATGGAGGGT
GATGAATAA
 
Protein sequence
MSFEKFFNTA VSSWMSQEGP DSDIVLSSRI RLARNIVDFQ FPTVFNNEEA QQIVSLFEQT 
FAHRFYPSVG RFELLKMSEL QPIEKRVLVE KHLISPHLAE DSPFGACLLS ENEEISIMIN
EEDHIRIQCL FPGLQLTEAL KVANELDDWI EEHVNYAFDE KLGYLTSCPT NVGTGMRASV
MMHLPALVLT QQINRIIPAI NQLGLVVRGT YGEGSEALGN IFQISNQITL GKSEEDIVED
LKSVVQQLIA QERMARETLV KTLNIQLEDR VFRSYGILAN SRVIESKEAA QCLSDVRLGI
DLGYIKNISR NILNELMILT QPGFLQQYAG GVLRPEERDV RRAALIRERL KMEERKAMEG
DE