Gene GWCH70_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2359 
Symbol 
ID7975956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2402320 
End bp2403777 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content50% 
IMG OID644799162 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_002950322 
Protein GI239827698 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000690583 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAAAG ATCAGCCGCT TATTTTTGAA TTAAGCAAAC CTGGCCGCAT TGGCTATAGT 
TTACCGGAGT TAGACGTTCC AGCTGTCCAT GTCGAAGATG TCGTACCGGC CGATTATATT
CGTACGGAGG AGCCGGAGCT TCCAGAGGTG TCGGAGCTCG ATATTATGCG CCATTACACC
GCATTGTCCA AACGGAATCA TGGCGTCGAT TCCGGCTTTT ATCCACTCGG TTCTTGTACG
ATGAAATACA ATCCGAAAAT CAATGAAAAT GTCGCACGCC TCGCTGGATT TGCTCATATT
CATCCGCTTC AGCCTGAGGA AACGGTGCAA GGTGCGTTAG AATTAATGTA TGACCTGCAG
GAGCACTTAA AAGAAATCAC CGGGATGGAC GCCGTTACGT TGCAGCCGGC GGCGGGAGCG
CACGGCGAAT GGACAGGGCT GATGATGATT CGCGCGTATC ACGAAGCGAA CGGCGACTTT
CACCGGACGA AAGTGATTGT TCCTGACTCC GCTCACGGAA CGAACCCAGC TTCCGCAACG
GTAGCCGGCT TTGAAACCGT TACTGTCAAG TCGACAGAAG ATGGGTTAGT GGACTTAGAA
GATTTAAAAC GCGTTGTTGG CCCGGATACA GCGGCATTGA TGCTTACAAA TCCAAATACG
CTCGGCTTGT TTGAGAAAAA TATTTTAGAA ATGGCCGAGA TCGTGCATGA AGCTGGCGGA
AAGTTGTACT ATGACGGCGC TAATTTAAAC GCTGTATTAA GCAAGGCAAG ACCGGGAGAT
ATGGGCTTTG ATGTCGTGCA TCTCAACTTG CATAAAACAT TTACCGGTCC GCACGGCGGA
GGCGGCCCAG GCTCCGGTCC AGTCGGGGTG AAAGCGGACC TTATCCCGTT TTTGCCGAAG
CCGGTCGTTT CAAAAGGGGA AAACGGCTAT TATTTGGACT ATGACCGACC GCAAGCGATT
GGGCGCGTGA AACCTTTCTA CGGCAATTTC GGCATTAACG TCCGCGCTTA CACGTATATT
CGCTCCATGG GTCCAGATGG TTTAAAAGCG GTGACGGAAT ATGCCGTGTT AAATGCCAAC
TACATGATGC GCCGTCTTGC GGAATATTAC GAGTTGCCGT ACGACCGCCA TTGCAAGCAT
GAATTTGTCT TGTCGGGCAA ACGGCAAAAG AAACTCGGTG TCCGCACGCT CGATATTGCT
AAGCGTTTGC TGGATTTCGG CTTCCATCCG CCGACAGTAT ATTTTCCGCT CATTGTGGAT
GAATGTATGA TGATCGAGCC GACGGAAACC GAATCGAAAG AAACGTTAGA CGCGTTTATC
GACGCGATGA TTCAAATCGC TAAAGAAGCG GAAGAAAATC CAGAAATCGT GCAAGAAGCA
CCGCATACGA CCGTTGTCAA ACGCCTCGAC GAAACGACAG CTGCGCGCAA GCCAATCTTG
CGCTACCAAA AACAATAA
 
Protein sequence
MHKDQPLIFE LSKPGRIGYS LPELDVPAVH VEDVVPADYI RTEEPELPEV SELDIMRHYT 
ALSKRNHGVD SGFYPLGSCT MKYNPKINEN VARLAGFAHI HPLQPEETVQ GALELMYDLQ
EHLKEITGMD AVTLQPAAGA HGEWTGLMMI RAYHEANGDF HRTKVIVPDS AHGTNPASAT
VAGFETVTVK STEDGLVDLE DLKRVVGPDT AALMLTNPNT LGLFEKNILE MAEIVHEAGG
KLYYDGANLN AVLSKARPGD MGFDVVHLNL HKTFTGPHGG GGPGSGPVGV KADLIPFLPK
PVVSKGENGY YLDYDRPQAI GRVKPFYGNF GINVRAYTYI RSMGPDGLKA VTEYAVLNAN
YMMRRLAEYY ELPYDRHCKH EFVLSGKRQK KLGVRTLDIA KRLLDFGFHP PTVYFPLIVD
ECMMIEPTET ESKETLDAFI DAMIQIAKEA EENPEIVQEA PHTTVVKRLD ETTAARKPIL
RYQKQ