Gene GWCH70_3377 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3377 
Symbol 
ID7977133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3403153 
End bp3404772 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content47% 
IMG OID644800144 
Productpeptidase M20 
Protein accessionYP_002951283 
Protein GI239828659 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4187] Arginine degradation protein (predicted deacylase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.106714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATGGC AAACGACGCA ACAATTAAAG CAATTGCTTT GTCGTTTAGT CGAATATCCA 
AGCATCAGCG GAACGGAAGC AGAGGTGTTG CTTGCGCAGT ACATAGCGGA ACAGTTGCTT
ACACTTGATT ATTTCCAAAG AAATAATGAG TTTGTGCAAT TGCATCCGAC AGGAGACGGC
CGTTATTTTG TTACCGCGCT GGTGAAAAAA GCGGAGCAAG TGCGCGATAC CGTCATTCTC
ATCAGCCATT TTGATGTGGT CGATGTGCAA GATTACGGGG CATGGAAGGA CGCCGCGTTT
TCTCCCGAAA AGCTGACGGA GCGGTTTTAC GAACAGAAAC AACAGCTTCC TTCTGATGTG
CAAGCTGATA TGGAAGAAGG GGAATGGTTA TTTGGCCGCG GCGTGATGGA TATGAAGTGC
GGTCTTGCTC TTCATATGTC GCTGATCGAG CAAGCATGCC AAGGAAAGTT TGAAGGAAAT
TTACTGCTGC TTACGGTACC GGATGAGGAA GTGAGCTCGG TAGGAATGCG TGCGGCCGTG
CCGATCCTTG TCGAAATGGC AGAAAAATAC GGATTAACCT ATCGGCTTGT GCTTAATTCC
GAACCGATGT TTACTCGCTA TCCAGGGGAC AAAGCGAATT ACATTTACAC CGGCTCGATT
GGAAAAGTGC TGCCGGGCTT TTATTGTTAC GGAAAAGAAA CGCATGTCGG AGAACCGTTC
GCAGGGCTAA ATGCCAATTT CATGGTGGCG CAAATCGCTA ATGAATTAGA ATTTAACACT
GATTTTTGCG AAGTGTTTAG TGGGGAAGTC AGTCCGCCGC CGACAAACTT ATTGCAGACC
GATTTAAAAG AAGAATATTC TGTGCAAATA CCGCATCGCG CGGTAACGTT ATTTAATTTA
TTTTTGCAGA AAAGGTCGCT CGATGATGTG ACAAACTCGT TAATCGCAAT CGCAAAGCGG
GCAGCAAAAC GCATCGAAGA GCGTTATAAC GTTGAGGCGT CCCGCTTCGC GAAACTCGAA
AAATGGACAC CGAAACCGCT GTCTGTCAAC GTATTCACCT TTTCAGAGTT AAGAAAGAAA
GCAGTGGAGA TGGTTGGATT AGAAAGAATC GAACAGCTTG AAGCAAGCGT TCTATCAACG
GAAACAGCGA AAGACGAGCG GGAGAAAACG ATCAAACTAG TCGACCGGCT TGCTATACTT
TGTAAAGATT TCGCGCCGAT GATCGTTCTT TTTTATGCTC CTCCTTATTA TCCGGCTGTC
AACGCCAGCA GCGATCCGCT CGTTCAGCGT CTTGTTGCGA AGCTGCAAAA ATACGCGCAA
GAAAAGCATG GTATTTCTCT TGTGCAGCAA CATTACTTCG GCGGAATTTC CGACTTAAGC
TATGTCGGCT TGCAGCAATC TTCATCTTCT TTACGAGCTC TTACCGATAA TATGCCGATA
TGGAATCGCG GCTATCATTT GCCGTTTGAT GCGCTGGCAA AATTTCAAGT TCCGGTGCTC
AATGTCGGTC CGATCGGACG CGACGCCCAT CAATGGACGG AACGCCTCAA CGTTCGTTTT
GCGTTTACGA CGGTGAAATC GTGGCTGGAG TATACAATTA ACGAAGTATT TGCAAGATAG
 
Protein sequence
MKWQTTQQLK QLLCRLVEYP SISGTEAEVL LAQYIAEQLL TLDYFQRNNE FVQLHPTGDG 
RYFVTALVKK AEQVRDTVIL ISHFDVVDVQ DYGAWKDAAF SPEKLTERFY EQKQQLPSDV
QADMEEGEWL FGRGVMDMKC GLALHMSLIE QACQGKFEGN LLLLTVPDEE VSSVGMRAAV
PILVEMAEKY GLTYRLVLNS EPMFTRYPGD KANYIYTGSI GKVLPGFYCY GKETHVGEPF
AGLNANFMVA QIANELEFNT DFCEVFSGEV SPPPTNLLQT DLKEEYSVQI PHRAVTLFNL
FLQKRSLDDV TNSLIAIAKR AAKRIEERYN VEASRFAKLE KWTPKPLSVN VFTFSELRKK
AVEMVGLERI EQLEASVLST ETAKDEREKT IKLVDRLAIL CKDFAPMIVL FYAPPYYPAV
NASSDPLVQR LVAKLQKYAQ EKHGISLVQQ HYFGGISDLS YVGLQQSSSS LRALTDNMPI
WNRGYHLPFD ALAKFQVPVL NVGPIGRDAH QWTERLNVRF AFTTVKSWLE YTINEVFAR