Gene GWCH70_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0857 
Symbol 
ID7977863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp920415 
End bp922427 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content48% 
IMG OID644797830 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_002949003 
Protein GI239826379 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTAC AGGACGTAAA GAAAAAACTG CGCGTCATCA CTGCGGAAGA TCTTTTCCGC 
ATTCGCTTTG TGAACGATCC TCAATTTTCA CCAGATGGCG AGAAAGTCAT CTTTGTGCAA
AAAACGATTG ATGACGAACG GGAATACCGC TCCCACCTTT TTCTCCTGAC ATTAGCGGAC
GGCAAAGTGA ACCAATGGAC GTTCGGAAAA GTAAAAGACG CTTTTCCACG TTGGTCCCCG
GATGGAAAAA CAATCGCGTT TCTTTCCAAC CGCTCCGGAA AAACGCAGAT TTGGCTTCTT
CCAACAGACG GCGGGGAAGC ACGCCAGTTA ACCGCATTAA AAAACGGAGT GCGCCATCTG
CGCTGGTCGC CGGACGGGCG GTTTCTTGTA GCACTCACTT CTCTTGGCAA GAAAGAAACG
TTAGAAGAAC GTGAGGAGCA AAACAAAAAA GAGAAAGAAA AACGCGAGTT AACACCACTT
GTCGTAGAGC GGCTTCAATA TAAATCCGAT GCAGCAGGAT TTCGCGATGA TAAACAAGAT
GTGCTCATTC GCATCAATGT AGAAACAGGC GAGATCCAAT CTCTGACAGA CAGAGATGAC
GAAATCGGCT CATTTGCCAT ATCCCCTGAC GGCAAAACAG TCGCGTTTGT CGCCAATCGG
AGCAATGAGC CTGATTTTAC GTTTACGCGC GACATATTTC TCCTTGATAT CGACAGCGGA
AAAACGACAA AAGTCACAGA TGGAAACGGG CTTTTCACTT CGCTGTCTTG GTCTCCGGAC
GGAAAAACGC TCGCGGCGAT TGGGCATGAC CTCAAATATT TAGGTGCTAC GTTAAATCGA
GTTTGGATTT TCGATCTTGT TAGCGGAGAA AAACGGGCGC TTACCGCAGA TTGGGATGTG
CATGTCGGTG ATGCGATGGT AGGAGATATG CATTCCGACG CGCCAAGTCC TGGACTCATT
TGGGATAAAG AAGGAAACGG CGTGTATTTT CTTGCTTCCG AACGAGGGCG CGTCAACCTT
CATCATGTAG CGTTAGACGG AACAATTACA CCATCTGTAG TCGGCGATTT CCATCTATAT
GGATTAACGA TTCACCCGCA TCAACATATA GCGATTGCGG CAATTAGCGA ACCAACCCAT
ATCGGCGACT TATATACCGT TTCTCTTCAT GATGGAAAAC GAAACAAACT GACAAATGCG
AATAAGGAGT TGGAAGAAGA AATTGTCCTT TCCGAACCGG AGACATTCAC CTATCCATCC
CGTGATGGCT GGAACATTCA AGGATGGATA ATGAAGCCCT CTCATCTCGA ACAAGGACAA
AAAGTGCCGC TTATCGTCGA AATCCACGGT GGTCCGCACG CGATGTACGG TTTTACTTTT
TTCCATGAAA TGCAGGTGCT TGCTGGCAAA GGATATGCGG TGCTGTTTAC GAATCCGCGC
GGCAGCCACG GCTATGGACA AACATTTGTC AATGCCGTAC GCGGCGATTA CGGCGGGATG
GACTATGAAG ACATCATGAG CGGCGTCGAC TATGTGCTCG AGCATTTTGA TTTTATCGAT
GAAACGCGCC TCGGCGTTAC AGGCGGAAGC TACGGCGGAT TTATGACGAA CTGGATCGTC
GGGCATACGG ACCGCTTTAA AGCCGCGGTA ACGCAGCGTT CTATTTCCAA CTGGCTCAGC
TTCTATGGAG TGAGCGACAT CGGTTATTTC TTCACAGAAT GGGAAATCGG CTGCAATGTG
TGGGAAGATC CAGAGCGGCT TTGGCACCAC TCTCCATTGA AATATGTGAA AAACATTCGA
ACGCCGCTAT TGATTCTCCA TAGCGAAAAA GATTACCGCT GCCCGATCGA ACAGGCGGAA
CAGCTGTTTA TTGCGCTGAA ACATTTAAAG CAGGAAACGA AGCTCATCCG CTTCCCGGAA
GCGAACCATG ATCTGTCCCG CAGCGGTCCG CCGACGCTTC GTCTTGAACG GCTGAATCAT
ATTGTCGGTT GGTTTGAACA ACATCTATCA TAA
 
Protein sequence
MTVQDVKKKL RVITAEDLFR IRFVNDPQFS PDGEKVIFVQ KTIDDEREYR SHLFLLTLAD 
GKVNQWTFGK VKDAFPRWSP DGKTIAFLSN RSGKTQIWLL PTDGGEARQL TALKNGVRHL
RWSPDGRFLV ALTSLGKKET LEEREEQNKK EKEKRELTPL VVERLQYKSD AAGFRDDKQD
VLIRINVETG EIQSLTDRDD EIGSFAISPD GKTVAFVANR SNEPDFTFTR DIFLLDIDSG
KTTKVTDGNG LFTSLSWSPD GKTLAAIGHD LKYLGATLNR VWIFDLVSGE KRALTADWDV
HVGDAMVGDM HSDAPSPGLI WDKEGNGVYF LASERGRVNL HHVALDGTIT PSVVGDFHLY
GLTIHPHQHI AIAAISEPTH IGDLYTVSLH DGKRNKLTNA NKELEEEIVL SEPETFTYPS
RDGWNIQGWI MKPSHLEQGQ KVPLIVEIHG GPHAMYGFTF FHEMQVLAGK GYAVLFTNPR
GSHGYGQTFV NAVRGDYGGM DYEDIMSGVD YVLEHFDFID ETRLGVTGGS YGGFMTNWIV
GHTDRFKAAV TQRSISNWLS FYGVSDIGYF FTEWEIGCNV WEDPERLWHH SPLKYVKNIR
TPLLILHSEK DYRCPIEQAE QLFIALKHLK QETKLIRFPE ANHDLSRSGP PTLRLERLNH
IVGWFEQHLS