Gene GWCH70_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2998 
Symbol 
ID7977367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3017615 
End bp3019057 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content43% 
IMG OID644799796 
Productcarboxyl-terminal protease 
Protein accessionYP_002950935 
Protein GI239828311 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000384491 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAAAA AAACAACCGC AATACTTATG GCTTTATCGA TGCTAGTTGG TGCCGGGGGA 
ACATATGCCG GCATGCAGCT TGCAGCGCCT GACAGCGATC GTGAAATAAC ACTGGCCGAG
CCAGATAAAG CGGCAACGAA CGATGACGAG AAAGAACTGA AGAAAGTCGA ACAAGCTTAT
GAGCTGATTA AAAAACGCTA TGTTGAAAAA GTTGATGATG ATAAACTCAT CCAAGGCGCA
ATTCAAGGAA TGATTAGCAC GCTGAACGAC CCGTATTCTG TCTATATGGA TGAAGAAACA
AGCGAACAAT TTACGGAGTC GCTTGACTCT TCGTTTGAAG GAATCGGTGC CGAGGTAAGC
ATGATGAACG GAAAAGTCAC GATTGTCGCA CCGATTAAAA ACTCGCCGGC AGAAAAAGCG
GGATTAAAAC CAAATGATCA AATTTTGCGG GTGAATGGCG AGAGTCTAGA AGGGCTTGAT
TTATATGAAG CCGTGTTGAA AATTCGCGGG GAAAAAGGGA CGACGGTACA ATTGGATATT
CTTCGCCCCG GCGTAAAAGA AGTGATTAAA GTAAAAGTAG TCCGCGACGA AATTCCGATT
GAAACGGTTT ATGATTCTGT AAAAACGTAT AACGGGAAAA AAGTCGGCTA TTTAGAAGTA
ACGTCGTTTT CCGAAAATAC AGCAAAAGAT TTTAAAAAGA AATTAGCAGA ATTAGAAAGC
AAGCATATCG ACGGGTTAAT CATTGATGTG CGCGGCAACC CAGGCGGCTA TTTGCAAAGT
GTGGAAGAAA TTTTAAAACA ATTCATTCCG AAAGATAAAC CATATGTACA AATCGAAGAA
CGCAATGGCG ATAAACAACG TTTTTATTCC GATTTAACGA AGAAAAAACC GTATCCGATC
GCCGTGTTAA TTGACAAAGG CAGCGCATCC GCTTCGGAAA TTTTAGCTGG TGCCATGAAA
GAAGCGGGAG GATATAAGCT CGTTGGTGAA ACATCGTTCG GAAAAGGAAC GGTGCAGCAA
GCGATTCCGA TGGGGGATGG CAGCAACATT AAATTAACGC TCTATAAATG GCTGACGCCG
GATGGCCATT GGATTCATAA AAAAGGTGTT AAGCCGGACG TTGAAGTAAA GCAGCCGGAT
TACTTCCACG TTAGTCCGCT TCATATCGAA AAAGAGCTTT CCTTTGATAT GAACAATGAG
CAAGTAAAAA GTGCGCAACA AATGTTAAAG GGACTTGGAT TTGACCCTGG CCGCACCGAC
GGCTACTTCA GCAAAGAAAC TGAGTCGGCG GTAAAAGCAT TTCAAAAGGC AAATAAACTC
CCGCAAACCG GAAAAATCGA TAAAAACACA GCCGAAGTAT TACAAGCAAA AGTGATGGAC
GCCATTCGCG ACGACAACAA TGATGTACAA CTAAAAACAG CGATGAAAGT GCTGTTTCAT
TGA
 
Protein sequence
MNKKTTAILM ALSMLVGAGG TYAGMQLAAP DSDREITLAE PDKAATNDDE KELKKVEQAY 
ELIKKRYVEK VDDDKLIQGA IQGMISTLND PYSVYMDEET SEQFTESLDS SFEGIGAEVS
MMNGKVTIVA PIKNSPAEKA GLKPNDQILR VNGESLEGLD LYEAVLKIRG EKGTTVQLDI
LRPGVKEVIK VKVVRDEIPI ETVYDSVKTY NGKKVGYLEV TSFSENTAKD FKKKLAELES
KHIDGLIIDV RGNPGGYLQS VEEILKQFIP KDKPYVQIEE RNGDKQRFYS DLTKKKPYPI
AVLIDKGSAS ASEILAGAMK EAGGYKLVGE TSFGKGTVQQ AIPMGDGSNI KLTLYKWLTP
DGHWIHKKGV KPDVEVKQPD YFHVSPLHIE KELSFDMNNE QVKSAQQMLK GLGFDPGRTD
GYFSKETESA VKAFQKANKL PQTGKIDKNT AEVLQAKVMD AIRDDNNDVQ LKTAMKVLFH