Gene GWCH70_2483 details


Gene Information

Locus tag: GWCH70_2483
Symbol:
ID: 7979038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2509134
End bp: 2510402
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 45%
IMG OID: 644799284
Product: peptidase U32
Protein accession: YP_002950444
Protein GI: 239827820
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
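
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2509134
END_BP = 2510402
GENE_LENGTH_BP = 1269
PROTEIN_LENGTH_AA = 422


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the values stated in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the record: the coordinate span gives the stated gene length, and dropping the stop codon from the codon count gives the stated protein length.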


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTTAA AAAACGATAA AATTTCCGAG GTCATTAACG GCAAGCGCGT GATTGTGAAG 
AAGCCCGAGC TTCTCGCTCC AGCCGGCAAC CTAGAAAAGC TAAAAATCGC CGTGCATTAT
GGAGCAGATG CGGTATTTAT CGGCGGACAA GAATATAGCT TACGCGCCAA CGCCGACAAT
TTTACGCTTG AAGAAATTGC GGAAGGAGTG CGCTTTGCAA ACCAATATGG CGCAAAAGTG
TATGTGACAG CGAACATTTA TGCACATAAC GAAAACATTC CCGGGCTCGA AGAATATTTG
CAGGCGCTAG AACAAGCAGG TGTTCACGGC ATTATTGTCG CTGATCCGCT TATTATCGAA
ACGGCGCGTC GGGTGGCGCC GAAATTAGAA GTGCACTTGA GTACGCAGCA GTCGATGGCA
AACTGGAAAG CAGTTCAATT TTGGAAAGAA GAAGGATTGG AACGCGTGGT GCTCGCACGC
GAAACGACCG CGGAAGAAAT TCGGGAAATT AAAGAAAAAG TCGATATTGA AATTGAGGCG
TTTATTCATG GGGCGATGTG TTCCGCGTAC TCCGGCCGCT GTGTATTAAG CAACCATATG
ACAGCACGCG ACTCCAACCG CGGGGGATGC TGTCAATCAT GCCGCTGGGA TTACGATTTA
TACCAATTAG AAGGGGATAA AGAAATACCG TTGTTTGATG AAAATGATGC ACCGTTCGCG
ATGAGCGCAA AAGATTTGAA TTTAATTCGC GCGATTCCAA CGATGATTGA ATTAGGCGTT
GACAGCTTGA AAATCGAGGG GCGGATGAAA TCGATCCATT ATGTGGCGAC AGTCGTGAGT
GTTTATCGCA AAGTAATCGA TGCCTATTGC GCCGACCCAG ACAATTTTGT CATTCGCGAA
GAATGGATAA AAGAGTTGGA TAAATGTGCC AACCGCGATA CGGCTCCATC TTTCTTTGAA
GGAATGCCGG GCTATACAGA TCATATGTAC GGTTCTCATA GCCGAAAAAC AAGCCATGAA
TTTGCCGGTC TTGTACTGGA TTATGATAAA GAAACGAAGA TCGTCACATT ACAACAACGC
AACTTTTTTA AACCGGGAGA TGAAGTCGAA TTTTTTGGAC CGGAAATTGA AAACTTCACA
CAAGTAGTGG AAAAAATTTG GGACGAAGAT GGAAACGAGT TAGATGCTGC ACGCCATCCA
TTGCAAATTG TCAAGTTTAA AGTAGAGCGA GAAGTCTTCC CATACAACAT GATGAGAAAG
GAGATCTAA
 
Protein sequence
MLLKNDKISE VINGKRVIVK KPELLAPAGN LEKLKIAVHY GADAVFIGGQ EYSLRANADN 
FTLEEIAEGV RFANQYGAKV YVTANIYAHN ENIPGLEEYL QALEQAGVHG IIVADPLIIE
TARRVAPKLE VHLSTQQSMA NWKAVQFWKE EGLERVVLAR ETTAEEIREI KEKVDIEIEA
FIHGAMCSAY SGRCVLSNHM TARDSNRGGC CQSCRWDYDL YQLEGDKEIP LFDENDAPFA
MSAKDLNLIR AIPTMIELGV DSLKIEGRMK SIHYVATVVS VYRKVIDAYC ADPDNFVIRE
EWIKELDKCA NRDTAPSFFE GMPGYTDHMY GSHSRKTSHE FAGLVLDYDK ETKIVTLQQR
NFFKPGDEVE FFGPEIENFT QVVEKIWDED GNELDAARHP LQIVKFKVER EVFPYNMMRK
EI