Gene GWCH70_1539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1539 
Symbol 
ID7979155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1613902 
End bp1615770 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content49% 
IMG OID644798432 
Productoligoendopeptidase F 
Protein accessionYP_002949605 
Protein GI239826981 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC GATCGTATGT ATGGTTGATT GTCATACTGC TGCTTATTCC GCTTCGAACG 
GATGCGGAAG AAACAAAAAT CAATGCAAAA TACCAATGGA ATCTAGCCGA CATTTACGTG
TCGGAAGCCA ATTTCAAACG CGACTATCAA GCAGTCGCCG ATGCGTTGCC GAAACTGTCC
TCCTATGAAG GCAAGCTTGC ACGCGCTTCC AACGTCGCCA AACTATTTGC CCTTAATGAA
AGGATAGCAC GAAAGCTCGA AAAATTGTCC CTTTATGCTC ACTTAAAACA AGATCTTAAC
ATTGAAGACA AGACGGCCGC TCATCTAAAA GCGCAAGTGG AAACACTGAT TTCCGACTAT
GCGGCAAAAA CGGCGTTTAT CGAGCCGGAA CTGCTGTCGC TTTCCGAACG GACGCTTGCC
AAGCTGCAAA AAAGCAAGCC GCTTAAGCCA TACCGCTATT ATTTTGAAGA ATTGCGCGAA
CGGAAAAAGC ATACTCTTTC CAAAAGAGAG GAACAGCTGC TCGCCAAGCT TTCTCCCATC
ATGAGTGATC CGGAAAACAT TTATAACAAT GCCGCGCGCG GCGATTATGA CCCGCCTTCT
GTGCGCACAC CGGACGGAAA AACGGTTTCG TTGACGGATG AAAACTATAC AAAAGCGCTT
GAACATCCGG ATCGCAACTA TCGCAAGCGG GCGTTTCAAA CGCGTTTTCA AAGCTATGAA
ACGATCGAAA ACACATCCGC CGCCACGTTA TACGCATCTG TGAAAGCGGA CGAACTGTAC
GCGAAAGCGC GAAAATACAA ATCTGGGCTC GATGCGGCAC TATCGGCCGA TGATGTGCCA
AAACAAGTGT TTACCAATCT CATTTCTACC GTCAATACTC ACTTGCCGTC ACTGCATCGC
TATGTCGAAC TGCGCAAAAA GGCGCTCGGC GTCGACCGTG TCCACACTTA CGATATGTCT
GTGCCGCTCG TTGAAGAGAC CATCGCGAAA AAAATGAAGT TTCCGTTCGA AACGGCGCAA
TCGCTCATCC TTGAAGGGCT GAAACCGCTC GGAGACGACT ACATCCAAAA CGTGCGGCGC
GCTTTCGAAC AGCGCTGGAT TGACGTCTAT CCGCGCCCAA AAAAATATAC GGGCGGCTAT
AATACGGGGG CGTACGACAC CCATCCGTTT ATTTTGCTCA ACTACGACGG GTCGCTCGAT
GGCTTGCTGA CGACCGCCCA CGAAATCGGG CACGCGATGA ATTCCGTCTA TACAAACAAA
ACGCAGCCAT ACCATTATTC CAGGCAATCG ATTTTTACCG CGGAAGTCGC TTCCACCGCC
AACGAATGGC TGATGATGGA TTATTTCTTA AAGCAAGCAA AAACGGACGA AGAAAAGCTG
TATTTGCTCA ACCAGCAAAT CGATCAAATT CGCGGCACAT TATATACGCA AGTAATGTAT
TCCGAATTCG AACAAGCGAT TCATGACAAA GTGCGGCAAG GCGGGAGCTT AACCGCAGCC
GAACTGAACG AGCTTTGGCT TCGCCTGTTG AAAAAATATT ACGGCCCTGC CTACGCCGCC
GATCCGGAAG CTGCGCGCGG CTGGCTGCGC ATTCCGCATT TTTATGATGC GTTTTACGTA
TACAAATACG CAACCTCGCT CGCCGCTTCC TTTGAGCTTG TCAAGCAAAT GAAAGCGGAT
GAAACCGGAG AGGCGACTAA ACGCTATTTG CAGTTTTTGC GCTCTGGAAC ATCCGACGAC
CCGATCCGCC TTTTACAAAA AGCGGGAGTG GATATGACAT CACCGAAGCC GCTCGAGAAC
CTGCTTTCTT ATTTCGATTC GCTCGTCCGC GAAATGGAAC AGCTGTTGAA AAAACAAGGA
AGACTGTAA
 
Protein sequence
MKKRSYVWLI VILLLIPLRT DAEETKINAK YQWNLADIYV SEANFKRDYQ AVADALPKLS 
SYEGKLARAS NVAKLFALNE RIARKLEKLS LYAHLKQDLN IEDKTAAHLK AQVETLISDY
AAKTAFIEPE LLSLSERTLA KLQKSKPLKP YRYYFEELRE RKKHTLSKRE EQLLAKLSPI
MSDPENIYNN AARGDYDPPS VRTPDGKTVS LTDENYTKAL EHPDRNYRKR AFQTRFQSYE
TIENTSAATL YASVKADELY AKARKYKSGL DAALSADDVP KQVFTNLIST VNTHLPSLHR
YVELRKKALG VDRVHTYDMS VPLVEETIAK KMKFPFETAQ SLILEGLKPL GDDYIQNVRR
AFEQRWIDVY PRPKKYTGGY NTGAYDTHPF ILLNYDGSLD GLLTTAHEIG HAMNSVYTNK
TQPYHYSRQS IFTAEVASTA NEWLMMDYFL KQAKTDEEKL YLLNQQIDQI RGTLYTQVMY
SEFEQAIHDK VRQGGSLTAA ELNELWLRLL KKYYGPAYAA DPEAARGWLR IPHFYDAFYV
YKYATSLAAS FELVKQMKAD ETGEATKRYL QFLRSGTSDD PIRLLQKAGV DMTSPKPLEN
LLSYFDSLVR EMEQLLKKQG RL