Gene GWCH70_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0035 
Symbol 
ID7979424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp47068 
End bp48288 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content40% 
IMG OID644796995 
Product3D domain protein 
Protein accessionYP_002948243 
Protein GI239825619 
COG category[S] Function unknown 
COG ID[COG3583] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.686039 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACCCA ATATGAAGAA GATTTCCGGT TCATTGAGGA AAAATTTCAC TGTTACTGCT 
AGTAGTTTTA TAGCTTTGTC AGCAACAACG GGATTTGCTG GGTACGAAAT AGCCAAAGAT
GATGTGATAC TAACGGCAAA CGGAAAAAAA CAAGAGATTC GTACTCATGC AAAAACGGTA
AAAGAAGTAT TACAGGAGCA AAATATTAAG CCAAGGAAAG AAGATCGCGT CTATCCATCG
TTAGATACGC CGATTACAGA TGATTTAAAC ATCGTTTGGG AAGCGAGCAA GAAAGTCACT
TTGACAGTAG ATGGAAAAAA ACAAGAAATA TGGACGACTG CAAAGAACGT TGCCGAGCTA
TTAAACTCAC AACATATTAA AATCGAAAAG CACGACAAAA TTGCCCCAGC ACCAAATACA
AAAATTAAAA AGGGAATGAA AGTCAACATA GAAAAGGCAT TCCCAGTTCA ATTAAATGTC
GGCGGTAAAC AGCAACAAGT GTGGGCAACT TCGACTACTG TCGCTGACTT TTTAAAACAA
CAAAATGTAA AATTAGATGA ACTCGACCGT GTAGAGCCAT CTTTACAGGA CAAACTAAAA
GAAAATATGG TCGTAAAAGT GATTAAAGTT GAAAAAGTCA CCGATGTAGT GGAAGAACCA
GTTGACTTTG CAGTCGTCAC TAGACAAGAT GCACAATTGC CAAAAGGAGA ACAACGTATT
ATTAGCCCTG GAGAAAAAGG GCGAGTTTCG AAAAAGTATG AAGTAGTGCT CGAAAATGGA
AAAGAAGTAT CGCGGAAATT AATTGAGACA AAGATGATAA AAGAAAGTAA AAATCGGATT
GTTGCTATCG GTACGAAAGT GGCTAAAAGC CGACCTGCTC ATACGCAAAG CCGCTCTGTT
CAGACCGTAT CGCGCGGCCA AAAGCATGCA GCGCGAGAAA TTTATGTTGT TGCTAGCGCC
TATACTGCTT ATTGTCAAGG ATGTTCAGGA ACAACGAGAA TGGGAATTAA CTTGCGTGCA
AATCCTTCTG CAAAAGTAAT CGCAGTGGAT CCAAACGTTA TTCCGCTTGG ATCAAAAGTG
TACGTAGAAG GATACGGATA TGCTATAGCT GCTGATACAG GATCAGGTAT TAATGGTTAT
GAAATTGACG TGTTTATTCC AAAGCAATCG GATGCACTTC GTTGGGGTAG AAAGCGTGTG
AAGGTGAGAA TTCTTCAATA A
 
Protein sequence
MLPNMKKISG SLRKNFTVTA SSFIALSATT GFAGYEIAKD DVILTANGKK QEIRTHAKTV 
KEVLQEQNIK PRKEDRVYPS LDTPITDDLN IVWEASKKVT LTVDGKKQEI WTTAKNVAEL
LNSQHIKIEK HDKIAPAPNT KIKKGMKVNI EKAFPVQLNV GGKQQQVWAT STTVADFLKQ
QNVKLDELDR VEPSLQDKLK ENMVVKVIKV EKVTDVVEEP VDFAVVTRQD AQLPKGEQRI
ISPGEKGRVS KKYEVVLENG KEVSRKLIET KMIKESKNRI VAIGTKVAKS RPAHTQSRSV
QTVSRGQKHA AREIYVVASA YTAYCQGCSG TTRMGINLRA NPSAKVIAVD PNVIPLGSKV
YVEGYGYAIA ADTGSGINGY EIDVFIPKQS DALRWGRKRV KVRILQ