Gene GWCH70_2049 details


Gene Information

Locus tag: GWCH70_2049
Symbol:
ID: 7977285
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2110175
End bp: 2111383
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 35%
IMG OID: 644798867
Product: hypothetical protein
Protein accession: YP_002950037
Protein GI: 239827413
COG category:
COG ID:
TIGRFAM ID:
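
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 2110175
END_BP = 2111383


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1209 402
```

Under these assumptions the computed values agree with the listed gene length (1209 bp) and protein length (402 aa).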


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000426052
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAGAC AACAAAAAGA TAGTCAATCA GTGGAGTCAA ATGCTTATCT AAAATCGTTA 
GTGGGTAAAA GAATAAAAGT ATACAGAGGG GGACCAGAAT CTTGCGAAGG TAGGCTTTTA
GATGTTCAAT CCGATTATGT TGCACTAATG CCAGAGCAAT CAAACGACAA TAAACAAAAT
AACAACGCAA ACAATAACAA CAACGCAAAA AAGGACAACG CAATTATTTA CTATAACTTA
AGACATGTCC AAAGCATTAG TGAAAATTTA AAAGCTAACT CTGTTGAATC TAACTTAAAA
CATGTCCAAA ACATCGTCCA AAGCATCAGT GAAAATATTT ATAGCAACTT AAGACAGGTC
CAAAGCATTA GTGAAAATTT AATCGCTAAC TTAAAACATG TTCAAAGCAT CGATAAAAAT
TCAACAGCTA ACTCTGTTGA ATCTAACTTA AAACTTGCCC AAAACGTCAG TGAAAATTCA
AAAGCGAACT CTATTGAATG GTCGATCGAT TTTGATAATC ATCCAGAATT GGTGTCAGTG
AATAATTTCA CAGAATTATT AAAAAATTTA ACTGGCAGCA TGGTCAAGGT GAATAAAGGC
GGTCCTGAAT CTAAAAAAGG AATAGTGCTC CTCGTTGCTG GTGATTATAT GGGCCTCTTA
ACGGAAGACG ATGGCATTGT ATTTTATAAT ACAACTCACA TCAAAAGTAT AAGCGTGCAA
AATAGAAGCC AAAATATCGA TCAAAATATC GATCAAAGCA CTCCTCTCAG CAATTCCCCT
ATCCATTATG ATAATTATTT TGATGACATC CATGCACAAA ACTTCCTTGA ATTATTTGAT
TATTTTGCTT ATAAATGGGT CTCGATTAAC CGCGGCGGTC CTGAAGCAGC AGAAGGAATT
CTTGTGCAAG AAGAAGGAGA ACATTATACG TTAGTGAACA ATGATGAAGT CATTCGAATT
TACCCTTATC ACATTAAAAG CATCAGTATT GGTACAAAAG GCTTTCTCAA ACAACAACAG
CAGCAACAAA ATAATAATGA AGCTGCTGAG AATGAAAACT CCCAAGATGT GAATATGACC
AACAAAGTAG AAGATAACAG AACAGCGGGC AGAGAACAGC GTTCAAGTCG AAGAAGCTCT
CCACAGGAAA CCATCGTTAA GGAAACAATC GTTAAAACGA TTGATTATAT TTGGGATCCG
AAACGATAA
 
Protein sequence
MIRQQKDSQS VESNAYLKSL VGKRIKVYRG GPESCEGRLL DVQSDYVALM PEQSNDNKQN 
NNANNNNNAK KDNAIIYYNL RHVQSISENL KANSVESNLK HVQNIVQSIS ENIYSNLRQV
QSISENLIAN LKHVQSIDKN STANSVESNL KLAQNVSENS KANSIEWSID FDNHPELVSV
NNFTELLKNL TGSMVKVNKG GPESKKGIVL LVAGDYMGLL TEDDGIVFYN TTHIKSISVQ
NRSQNIDQNI DQSTPLSNSP IHYDNYFDDI HAQNFLELFD YFAYKWVSIN RGGPEAAEGI
LVQEEGEHYT LVNNDEVIRI YPYHIKSISI GTKGFLKQQQ QQQNNNEAAE NENSQDVNMT
NKVEDNRTAG REQRSSRRSS PQETIVKETI VKTIDYIWDP KR
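
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds a codon table from the standard amino-acid ordering (translation table 11 uses the same codon-to-residue assignments as the standard code and differs in the permitted start codons) and translates the first 30 bases and the final 9 bases of the gene. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
from itertools import product

# Codon-to-residue assignments in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_block = "ATGATAAGAC AACAAAAAGA TAGTCAATCA"  # first 30 bases of the gene
last_block = "AAACGATAA"                          # final 9 bases of the gene

print(translate(first_block))  # MIRQQKDSQS
print(translate(last_block))   # KR
```

Both fragments match the corresponding ends of the protein sequence listed above.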