Gene GWCH70_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1471 
Symbol 
ID7976917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1546045 
End bp1547115 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content44% 
IMG OID644798375 
Productprotein of unknown function DUF871 
Protein accessionYP_002949548 
Protein GI239826924 
COG category[S] Function unknown 
COG ID[COG3589] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTATC TATCTTTCTA CTTATCAGAA CAAACGGATC AAATCGAAAA AAGATTTGCA 
CAGGCAAATT TGTTCGGATG TCGGGAACTA TTTACATCGC TTCATATTCC CGAAGACGAT
CTTACCTTTT ATCGCAAGCG ATTACAGGAG ATCGGGCAGT TAGCGAGAAA ATACGGAGTC
GGGATCATCG CTGACGTTAC CCCGGCTTCT TTATCCAAAA TTGGGGTGAA TGGAGACAAT
CTTGATTTGC TTTTTGATGG AGGAATTATT GGACTGCGGC TTGATGATGG ATTTTCTATG
AAAGAAGCAG CAACGTTTTC TCACCGGATG AAAGTTGTTT TCAATGCGAG CACGATGACG
GAAGAAGAAT GCGACGATCT TGCTTTTTGT GACGTGAACT GGAATCAAAT TGAAGCGTGG
CATAATTTTT ATCCACGTCC AGAAACAGGA TTATCAAAAG AATTAGTCAT TCAAAAAAAC
AAGATTTTAC GCCGAAAAGG AATTCGAACG CTTGCCGCGT TTATTCCGGG AAATAAAGAA
AAACGAGGTC CGCTCCATCA AGGGCTTCCT ACGCTAGAGG CTCATCGGTA TATGGATCCT
CTTTGCGCGT ACGTGGAATT AGTGCGGGAT TGTGAGGTAG ATAAAGTATT TGTGGGCGAT
GGCGGGATGA CAGATAATGT GCTTGTGCGA ATGAAGGAGT TTCGGGATGG GGTTATTCCG
CTGCGCTACC GGCCGCTTGT GCAACAACAT GAACTGCTTT CCATGGTCGA AACGGTTCAA
ACGAATCGGC GTGACGCGGC AAGGGATGTC ATTCGATCAC TGGAATCCCG CTTATCTTTC
TCATGGCCAA AGCATTTATT AGCACCAGCA TGTACAATCG AAAGGCAAAA AGGCAGTGTT
ACGATGGATA ATATTCGATA TGGACGATAT GCCGGTGAAC TGCAAATCAC ATTAACCGAT
TTGCCAGCAG ATGAGAAAGT CAATGTGATT GGACGGATTA TCAAAGACGA TCTTCCTCTC
CTTGCGTATG TCAAAGGAGG ACAACAGTTT CGCCTTGTGC GGATCACATA A
 
Protein sequence
MFYLSFYLSE QTDQIEKRFA QANLFGCREL FTSLHIPEDD LTFYRKRLQE IGQLARKYGV 
GIIADVTPAS LSKIGVNGDN LDLLFDGGII GLRLDDGFSM KEAATFSHRM KVVFNASTMT
EEECDDLAFC DVNWNQIEAW HNFYPRPETG LSKELVIQKN KILRRKGIRT LAAFIPGNKE
KRGPLHQGLP TLEAHRYMDP LCAYVELVRD CEVDKVFVGD GGMTDNVLVR MKEFRDGVIP
LRYRPLVQQH ELLSMVETVQ TNRRDAARDV IRSLESRLSF SWPKHLLAPA CTIERQKGSV
TMDNIRYGRY AGELQITLTD LPADEKVNVI GRIIKDDLPL LAYVKGGQQF RLVRIT