Gene GWCH70_0538 details


Gene Information

Locus tag: GWCH70_0538
Symbol:
ID: 7979393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 606629
End bp: 608497
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 39%
IMG OID: 644797534
Product: putative transcriptional regulator
Protein accession: YP_002948708
Protein GI: 239826084
COG category: [K] Transcription
COG ID: [COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen
TIGRFAM ID:
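
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record does not state either convention, but both are consistent with the listed values.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon is a
    stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1  # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(606629, 608497)
print(gene_len, protein_len)  # 1869 622
```

With the coordinates from this record, the result matches the listed gene length (1869 bp) and protein length (622 aa).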


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.141611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGCGA AGGATTTGTT GCGTATTATA GAAGATGGAG AAAATGCCGA ATTGGAATGT 
AAAGCAGCCA AAGGCGGTTT ACCGAAAGAT GTTTGGGAAA CGTATTCTGC TTTTGCGAAT
ACAAATGGCG GTATTATTTT GCTTGGGGTC GAGCAAAAAG GACAAGAGTT TTTCCCAGTC
GATGTTGATG CAAAAAAGTT AGTCAAAGAC TTTTGGGATG GAATTAATAA TCCCCAAAGG
ATTAGTAAAA ATATCTTAGT GGATCGAAAT GTAGAAATTA TTGAAGTTGA TGGGAAAGAA
ATCGTGAAAA TTTATGTTCC AAGAGCTTCA CGTGAGGACA GGCCAATTTA TATTGGTCAT
AATCCTTTTA CAGGTACATA TCGCCGCAAT TATGAAGGAG ACTATAAATG TACAAAAGAA
GAAGTACAGC GAATGATAGC GGACCAATCG CCAATTCCTC AAGACAGCAA AGTTCTTCCC
AATTACGGTT TAAATGATTT GAATATGGAA TCGGTGAGAA GTTATCGAAA TCGCTTTTCT
TCTTTGAAAC CAGATCACCC TTGGAATGGT TTAGAACTGA AGGAATTTTT ATATAAAATT
GGAGCTTTGG GAAAGCTGCG CGAAACTAAC GAGGAAGGGC TAACATTAGC CGGGTTGCTC
ATGTTTGGGG AAGAGCGCAG CATTACAGAG TATTTACCGC AGTATTTCTT AGAATATCGG
GAAAAGAGTT TGGATGTGCC GGGGGAGAGA TGGACAGATC GGATTATTTC TTCCGACGGC
ACTTGGTCGG GGAATTTGTA TGATTTTTAT TTTAAAGTGA TCCGCAGACT AACGGATGAT
TTAAACATTC CTTTTCAAAT GGAGGGGCTT TTTCGCAAGG ATGATACAAG GGTGCACGCT
GCTCTTCGTG AGGCTTTAGT CAATACGTTG GCTCATGCCG ATTACTACGG GCAACGCGGA
ATTGTTATCG AAAAGGAAAA GATGTTATTT CGATTTTCCA ACCCGGGTGT TCTTCGGATT
CCTTTGCAAC AAGCACTGAA AGGTGGAATA AGTGATCCGA GAAATCCAAC GATTTTTAAA
ATGTTTATGT TAATCGGACT TGGTGAACGG GCAGGGTCGG GGATTGAGAA TATTCATTTG
GCGTGGAAAG AACAACATTG GACTGCCCCA GAATTAATTG AAGAATTTCA GCCGGATCGC
ACGATTCTTA CGTTGCGAAC AACTTCATTA TTACCGCAGG AAAGCGTCGA TTTTCTAAAA
GCAGTTCTTG GGAAACATTA TAAGTTTTTA ACCAATGATG AAGTATTAAT TTTAGTGACG
GCACATCAAG AAAATTATGT GACCAATACG AGATTGCAAA GCCTGACAGA TAAAAAGAGT
GATGAAATAA GCAAATTATT ATCCGTACTG GTGGAAAAAG GATATTTAGA ACCAAATGGA
CAAGGAAGAG GTACAAAATA TACGCTAACG GAAATGTTTT ATCAAAAGTC TATAGGGAAC
TCCAGATATA ATCGAATAAG CTCCGGACCT AGTGCTAATA ACTCCGGACC TAACGCTGCT
AACTCCGGAC CTAACGAAAA CGAACAAACA AAAGATCATG AAAAAGTGCT ATTGGATATT
TCTGAATTAG CAAGGAAGAA AAAGAGATTA CATCCTTCCG AAATGGATGA GATCATTTTA
AATCTTTGTG CAATTAAGCC GTTAACGCTC AAAGAATTAA GTCAATTACT GAATAGGCAG
ATGGATCCTC TTCGCAAAAA GTACATTTCA CGATTGCTTA GGGAAGGAAA GTTAGAACTG
CTATATCCGG AGCAAGTAAA TCATCCGAAA CAAGCATATA TGACACGATC TCTTTTTAAC
CAGCATTAG
 
Protein sequence
MDAKDLLRII EDGENAELEC KAAKGGLPKD VWETYSAFAN TNGGIILLGV EQKGQEFFPV 
DVDAKKLVKD FWDGINNPQR ISKNILVDRN VEIIEVDGKE IVKIYVPRAS REDRPIYIGH
NPFTGTYRRN YEGDYKCTKE EVQRMIADQS PIPQDSKVLP NYGLNDLNME SVRSYRNRFS
SLKPDHPWNG LELKEFLYKI GALGKLRETN EEGLTLAGLL MFGEERSITE YLPQYFLEYR
EKSLDVPGER WTDRIISSDG TWSGNLYDFY FKVIRRLTDD LNIPFQMEGL FRKDDTRVHA
ALREALVNTL AHADYYGQRG IVIEKEKMLF RFSNPGVLRI PLQQALKGGI SDPRNPTIFK
MFMLIGLGER AGSGIENIHL AWKEQHWTAP ELIEEFQPDR TILTLRTTSL LPQESVDFLK
AVLGKHYKFL TNDEVLILVT AHQENYVTNT RLQSLTDKKS DEISKLLSVL VEKGYLEPNG
QGRGTKYTLT EMFYQKSIGN SRYNRISSGP SANNSGPNAA NSGPNENEQT KDHEKVLLDI
SELARKKKRL HPSEMDEIIL NLCAIKPLTL KELSQLLNRQ MDPLRKKYIS RLLREGKLEL
LYPEQVNHPK QAYMTRSLFN QH
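
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction, which is how a value like the listed GC content could be checked. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "ATGGACGCGA AGGATTTGTT GCGTATTATA GAAGATGGAG AAAATGCCGA ATTGGAATGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives a fraction that can be compared with the GC content field.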