Gene GWCH70_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3036 
Symbol 
ID7977400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3048807 
End bp3051206 
Gene Length2400 bp 
Protein Length799 aa 
Translation table11 
GC content46% 
IMG OID644799830 
Productflagellin domain protein 
Protein accessionYP_002950969 
Protein GI239828345 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTA ATCATAACAT TCAAGCATTA AACGCCTATC GCAACTTAGC GGCAAACCAA 
TTGAGCGTCT CTAAAAACTT AGAAAGATTA TCATCTGGTT TGCGAATTAA CCGCGCAGCA
GACGATGCGG CTGGTCTTGC CATCTCTGAA AAAATGCGCT CGCAAATCCG CGGACTTCAA
ATGGCAGAAC GCAATGCGTT AGACGCAATC TCGCTCATCC AAACGGCGGA AGGGGCTTTA
AACGAAGTAC ACAGCATCTT GCAGCGCATG AGAGAATTGG CAGTGCAGGC GGCGAATGAT
ACAAATACGG TAGAGGATCG AGAAGCGATT CAAAAGGAAA TCAATCAGTT GACATCTGAA
ATTAACCGGA TTGCCAACTC CACCGAATAC AATACGAAAA AATTGCTCAA TGAGTCCGTT
TCCAGCCAAG CGAGAAGTGT GCAAATTGCT CAAGGTAGCT TTTCAGCAGG CGGCACAACA
ATCGATGGAG CGAGCCTGAA GCTGGACCCG GAAAGCACGA TGGTGGCTGG AAATTATAAA
GTGAAAATCG AAGATGTTGC TACGAAGTCA ATACAAAGCA CAGGACCAGC TGCCGGCGGG
GTGCAACAAA TCACACTGGA TGCAACATCA AACTTGACGG TCGGTAATAC ATACGGGGTG
AAAATCGAAC AGCAAGATGT GAAAGTTATT ACAAACAATC CGGGGTCGTC GCCGGCCATT
GATACGGTAT CGATCGCGCC GAACTCCCCG TTAAGCGATG GAACAGTGAA TCTCGTGATC
CGTCGCACGA ATGCTGTGAA CAACTTCCAA TCAGGTGGTA CGGGTCTGAC AGGAGTGCGA
GTTTCGGCCT CTGCAGATCT AGATCCAAAT AATACGTTTG TGATAGAAAC AAAGTCCCAA
GCTTCTGGTG TGCAAAACGG GACGGCCAAT GGTCTATATA TTACGAATGT CAATATTGAC
TCTAAAAAAT TTTCACTAAA AGGTGAAGAT TTCCGGCTCG TCTATACAGA AACGTCACCA
GAATCCGGCC AGTTCACGGT AACTTTGCAA GATAAAAACG GGAATGATTT ATCCCAAGCG
GTCATCTTAG ATAATAACCA ACAAAACTAC GAGTTTTATG ATTTATCTGG CAACAAACTT
GGTGTTTCCT TTACGACGAT TTCCGGTATT AACAGAACAC TAGCGGCAGC AAGAGCAAGT
GAAAACGGAA AATACAATGA ATTTGATATT AATATTGCGC TTTCTTTAAA GAAAAATGGC
ACGCAAATCG GAACGGCTGT CATCACTCCG ACAGATGGGA CGGCTGGCAA TGTGACCATT
AACGGTTCGG ATGGGATTAG CTACACAATC AATCATAACG GATATGCCAG CACGAACATC
AATCGAACAG CAACGTTCGG CATACAGAGT ATGCTTTCAT ACTCGACCGA TGGAGCGACG
TTCACGACGT TTAATGCAGG TGACAATCTG ACCTTAAATG GCGGCATTTT CGTTGATACA
GCCGACAATA TCACTGATTA TACGACAGGG GATACGACCG TGCAGATTGA CGTTGGCACG
ACGTCATCGT ATACAGCCAC GCTGGTTGAT GACGTGGGAA ACGCACTGTC CGGCGTACCG
ATGCTCGTCG TTGACAATAA CGGCGTGTAT TCGTTTGGCA GCGGCACAGG CGTTTCGTTT
ACAACTGGCA CGCTGTCGGC TGGAACACGT ACATTTACCG TAGGGGCAAC GGTTACGACG
AAAGCGACAT TAAGCACGAA TGGCGTTGAG GTTGAAACGT TAAACAATAT TGCCCCGAAC
ACGACCCTGC GTTTTCATAA CGGTGATTTA ACCATGAACA TTGGGGCGCT AACTAACGGT
GAAGCTTCTT TCACCATTAC AGGGGGAACT ACGGATCAAT CACTGAAAAT CCAAATTGGC
GCTAACGAAG GTCAAACGTT AAGCATTGGT ATTGATGACA TGCGTTCGCT CGCTTTGAAA
CTATCCAGCG ATACAGCTGG TGCGATAGTT TCTTTCCAAA ATCATTTAGG TCAAATCATT
ACTGCATATT ATGCAAGCAT CGCTGATGTG AATAACGGAA ATTCAAACGA ATTTGCTCTT
GATGTTACTA CTCATGAGAA AGCAGCGGCA GCGGTTCAAG TATATGACTA TGCTATTTCA
AGAGTTTCAG CACAGCGTTC GAATTTAGGC GCCGTCCAAA ACCGCCTAGA ACATACCATC
AACAACTTAA AAACGGCGAA CGAAAACTTA ACGTCCGCAG AATCACGCAT CCGCGATACG
GATATGGCAA TGGAGATGGC AGAGTTTACG AAAAACAATA TCTTAACCCA AGCGGCACAA
GCGATGCTCG CGCAATCAAA CCAGTTGCCG CAAGGAATTT TACAATTATT GAAAAGCTAA
 
Protein sequence
MRINHNIQAL NAYRNLAANQ LSVSKNLERL SSGLRINRAA DDAAGLAISE KMRSQIRGLQ 
MAERNALDAI SLIQTAEGAL NEVHSILQRM RELAVQAAND TNTVEDREAI QKEINQLTSE
INRIANSTEY NTKKLLNESV SSQARSVQIA QGSFSAGGTT IDGASLKLDP ESTMVAGNYK
VKIEDVATKS IQSTGPAAGG VQQITLDATS NLTVGNTYGV KIEQQDVKVI TNNPGSSPAI
DTVSIAPNSP LSDGTVNLVI RRTNAVNNFQ SGGTGLTGVR VSASADLDPN NTFVIETKSQ
ASGVQNGTAN GLYITNVNID SKKFSLKGED FRLVYTETSP ESGQFTVTLQ DKNGNDLSQA
VILDNNQQNY EFYDLSGNKL GVSFTTISGI NRTLAAARAS ENGKYNEFDI NIALSLKKNG
TQIGTAVITP TDGTAGNVTI NGSDGISYTI NHNGYASTNI NRTATFGIQS MLSYSTDGAT
FTTFNAGDNL TLNGGIFVDT ADNITDYTTG DTTVQIDVGT TSSYTATLVD DVGNALSGVP
MLVVDNNGVY SFGSGTGVSF TTGTLSAGTR TFTVGATVTT KATLSTNGVE VETLNNIAPN
TTLRFHNGDL TMNIGALTNG EASFTITGGT TDQSLKIQIG ANEGQTLSIG IDDMRSLALK
LSSDTAGAIV SFQNHLGQII TAYYASIADV NNGNSNEFAL DVTTHEKAAA AVQVYDYAIS
RVSAQRSNLG AVQNRLEHTI NNLKTANENL TSAESRIRDT DMAMEMAEFT KNNILTQAAQ
AMLAQSNQLP QGILQLLKS