Gene Information |
Locus tag | GWCH70_3036 |
Symbol | |
ID | 7977400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3048807 |
End bp | 3051206 |
Gene Length | 2400 bp |
Protein Length | 799 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644799830 |
Product | flagellin domain protein |
Protein accession | YP_002950969 |
Protein GI | 239828345 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTA ATCATAACAT TCAAGCATTA AACGCCTATC GCAACTTAGC GGCAAACCAA TTGAGCGTCT CTAAAAACTT AGAAAGATTA TCATCTGGTT TGCGAATTAA CCGCGCAGCA GACGATGCGG CTGGTCTTGC CATCTCTGAA AAAATGCGCT CGCAAATCCG CGGACTTCAA ATGGCAGAAC GCAATGCGTT AGACGCAATC TCGCTCATCC AAACGGCGGA AGGGGCTTTA AACGAAGTAC ACAGCATCTT GCAGCGCATG AGAGAATTGG CAGTGCAGGC GGCGAATGAT ACAAATACGG TAGAGGATCG AGAAGCGATT CAAAAGGAAA TCAATCAGTT GACATCTGAA ATTAACCGGA TTGCCAACTC CACCGAATAC AATACGAAAA AATTGCTCAA TGAGTCCGTT TCCAGCCAAG CGAGAAGTGT GCAAATTGCT CAAGGTAGCT TTTCAGCAGG CGGCACAACA ATCGATGGAG CGAGCCTGAA GCTGGACCCG GAAAGCACGA TGGTGGCTGG AAATTATAAA GTGAAAATCG AAGATGTTGC TACGAAGTCA ATACAAAGCA CAGGACCAGC TGCCGGCGGG GTGCAACAAA TCACACTGGA TGCAACATCA AACTTGACGG TCGGTAATAC ATACGGGGTG AAAATCGAAC AGCAAGATGT GAAAGTTATT ACAAACAATC CGGGGTCGTC GCCGGCCATT GATACGGTAT CGATCGCGCC GAACTCCCCG TTAAGCGATG GAACAGTGAA TCTCGTGATC CGTCGCACGA ATGCTGTGAA CAACTTCCAA TCAGGTGGTA CGGGTCTGAC AGGAGTGCGA GTTTCGGCCT CTGCAGATCT AGATCCAAAT AATACGTTTG TGATAGAAAC AAAGTCCCAA GCTTCTGGTG TGCAAAACGG GACGGCCAAT GGTCTATATA TTACGAATGT CAATATTGAC TCTAAAAAAT TTTCACTAAA AGGTGAAGAT TTCCGGCTCG TCTATACAGA AACGTCACCA GAATCCGGCC AGTTCACGGT AACTTTGCAA GATAAAAACG GGAATGATTT ATCCCAAGCG GTCATCTTAG ATAATAACCA ACAAAACTAC GAGTTTTATG ATTTATCTGG CAACAAACTT GGTGTTTCCT TTACGACGAT TTCCGGTATT AACAGAACAC TAGCGGCAGC AAGAGCAAGT GAAAACGGAA AATACAATGA ATTTGATATT AATATTGCGC TTTCTTTAAA GAAAAATGGC ACGCAAATCG GAACGGCTGT CATCACTCCG ACAGATGGGA CGGCTGGCAA TGTGACCATT AACGGTTCGG ATGGGATTAG CTACACAATC AATCATAACG GATATGCCAG CACGAACATC AATCGAACAG CAACGTTCGG CATACAGAGT ATGCTTTCAT ACTCGACCGA TGGAGCGACG TTCACGACGT TTAATGCAGG TGACAATCTG ACCTTAAATG GCGGCATTTT CGTTGATACA GCCGACAATA TCACTGATTA TACGACAGGG GATACGACCG TGCAGATTGA CGTTGGCACG ACGTCATCGT ATACAGCCAC GCTGGTTGAT GACGTGGGAA ACGCACTGTC CGGCGTACCG ATGCTCGTCG TTGACAATAA CGGCGTGTAT TCGTTTGGCA GCGGCACAGG CGTTTCGTTT ACAACTGGCA CGCTGTCGGC TGGAACACGT ACATTTACCG TAGGGGCAAC GGTTACGACG AAAGCGACAT TAAGCACGAA TGGCGTTGAG GTTGAAACGT TAAACAATAT TGCCCCGAAC 
ACGACCCTGC GTTTTCATAA CGGTGATTTA ACCATGAACA TTGGGGCGCT AACTAACGGT GAAGCTTCTT TCACCATTAC AGGGGGAACT ACGGATCAAT CACTGAAAAT CCAAATTGGC GCTAACGAAG GTCAAACGTT AAGCATTGGT ATTGATGACA TGCGTTCGCT CGCTTTGAAA CTATCCAGCG ATACAGCTGG TGCGATAGTT TCTTTCCAAA ATCATTTAGG TCAAATCATT ACTGCATATT ATGCAAGCAT CGCTGATGTG AATAACGGAA ATTCAAACGA ATTTGCTCTT GATGTTACTA CTCATGAGAA AGCAGCGGCA GCGGTTCAAG TATATGACTA TGCTATTTCA AGAGTTTCAG CACAGCGTTC GAATTTAGGC GCCGTCCAAA ACCGCCTAGA ACATACCATC AACAACTTAA AAACGGCGAA CGAAAACTTA ACGTCCGCAG AATCACGCAT CCGCGATACG GATATGGCAA TGGAGATGGC AGAGTTTACG AAAAACAATA TCTTAACCCA AGCGGCACAA GCGATGCTCG CGCAATCAAA CCAGTTGCCG CAAGGAATTT TACAATTATT GAAAAGCTAA
Protein sequence | MRINHNIQAL NAYRNLAANQ LSVSKNLERL SSGLRINRAA DDAAGLAISE KMRSQIRGLQ MAERNALDAI SLIQTAEGAL NEVHSILQRM RELAVQAAND TNTVEDREAI QKEINQLTSE INRIANSTEY NTKKLLNESV SSQARSVQIA QGSFSAGGTT IDGASLKLDP ESTMVAGNYK VKIEDVATKS IQSTGPAAGG VQQITLDATS NLTVGNTYGV KIEQQDVKVI TNNPGSSPAI DTVSIAPNSP LSDGTVNLVI RRTNAVNNFQ SGGTGLTGVR VSASADLDPN NTFVIETKSQ ASGVQNGTAN GLYITNVNID SKKFSLKGED FRLVYTETSP ESGQFTVTLQ DKNGNDLSQA VILDNNQQNY EFYDLSGNKL GVSFTTISGI NRTLAAARAS ENGKYNEFDI NIALSLKKNG TQIGTAVITP TDGTAGNVTI NGSDGISYTI NHNGYASTNI NRTATFGIQS MLSYSTDGAT FTTFNAGDNL TLNGGIFVDT ADNITDYTTG DTTVQIDVGT TSSYTATLVD DVGNALSGVP MLVVDNNGVY SFGSGTGVSF TTGTLSAGTR TFTVGATVTT KATLSTNGVE VETLNNIAPN TTLRFHNGDL TMNIGALTNG EASFTITGGT TDQSLKIQIG ANEGQTLSIG IDDMRSLALK LSSDTAGAIV SFQNHLGQII TAYYASIADV NNGNSNEFAL DVTTHEKAAA AVQVYDYAIS RVSAQRSNLG AVQNRLEHTI NNLKTANENL TSAESRIRDT DMAMEMAEFT KNNILTQAAQ AMLAQSNQLP QGILQLLKS