Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3024 |
Symbol | |
ID | 7977389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3039757 |
End bp | 3040776 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644799819 |
Product | flagellin domain protein |
Protein accession | YP_002950958 |
Protein GI | 239828334 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTA ACCACAATAT TGCGGCGTTG AACACGTATC GTCAATTAAC GATCGGTCAA AACGCAGCAA CGAAAAATAT GGAAAAACTT TCTTCCGGTC TTCGCATTAA CCGCGCGGGC GACGATGCGG CAGGGCTTGC GATTTCGGAA AAAATGCGTG GGCAAATTCG TGGGTTGGAG CAAGCGTCAC GCAATGCACA GGACGGAATC TCGCTCATCC AGACAGCAGA AGGGGCTTTA AATGAAGTTC ATTCTATTCT TCAACGTATG CGTGAATTGG CTGTTCAAGC TGCGAATGAC ACGAATACAG GTACTGACCG TGACGAGATT CAAAAAGAAA TTAATCAATT AACTTCTGAA ATTAATCGTA TCGGTAATAC AACTGAGTTC AATACTCAAA AATTATTAAA CGGAACAAGA ACTACTACTC CTTTATCTAC ACCTGCAGGG GCAACACAGA ATGGTGGTAA TGATCCACAA TCTGCAGGGG CAACACAGAA TGGTGGTAAT GATCCACAAT CTGCAGGGGC AACACAGAAT GGTGGTAATG ATCCACAATC TGCAGGGGCA ACACAGAATG GTGGTAATGA TCCACAATCT GCAGGGGCAA CACAGAATGG TGGTAATGAT TCACAATCTG CAGGGACAAT AGAAATAACT TTACAAATTG GTGCGAACCA AAGCCAAAGC TTAACAATTG ACATTCAGGA CATGCGTGCT AGGGCACTTG GTATCACAGG AACAGCTGGT ACTTCAGGGT TTACTGCTAC AAACACTGTC ACTGATGGTA CAAACAACGA TACTGTGGAA GCTGCACTTG ATGTATCGAA TGCAGCGAAT GCATCTGCTG CCATCACAAG GATTCAAATT GCAATCGATA GAGTATCTGC AGAGCGTTCA AAGCTAGGTG CTTATCAAAA TCGTTTAGAA CATACTATTA ATAATCTCAG CACCTCTGCA GAAAACCTAC AAGCCGCAGA ATCCCGCATC CGCGACGTAG ATTATGCCTT AGCTGCCTAA
|
Protein sequence | MRINHNIAAL NTYRQLTIGQ NAATKNMEKL SSGLRINRAG DDAAGLAISE KMRGQIRGLE QASRNAQDGI SLIQTAEGAL NEVHSILQRM RELAVQAAND TNTGTDRDEI QKEINQLTSE INRIGNTTEF NTQKLLNGTR TTTPLSTPAG ATQNGGNDPQ SAGATQNGGN DPQSAGATQN GGNDPQSAGA TQNGGNDPQS AGATQNGGND SQSAGTIEIT LQIGANQSQS LTIDIQDMRA RALGITGTAG TSGFTATNTV TDGTNNDTVE AALDVSNAAN ASAAITRIQI AIDRVSAERS KLGAYQNRLE HTINNLSTSA ENLQAAESRI RDVDYALAA
|
| |