Gene Information |
Locus tag | BT9727_1547 |
Symbol | hag |
ID | 2857509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | - |
Start bp | 1625537 |
End bp | 1626640 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637512977 |
Product | flagellin |
Protein accession | YP_035879 |
Protein GI | 49477319 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
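The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1625537, 1626640)
print(length)                  # 1104
print(protein_length(length))  # 367
```

Both results agree with the Gene Length and Protein Length fields of this record.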
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000000000229221 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTA ATACAAACAT TAATAGCATG CGTACTCAAG AGTACATGCG CCAAAACCAA GCTAAAATGA GTAACGCTAT GGACCGTTTA TCAAGCGGTA AACGTATCAA CAACGCTTCT GACGATGCAG CAGGTCTTGC AATCGCAACT CGTATGCGTG CACGTGAAAG TGGATTAGGC GTAGCAGCTA ACAACACTCA AGATGGTATG TCATTAATCC GTACAGCTGA CTCAGCTTTA AACTCTGTAT CTAACATCTT ACTTCGTATG CGTGACATCG CTAACCAATC TGCGAACGGT ACGAACACAG CTGACAACCA ACAAGCTCTA CAAAAAGAAT TTGGTCAATT AAAAGAACAA ATTTCTTACA TTGCAGACAA TACAGAATTT AATGATAAAA CTTTATTAAA GGCTGACAAT AGCGTTAAGA TACAAACTTT AGATTCTGCT GACACTAACA AACAAATTTC TATTGATTTA AAAGGTGTTA CTCTTAATCA GTTAGGTCTT GACACTGTAA ATATCGGTTC TGAAACACTA TCAGCTGAAA GCCTAAATGT AGCAAAAGCT ACAATGGCTC GACTTGTAAA GGCAGATCAG AATGCTGATC CATCTACTTT TGCACTTGAT GTTAACACTG CAAAAGAATC TTTTGATAAA ATTAAAGGCT TTATCACTAA TAAAACTAAC GTTCAAAATG TAGAAAATGC ATTTAATGAT TACACAGTAG CTGATCCAGC TGATAAAGCT GATAAAGCTG ACGCTATTCA AGCTGCATTT AACACAGCAA TTACTGGTCT TACAGCTGGT ACACCAAACA CATCAAATCC ATCTTCAGCA GTGGATGCAA TTGATGCGGC TCTAAAAACA GTTGCTTCTA ACCGTGCAAC TCTAGGTGCT ACATTAAACC GCCTTGACTT TAACGTTAAC AACCTTAAGA GCCAATCTGC TAGCATGGCT TCAGCTGCTT CTCAAATCGA AGACGCTGAC ATGGCGAAAG AAATGTCTGA AATGACTAAG TTCAAAATCT TGAACGAAGC TGGTATCAGC ATGCTTTCTC AAGCAAACCA AACTCCACAA ATGGTTTCTA AATTATTACA ATAA
|
Protein sequence | MRINTNINSM RTQEYMRQNQ AKMSNAMDRL SSGKRINNAS DDAAGLAIAT RMRARESGLG VAANNTQDGM SLIRTADSAL NSVSNILLRM RDIANQSANG TNTADNQQAL QKEFGQLKEQ ISYIADNTEF NDKTLLKADN SVKIQTLDSA DTNKQISIDL KGVTLNQLGL DTVNIGSETL SAESLNVAKA TMARLVKADQ NADPSTFALD VNTAKESFDK IKGFITNKTN VQNVENAFND YTVADPADKA DKADAIQAAF NTAITGLTAG TPNTSNPSSA VDAIDAALKT VASNRATLGA TLNRLDFNVN NLKSQSASMA SAASQIEDAD MAKEMSEMTK FKILNEAGIS MLSQANQTPQ MVSKLLQ
|
| |
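The sequences above are shown in space-separated groups of 10 characters, so they need to be cleaned before any calculation. As an illustration, here is a small sketch that strips the spacing and computes GC content; the helper names are hypothetical, and the demo uses only the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that splits the sequence into groups."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence above, used as a short demo.
fragment = clean_sequence("ATGAGAATTA ATACAAACAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 20.0
```

Passing the complete Gene sequence field to `clean_sequence` should yield a string whose length matches the Gene Length field, and `gc_percent` can then be compared against the reported GC content. That comparison is not computed here.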