Gene Information |
Locus tag | TK90_1400 |
Symbol | |
ID | 8807166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1491542 |
End bp | 1492669 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | flagellin domain protein |
Protein accession | YP_003460641 |
Protein GI | 289208575 |
COG category | |
COG ID | |
TIGRFAM ID | |
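
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which is not counted in the protein.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record for TK90_1400.
gene_length, protein_length = cds_lengths(1491542, 1492669)
print(gene_length, protein_length)  # 1128 375
```

The result matches the listed Gene Length (1128 bp) and Protein Length (375 aa).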
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGG TGCTGAGTGT TGCCACCAAC CTCCTGTCGC TCAATGCCCA GCGCCACCTG CGCCAGTCGC AGGCCGCCCT GCAGACGGCC TTTCAGCGCC TCGCTTCGGG CCAGCGCATC AATTCGGCGC GCGACGACGC CGCCGGGCTG GCCATCTCCG AACGCTTCAC AACCCAGATC CGCGGCCTGA ACCAGGCCAT GCGCAACGCG CAGGACGGCA TCTCGCTGGC CCAGACCGCC GAGGGCGCCC TGGGCTCCAT GACCGGCAAC CTGCAACGTA TCCGCCAGCT GGCGGTACAG GCGGCCAACG CCACCCTGTC GGCCTCGGAT CGTGGCGCCA TCGACCGCGA GGTTCGCCAG CTCCTGGCCG AGAACGACCG TATCGCGACC CAGACAACCT TTAACGGCCG TTCACTGCTG GACGGATCCA TGGGCACCGC TTTCTTCCAG ATCGGCCCCA ATGCCGGGCA GACGCTCGCC ATCGACCTCG GCGACAGCAT GCGCAACCGC TTCGTGGGTG CCATTGCCAG CGCCTATTCC GGCGAACTGG AGGCCGCCTT CGCCAGCGAT CCGCTGGAGG TGGATCAGGA TTTCTTCATC CGTGTGGGCG AGCGCGACCC GGTGGAGATC CCTGCAGGCC ACTATCGCAC GCCCCAGGAA CTGACGGCCG CGATCGACCG CGCCCTCGGA TCCACCGGAC GCGCGGTCCT GACCGACGAC GGCCGGCTGA ATCTGATCGC CCGCGAGACC CTGACGATCT CGGGTGAGAC CGCCATCGAC ACGCTCGGGC TGCCGGCCAC CACCGAAACC GGGGGGAGCC TGAACGATGT CGACCTGACC ACGCGCCAAG GGGCCAGCGA TGCCCTCTCA CGTCTGGATG CCGCACTGGA CGCGGTCAAC GCGCGGCGCA GCCAGTTCGG GGCCATCCAG AACCGCTTCG AGTCGACGAT TACCAACCTG GGCATTACCA GTGACAACCT GGCGGCCGCC CGCAGCCGTA TCCTGGATGC CGATTACGCG GCCGAGGTCA CCCGCCTCGT GCGCGCCCAG ATCCTGCAAC AGGCCGGAGT CGCCGTGCTC GTTCAGGCCA ACGCGCTACC GCAGACCTTC CTGCGCCTGC TCCAGTAA
|
Protein sequence | MAQVLSVATN LLSLNAQRHL RQSQAALQTA FQRLASGQRI NSARDDAAGL AISERFTTQI RGLNQAMRNA QDGISLAQTA EGALGSMTGN LQRIRQLAVQ AANATLSASD RGAIDREVRQ LLAENDRIAT QTTFNGRSLL DGSMGTAFFQ IGPNAGQTLA IDLGDSMRNR FVGAIASAYS GELEAAFASD PLEVDQDFFI RVGERDPVEI PAGHYRTPQE LTAAIDRALG STGRAVLTDD GRLNLIARET LTISGETAID TLGLPATTET GGSLNDVDLT TRQGASDALS RLDAALDAVN ARRSQFGAIQ NRFESTITNL GITSDNLAAA RSRILDADYA AEVTRLVRAQ ILQQAGVAVL VQANALPQTF LRLLQ
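
As a rough illustration of how the protein sequence follows from the gene sequence, the sketch below translates the first and last codons of the gene. It is a minimal example, not the pipeline used to produce this record: the names `CODON_TABLE` and `translate` are hypothetical, and the table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in the record.
first_codons = "ATGGCCCAGG TGCTGAGTGT TGCCACCAAC"
last_codons = "CTGCGCCTGC TCCAGTAA"

print(translate(first_codons))  # MAQVLSVATN
print(translate(last_codons))   # LRLLQ
```

Both outputs agree with the start (MAQVLSVATN) and end (LRLLQ) of the listed protein sequence.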