Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_0212 |
Symbol | |
ID | 8805942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 220885 |
End bp | 221919 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | type IV pilus assembly protein PilW |
Protein accession | YP_003459463 |
Protein GI | 289207397 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.041624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCG CGCGGTCATT CGCTTCAATC AGGCAGCGCA TTCGCGGCTT CACACTAGTG GAACTGATGG TCGCCATGGT GCTGGGGCTG CTGATCGTGG GTGGAGTGAT CGCGTTGTTC GTCTCCACCC AGCAGACGTC GCGTACTCAG GAGGCGATGT CACGCGCGCA GGAAACCGGA CGCTTCGTGA TTGAACGGAT CGCTCGCGAT GCCCGGGAAG CCGGTCACCA GGGGTGCCGT GGCGGCAACA TCAACAATCT TCTGGATACG GCAAGCGCTG ATTACGATCC GTGGGTGCAT GGTGTCGAAT CCGCCTTTCT GCCACCGGAA CAACCGAACG ATCATTTGCG CGGCGATGTG CTCACCCTGC ACGGAATGAC CGCCATCGGA TCGATACCCG TATCGGTCCC GAACACCACC GCGCCGATCA ACACCGACGA GACCGTCGAT GTTGCTCAAG GGGAGGTCGT GCTGGTCGCC GACCAGGCCG GCACCACCTG CGAGCTGTTC CAGAACGCAC CGGCCCAACC GGGTGTACTC AGCCGAGCGA CAGGCGCGAA CATCTCGCCC GGCAATGTGG CGGGCGACCT GACGGACTTC TCCGGTCCTG CCACCATTTC GGTCTCGCGC CTGGAGACGA TCACCTACTA CATCGCCGAA TCTTCCGCCA GCCCAGGCGT GGCGAGCCTG TTCCGCCGCA GCACGGCGGA TACGGACAGC AGTGGCAACC CGGTGCGCCG CGAGATCGCC GAAGGCGTGT ATGACTTGCG CCTGGAGTTT GGGCAGGACA CCAACGACAA CCAGCAAATC GATCGCTTCG TGGCCGCGAG CAACACCGCG AGCTGGAGTG ATAGCGACTG GGCCCAGGTC GCGGCTGTGC GTGTCCATCT GCTCGTCTAC AACGGGCTGG ACAACAACGT GGTGGACCAG CCCCGCACCG GGCTGCTCTT CGCCAACGAG CTGTTCGATG CGCCGGACCG GCGTTTGTAT CTGGTGTTCA CCACCACCGT CGCCACGCGC AACCGGCTGG AATGA
|
Protein sequence | MSTARSFASI RQRIRGFTLV ELMVAMVLGL LIVGGVIALF VSTQQTSRTQ EAMSRAQETG RFVIERIARD AREAGHQGCR GGNINNLLDT ASADYDPWVH GVESAFLPPE QPNDHLRGDV LTLHGMTAIG SIPVSVPNTT APINTDETVD VAQGEVVLVA DQAGTTCELF QNAPAQPGVL SRATGANISP GNVAGDLTDF SGPATISVSR LETITYYIAE SSASPGVASL FRRSTADTDS SGNPVRREIA EGVYDLRLEF GQDTNDNQQI DRFVAASNTA SWSDSDWAQV AAVRVHLLVY NGLDNNVVDQ PRTGLLFANE LFDAPDRRLY LVFTTTVATR NRLE
|
| |