Gene Information |
Locus tag | TK90_2448 |
Symbol | |
ID | 8808229 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 2567856 |
End bp | 2571224 |
Gene Length | 3369 bp |
Protein Length | 1122 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | CRISPR-associated helicase Cas3 family |
Protein accession | YP_003461674 |
Protein GI | 289209608 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
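The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the reported 3369 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length_bp(2567856, 2571224)
    print(length)                     # 3369
    print(protein_length_aa(length))  # 1122
```

Both values agree with the Gene Length and Protein Length fields in the record.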
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.416428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTCC TGTTCATTTC CCAGTGCAGC AAGAACGCCC TGCGTGAAAC CCGGCGCATT CTCGATCAGT TCGCGGAACG CCGCGGCGAA CGGACCTGGC AGACCCCCAT CACCCGGGAA GGGCTCGACA CCGTCCGGCG CATGCTTCGC AAGAGCGCGC GGCGCAACAC GGCCGTTGCC TGCCACTGGA TCCGTGGACG CAACCAGAGC GAGGTTCTGT GGATTGTCGG CAATGCGAGC CGGTTCAACC TCCGCGGCGC GGTTCCGACC AATTCGACGA CCCGCGATGT GCTGCGCAGG CACGACGAAA ACGACTGGCA CACCGGAGAG GACATCAAGC TGCTGGCCGC CATGGCCGCA CTGCTGCATG ACCTTGGCAA GGCGTGCGCG GCTTTTCAGA ACCGGCTTAC CGCGCGGGGC AACCCCGGAC GCAACGACTA CCGCCACGAA TGGATCAGCC TGCGAATGTT TCAGGCATTC GTGGGCGACG ACGATGACCG GGGCTGGCTC GAACGCCTGA AGGACACGGC CCCGACCAGT GATCAAGACT GGATAGACGG TCTTCTGCGC GACGGGCTGG ATCAAAAGGG CGGCTCCCCC TTTCGGACAC TTCCACCGCT GGCAGCGGCG ATCGGCTGGC TGGTGGTTAG CCATCACCGT CTTCCGCTGC TTCCGGACGA CGACGGTCGC CCGGGCGGAA AGATCAGCGG ATTTCGGACC GAAACCCTGC GAGGGCTTCC GGACATCATC GAGGCCAACT GGAACGAACA GCCCGGCGAA ACCCGTCAGG CAGCCATAAG GCCCTATTGG GAATTCCGCC ATTCGCTCCC CACGAGCAGC GAGAAATGGC ATCTGCGGGC CCGGCACATC GCCGAACGGC TACTGGACCG ACGGGCACAC GCCGGAGATA ACTGGCTCGA CAGGCCCTAT GTCATGCACG TGGCCCGCAT GAGCCTCATG CTTGCCGACC ATCACTATTC CAGCGTTACC GACCCGAGAA AGCGACTCAG GGGTGATCCG GACCACCGGC TGTACGCCAA CACCAACCGC GAAACCGGAC GGCTCAACCA GCCACTGGAT GAACACCTGA TCGGGGTGGA GGCCCATGCG CGTGCCATTA CCGGCGCCCT CCCCGGGGCC GAGCAGCACC TGCCCCGCAT CGCCCGGCAC AGGGGCTTTC GCAAGCGTGC GTCCGACCCC CGATTCCAAT GGCAGAACCG GGCATTCGAT CTCGCCACCT CGCTGCGGGA ACGCGCATCC GCCCAGGGTT TTTTCGGCAT CAACATGGCT TCCACCGGGA GGGGCAAGAC CCTGGCGAAC GGCCGCATCC TGTATGGACT GGCCGATCCC GAGCGAGGAA CACGCTTCTC CATCGCCCTG GGCCTGAGGA CCCTGACGCT GCAGACCGGA CAGGCCTATC GCCAACTGCT CAACCTGGGT GAGGATGAGC TGGCCATCCG CGTCGGTGGA GCCGCCAGCC GCACCCTGTT CGAACACTTC GAGGAACAGG CCGAACACCA CGGATCGGCC TCGGCCCAGG AATTGATGGA CGAGGACGGC CATGTCTACT TCGACGGCGA CTTCACGAAC CATCCGGTCC TCCGTCGCCT GAGCCGCGAC CCGGCCACAC GTTCCCTGAT CGCCGCACCC ATCCTGGCCT GCACCGTGGA CCACCTGACG CCCGCGACCG AGAGCCTGCG TGGCGGCCGT CAGATTGCGC CCATGCTGCG GCTGATGAGC AGCGACCTGG TGCTCGATGA AATCGACGAC TTCGGGCTGG AAGACCTGCC GGCTCTCACA 
CGCCTGGTCC ACTGGGCAGG ACTCCTGGGC TCCCGGGTGC TGCTGTCGTC CGCAACGCTT TCACCCAGCC TGGTGGAAGG GCTGTTCGAG GCCTATCGCG AGGGACGGCG GGAGTTTCAG CGCAACCGCG GTGAACCCGG GCGTCCCGTC AACATCGCCT GTGCATGGTT CGACGAATAT TCCGCCCAGC ACCAGGATTG CGCGGACCAG CCGCAGATCG CGGCCGCCCA TCGCCAGTTT GCGCAGGCCC GCAAGGCCCG GCTGGCCCGC GAACCGGTTC GCCGCAGTGC CACACTGGCC CCGGTTGCCA TTACGAGCAC CGAATCGCGC GAGGAGACCC GACACCAGTA TGCGGACGAT CTTCGTGAGC ATGCACTGGA ACTGCATCGG CAACACCACG CCACGGACCC CGAAAGCGGA AAGCGAATCA GCTTCGGACT GATCCGGCTG GCCAACATCG AACCCCTTGT CGATATTGCG CGCCATATCT TCCGGGAAGG CGCCCCGGAA GGACACCGCC TCCACCTGTG TGTCTATCAC TCGCGTTTTC CGCTGGTGAC CCGGTCGGAA ATCGAAAGAA ACCTCGACAC GCTGCTGAAC CGGAAAGCCC CCGATGCGGT ATTCCAGGCA CCGGCGGTGC GCCGGCTTAT CGACTCCAGC CCCGAACCGG ACCAGCTGTT TGTCGTGATC GGGTCGCCCG TGACGGAGGT GGGGCGCGAT CACGACTACG ACTGGGCCGT CGTGGAGCCA TCCTCGATGC GTTCGATCAT CCAGCTTGCC GGGCGTGTCC GGCGTCACCG GGAGCCGGCC AACAGTTGCC CCAACATCAT TCTGCCGCGG AAAAACCTGA AGGCGCTGGA AACCCCGAAC AAACCGGCCT TCTGCCGTCC CGGCTTCGAG AGCGAGACGC ACCGGCTGCG CAGCCATGAT CTTCAGGAGC TTCTACGCCA GGAAGAATAC GCCCGGGTGG ACTCCATCCC CAGAATTCTC GAGCCCGAAC CACTGCAGCC GACGGGTTAC CTGGTGGACC TCGAACATGC ACGCCTTCGC GACACGGTCG CACCCCGAGA CACCGGACCG TCACCCGAGA CCCTGTCTCC CAGAGAACGC CGGGCATGCA AGTCCGACAG CCCCCCGCCG CTGAACGCCG CATCCTGGTA TCAGGTTTCC CGCGCCAGCC TGACCGGCAC GCTCCAGCGC ATCCAGCCAT TCCGCGAGGC GTCCGCGCTG GAGGAGATCG AACTGGTCCT GATGCCGGAT GACAGCGGCG AAGACTGGCA GCTGCAGCAG ATCTGGAGCG CCCCGGGGAA ACGGGGAAAG TCGGTCTATA TCGATGTCGA GAAGAGCCTT CTCCTGCGCG AAGACCTGGA GGACAACATC GGCCCCCGAA CCGACGTCTG GATCACGAGC CCCTACATGG ATGCCCTCAC GAGCCTCGCC GACGAGCTGG AGATGCCTCT CGAGGAATGC GCCCGGCGCT TCGGGACCGT CTCGGTGCCG AAGCACGAAA GCGGCTGGCG TGTTCACGAC ACCCTTGGTT TCACCGTGCG GCATCCGGCC AGGCAATAA
|
Protein sequence | MNVLFISQCS KNALRETRRI LDQFAERRGE RTWQTPITRE GLDTVRRMLR KSARRNTAVA CHWIRGRNQS EVLWIVGNAS RFNLRGAVPT NSTTRDVLRR HDENDWHTGE DIKLLAAMAA LLHDLGKACA AFQNRLTARG NPGRNDYRHE WISLRMFQAF VGDDDDRGWL ERLKDTAPTS DQDWIDGLLR DGLDQKGGSP FRTLPPLAAA IGWLVVSHHR LPLLPDDDGR PGGKISGFRT ETLRGLPDII EANWNEQPGE TRQAAIRPYW EFRHSLPTSS EKWHLRARHI AERLLDRRAH AGDNWLDRPY VMHVARMSLM LADHHYSSVT DPRKRLRGDP DHRLYANTNR ETGRLNQPLD EHLIGVEAHA RAITGALPGA EQHLPRIARH RGFRKRASDP RFQWQNRAFD LATSLRERAS AQGFFGINMA STGRGKTLAN GRILYGLADP ERGTRFSIAL GLRTLTLQTG QAYRQLLNLG EDELAIRVGG AASRTLFEHF EEQAEHHGSA SAQELMDEDG HVYFDGDFTN HPVLRRLSRD PATRSLIAAP ILACTVDHLT PATESLRGGR QIAPMLRLMS SDLVLDEIDD FGLEDLPALT RLVHWAGLLG SRVLLSSATL SPSLVEGLFE AYREGRREFQ RNRGEPGRPV NIACAWFDEY SAQHQDCADQ PQIAAAHRQF AQARKARLAR EPVRRSATLA PVAITSTESR EETRHQYADD LREHALELHR QHHATDPESG KRISFGLIRL ANIEPLVDIA RHIFREGAPE GHRLHLCVYH SRFPLVTRSE IERNLDTLLN RKAPDAVFQA PAVRRLIDSS PEPDQLFVVI GSPVTEVGRD HDYDWAVVEP SSMRSIIQLA GRVRRHREPA NSCPNIILPR KNLKALETPN KPAFCRPGFE SETHRLRSHD LQELLRQEEY ARVDSIPRIL EPEPLQPTGY LVDLEHARLR DTVAPRDTGP SPETLSPRER RACKSDSPPP LNAASWYQVS RASLTGTLQR IQPFREASAL EEIELVLMPD DSGEDWQLQQ IWSAPGKRGK SVYIDVEKSL LLREDLEDNI GPRTDVWITS PYMDALTSLA DELEMPLEEC ARRFGTVSVP KHESGWRVHD TLGFTVRHPA RQ
|
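The sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, and the codon table is built by hand in the usual TCAG ordering. The amino acid assignments of translation table 11 match the standard code (the two tables differ only in permitted start codons), so a single table is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    head = "ATGAACGTCC TGTTCATTTC CCAGTGCAGC"
    print(translate(head))    # MNVLFISQCS
    print(gc_fraction(head))  # 0.5
```

The translated prefix matches the first block of the protein sequence. Running `gc_fraction` over the full gene sequence would be the way to compare against the GC content field, which the record reports rounded to 65%.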