Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1325 |
Symbol | |
ID | 8807091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 1412018 |
End bp | 1414060 |
Gene Length | 2043 bp |
Protein Length | 680 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transglutaminase domain protein |
Protein accession | YP_003460569 |
Protein GI | 289208503 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00529222 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCGTT CGAGCCGGCA GCATTTACCG GAGTTGCGGG GCCTGGAACT CGCGCTGCTC GCCCTGCCAG TGGCCGCGCT GCCACATGCC TGGCACCAGC CACTGTGGGT CACGCTGCTC GTAGCGGCCG CCATCCTGCT GCGCAGCTGG CTCCATCTGC GCGACCGCAA GCCCCCTTCC ATCGGACTGA TGGCCCTGCT CGCCGCTTTC GCGGGCGGGC TGACGTTCCT GCAGTACGGC ACCCTGTTCG GGCAGGAGGC AGGCACCGCC CTGCTGCTGG TCATGATGGC GCTCAAACTG CTCGAGACCC GCAACCGGCG CGATATCGTG ATCGGGCTGT TCCTCGGCTA TTTCGTGGTC GTCACCACCT ATTTCTTCGA CCAGTCGATG CTGATCGCCG CCTGGTCTCT GCTGTCTGCC TGGCTGCTCA CCGCCGCCCT GGTCCAGGTG CATGCCGGGC GCCCCCTCGA ACGGCGGCGC CTGGCGAAGC ATTCCGGCTG GATGATGGTC CATGCCTTCC CGTTCATGCT GATCCTGTTC GTCGCCTTCC CGCGCGTACA GGGACCACTG TGGGGCGACC CGCAACGCGA CGAGGTGGCT ACCACGGCCC TTTCCGGCGA ACTGAACCCT GGCGACATCG CGGAACTGCT GCAGGACGAA ACCACCACCA TGCGCGTCCA GTTTCACGGC TCCGTCCCGC CGCCCCGGGC GCAATACTGG CGGGCACTGG TGATGACGGA CTTTGACGGC CGCCGCTGGC AGGCCGAAGG GGGACAATCC AGCATCGAGC TGCCACAGCC CGGAGACTCG GACCGCGTGG TGGCCTACAC CGCGACGCTG GAGCCCACCC GCATGCGCTA CCTGCCGGTA CTCGACTACC CCGTGGCGCT GCCCGACAAC GCCGAGTACC GCGACAACCA CCAGGTCGTC CGCGACCGAC GGATCGTGAA CCGCATCCAG TACGACGCCG AGGCCGACCT GACACGTCCG CCAGGCGCCG GCGAAGCCCT GAGCAGGTCC GCCCGTGAAC GCGCTCTTGA GCTCCCACCC AGCGCAGCCC CTCGCGCACG TGCCGAGGTC GCTCTCTGGC GCGCCACGCA TGGTGACGAT GACCGCGCGA TCATCCAGGC CGCCCTGGAT CGCTTCGCCG CCGCCCCCTA TCGTTACACC TTGCAGCCCC CAACGCTCGA GGGGGATGTC ACCGACCAGT TCCTGTTCGA GACCCAGGCC GGTTACTGCG AGCACTATGC CTCCGCGTTC GCCGTGCTGA TGCGCTCGGC GGGGATCCCG ACCCGCGTGG TTACGGGCTA CCAGGGGGGC GAATGGATGC AGCGGGGCGA ATACCTGCGC CTGCGCAATG CCGACGCGCA CGCCTGGAAC GAGGTCTGGC TGGATGGCGA GGGCTGGATT CGCGTCGACC CCACCGCGGC GATCGCGCCG GAGCGGATCG AGGCCGGCAT CGGCGGCCTC ACCGGTGGCG ACGGCGAACC CATGCCCGAT TTCCTGCGTC GTGACGGGCT GGGCTGGGTG CAGCAACTTC GATTTGGCTT CGAAGACTGG CGCGATTTCG CCCGGTTCCG CTGGGAGAGC TGGGTGCTGG CATTCGATCC CGAGCGCCAG CGGGAACTGT TTGCCCGTTT CGGGCTCGAC GCGACGGACT GGCGTGACAT TGTCACGGCA CTGGGCGTCG GCTTTGGCCT GCTGGCTGCC GTGGCCCTGG TCTGGAGCGG CTGGCGCAGG CCTCGTCGCC AGCTGGAGGT GCCGGATCGC CTCCTGCACC GACTTTCCCG GCGGATCGAA CGCCAACAGA CAGGACTGGG ACGGCGCCCG CACGAGCCCG TCATCACCTG GACCCGACGC GTCAAGGTTG CACGGCCGGA TCTCGCGCCC CTCGTCGATG CCTTCGCCGA GCACTACAAC CGGGTCCGCT TCGCGCCCGC CCGGCCCGAC GACCGGGCCA GACACCTGAC AACCCTGCGC CGCCTCGCTG CATCCGACCT TCGCAAACCC GCCGCCGACC CGCCCTTGCC GCGGACAACC TAG
|
Protein sequence | MNRSSRQHLP ELRGLELALL ALPVAALPHA WHQPLWVTLL VAAAILLRSW LHLRDRKPPS IGLMALLAAF AGGLTFLQYG TLFGQEAGTA LLLVMMALKL LETRNRRDIV IGLFLGYFVV VTTYFFDQSM LIAAWSLLSA WLLTAALVQV HAGRPLERRR LAKHSGWMMV HAFPFMLILF VAFPRVQGPL WGDPQRDEVA TTALSGELNP GDIAELLQDE TTTMRVQFHG SVPPPRAQYW RALVMTDFDG RRWQAEGGQS SIELPQPGDS DRVVAYTATL EPTRMRYLPV LDYPVALPDN AEYRDNHQVV RDRRIVNRIQ YDAEADLTRP PGAGEALSRS ARERALELPP SAAPRARAEV ALWRATHGDD DRAIIQAALD RFAAAPYRYT LQPPTLEGDV TDQFLFETQA GYCEHYASAF AVLMRSAGIP TRVVTGYQGG EWMQRGEYLR LRNADAHAWN EVWLDGEGWI RVDPTAAIAP ERIEAGIGGL TGGDGEPMPD FLRRDGLGWV QQLRFGFEDW RDFARFRWES WVLAFDPERQ RELFARFGLD ATDWRDIVTA LGVGFGLLAA VALVWSGWRR PRRQLEVPDR LLHRLSRRIE RQQTGLGRRP HEPVITWTRR VKVARPDLAP LVDAFAEHYN RVRFAPARPD DRARHLTTLR RLAASDLRKP AADPPLPRTT
|
| |