Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_1952 |
Symbol | |
ID | 8807726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2074670 |
End bp | 2076091 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | protease Do |
Protein accession | YP_003461179 |
Protein GI | 289209113 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.151255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGAC CGATGCAATG GGCCCTGCTG GGGCTGATCC TGTTTGCCTT CTCCCTGCCG GCCTCGGCGT GCATGCTGCC GGACTTTGTC CCGCTGGCGG AGGAGCACAG CCCCGCGGTG GTCAATATCT CCACCACGCG TGAGCGCGAG GGTGGTCAGA CGGGCGGACA CCCCGATTTC GAGGGTTCTC CGTTCGAGGA CCTGTTCGAA CGCTTCTTCG GTTCGCCCCC GGGCGGTGAG GGGGGGCAGG GACGGATGCC CGAGCGCAGC TCGCTGGGGT CCGGCTTTAT CTACACCGAG GACGGTTACA TCATCACGGC GAACCACGTC GTGGAGGGGG CTTCCGAGGT CGTCGTGCAT CTCTCCGACC GGCGTGTGTT CGATGCCGAA GTGGTCGGCA AGGACCCGCA AAGCGACGTG GCTTTGCTGA AGATCGATGC CGATGATCTG CCGACGCTGG AGCTGGGCTC CTCGGACGAC CTCAAGGTCG GCGAGTGGGT GCTCGCGATC GGCTCGCCGT TTGGCTTTGA CCACTCCGTG ACGGCCGGGA TCGTCAGCGC CAAGGGGCGC AATCTCCCGA CCGAGAACTA CGTCCCGTTC ATCCAGACCG ATGTCGCGAT CAACCCCGGG AATTCGGGTG GCCCGCTGCT GAATCTGGAC GGCAAGGTCG TGGGGATTAA TGCCCAGATC TACAGCCGCA CCGGTGGCTT CATGGGGCTG TCGTTCGCCG TCCCGATCGA GATGGTCGAG GATGTCGTCA AGCAGCTGCG CGAGCACGGC GAGGTCACGC GAGGCTGGCT GGGCGTGCTG ATCCAGGAGG TGACGCGTGA TCTGGCTGAG TCGTTTGGTA TGGACAAGCC CAGCGGCGCC CTGGTCGCCC GCGTGCAGTC GGACAGCCCG GCCGAGAAGG CCGGCTTCGA GACGGGGGAT GTGATCCTCA AGTTCAATGG GATCGAGGTC CCGAACTCCT CCGCCCTGCC GCCGATCGTG GGACGCACAC CGGTCGGCAC CGAGGCCGAG GTGGAGATCC GTCGCGGCGA GGAGACCAGG ACCCTGATGG TCGAGATCGA ACGCCTGCCG GACGATATCG CGGCGGAGCG CGGTGGTCCC CAGCCGGAGC AGGGAGAGCG TGCCGAGCCG CAAAGCCTGC TCGGGATGCA TCTGGAGCCG CTGGACGCGG CCCAGGCCGA GGAGCTGGGT CTCGACGGCG GGCTCGTGGT TACGGAGGTG ACCGGCAACC CGGCGCGTTC CTCCGGCATT CGTCCGGGCG ACGTGATCGT GCAGTTCGGT CGCCACTCGG TGGATTCGCT GGACGCGCTG GAAGAGCAGA TCGAGGCCGC CGGTAGTGGG CGCACGGTGC CGGTCCTGAT TCACCGCGAC GGCAACCCGA CGTTTATCGC CTTGCGTATC CCGTCCGAAT GA
|
Protein sequence | MTRPMQWALL GLILFAFSLP ASACMLPDFV PLAEEHSPAV VNISTTRERE GGQTGGHPDF EGSPFEDLFE RFFGSPPGGE GGQGRMPERS SLGSGFIYTE DGYIITANHV VEGASEVVVH LSDRRVFDAE VVGKDPQSDV ALLKIDADDL PTLELGSSDD LKVGEWVLAI GSPFGFDHSV TAGIVSAKGR NLPTENYVPF IQTDVAINPG NSGGPLLNLD GKVVGINAQI YSRTGGFMGL SFAVPIEMVE DVVKQLREHG EVTRGWLGVL IQEVTRDLAE SFGMDKPSGA LVARVQSDSP AEKAGFETGD VILKFNGIEV PNSSALPPIV GRTPVGTEAE VEIRRGEETR TLMVEIERLP DDIAAERGGP QPEQGERAEP QSLLGMHLEP LDAAQAEELG LDGGLVVTEV TGNPARSSGI RPGDVIVQFG RHSVDSLDAL EEQIEAAGSG RTVPVLIHRD GNPTFIALRI PSE
|
| |