Gene Information |
Locus tag | TK90_0154 |
Symbol | |
ID | 8805884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | + |
Start bp | 159175 |
End bp | 160185 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | metalloendopeptidase, glycoprotease family |
Protein accession | YP_003459406 |
Protein GI | 289207340 |
COG category | |
COG ID | |
TIGRFAM ID | |
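The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and used for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the stop codon is part of the CDS but not of the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length(159175, 160185)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Both values match the Gene Length (1011 bp) and Protein Length (336 aa) rows of the record.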
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.677092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.49606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTCC TGGGTATCGA GACCTCCTGC GACGAGACGG GCGTGGCGCT GATCGATGAT CAGCGCGGCC TGCTCGCGCA CCGGCTGTAC AGCCAGACTG ACCTGCACGC GGTCTACGGC GGCGTGGTAC CGGAGCTCGC TTCGCGCGAT CACATCCGGC GCCTGCTGCC GCTGCTGCGG GCGGTGCTGG ACGAGGCGGG CGTGAAGGGC CCCGAGCTGG ATGCGATCGC CTATACCGGC GGGCCGGGCC TGCTTGGGGC CCTACTGACG GGGGCCTCGG TCGCGCGTTC GCTCGCCTGG GGCTGGGGCG TGCCGGCGAT GCCGGTACAT CACCTCGAAG GCCATCTGCT GGCCCCGTTC CTGGAAGACG CCGGGCTCGA TTTCCCGTTC CTGGTGCTGC TGGTCTCGGG CGGGCACACG CAGCTGATCC ACGCCCGATC TCTGGGCGAC TACGAACTGC TGGGCGAGAG CATCGACGAC GCCGCTGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTGG GGCTGGGCTA TCCCGGAGGT CCCGCCCTGT CGCGCCTGGC CGAGCAGGGG GCGAGCGATG CCCTGCGCCT GCCGCGCCCG ATGCTCGACC GGCCGGGGCT GGACATGAGC TTTTCCGGCC TGAAGACGGC CGTGCTGACT GCCCTGAACA AGCAGGAATA CCGCCCGCAG GATGTCGCGC GCGCGTTCGA GGAGGCGGTC AGCGAGACCC TGGTGGAGAA GACCCGGAGG GCGCTGGAGC AGAGTCAGGC CCCGGCGCTG GTGGTCGCCG GTGGGGTGGC CGCGAATACC CGGCTGCGCG CGGGACTGCA GGCGATGGCC GCGCAGCAGG GGGTGCCGGT GTATTTTCCG CGGATCGAGT TCTGTACGGA CAACGGGGCG ATGATCGCGC TCGCCGGGCT GCGCCGCCTG CAGGCGGGCT GGACCCCCGA CGACCAGGCC CCGGCGATCA CGGCGCGTGC CCGCTGGCCG TTGGCGGAGC TCTCCGCCTG A
|
Protein sequence | MKVLGIETSC DETGVALIDD QRGLLAHRLY SQTDLHAVYG GVVPELASRD HIRRLLPLLR AVLDEAGVKG PELDAIAYTG GPGLLGALLT GASVARSLAW GWGVPAMPVH HLEGHLLAPF LEDAGLDFPF LVLLVSGGHT QLIHARSLGD YELLGESIDD AAGEAFDKTA KLLGLGYPGG PALSRLAEQG ASDALRLPRP MLDRPGLDMS FSGLKTAVLT ALNKQEYRPQ DVARAFEEAV SETLVEKTRR ALEQSQAPAL VVAGGVAANT RLRAGLQAMA AQQGVPVYFP RIEFCTDNGA MIALAGLRRL QAGWTPDDQA PAITARARWP LAELSA
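The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates DNA with the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in NCBI order: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Raises ValueError on any character other than T, C, A or G.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAGTCC TGGGTATCGA GACCTCCTGC"
print(translate(prefix))  # MKVLGIETSC
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared against the GC content and Protein sequence rows in the same way.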