Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TK90_2020 |
Symbol | |
ID | 8807795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. K90mix |
Kingdom | Bacteria |
Replicon accession | NC_013889 |
Strand | - |
Start bp | 2141915 |
End bp | 2143201 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | carboxyl-terminal protease |
Protein accession | YP_003461247 |
Protein GI | 289209181 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0858479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGT CGTGTCGCAC CGGGTACGCC CTGGTGGTGG GTTTGGTGGT CGGTGTCATG CTTAGCGTGT CGGTGGCGGT GTATGCCGAT CGCGAGAATG GCGCACAGAA TGCCTTGCCG GTGGAGGACC TGCAGCGGTT TACCGAGGTG TATATGCGCA TCAAGCGCAA TTACGTCACC GAGGTGGACG ACAAGGAGCT GCTGGATAAC GCCATCCAGG GGATGCTGTC CGGGCTGGAT CCGCATTCGG CCTACCTGGA CGAACAGGAC TTCGAGGACA TGCAGGTCGG CACCTCCGGC GAGTTCGGCG GTCTGGGGAT CGAGGTCGGT ATGGAGGATG GCTTCGTCAA GGTCATTGCC CCGATCGATG GCACCCCGGC GAGCAAGGCC GGCATCGAGG CGGGTGACCT GATCATTCGC CTGGACGGTG AATCGGTGCA GGGCATGACG CTGTCCGATG CCGTCTCCAA GATGCGCGGC GAGAAGGGCT CCGACATTAC TCTGACCATC GTCCGCGAGG GCGAGGACCA GCCGAAGGAG ATCACCCTTA CCCGTGACCG TATCCAGGTC CAGAGTGTGC GCTCCGAGAT TCTCGAAGAC GGGTATGGCT ACCTGCGTAT CAGCAACTTC CAGCAGCGCA CCGCGCGTGA CGTTGTGCGA GCCGTCGAAG AGCTGAAGGA AGAGGGCGAT CTGCGCGGCC TGGTGCTGGA TCTGCGCAAC AATCCGGGCG GCATCCTGAA TGGTGCGGTC GGTGTGTCCG ATGCTTTCCT GGAGGAAGGG CTGATCGTTT ATACCGAGGG TCGGTTGGAG GACTCTCAGT TCCGCTATCA GGCCTCGCCG GGTGATGTGC TCGGCGGTGC CCCGATGGTG GTACTGGTGA ACCGGGGTTC GGCCTCGGCC TCCGAGATCG TGGCCGGCGC CCTGCAGGAT CACAAGCGCG CGGTGGTCAT GGGCCAGAAC ACCTTTGGCA AGGGTTCGGT GCAGACAATC CTGCCGCTGA CCGAGAACAC CGGTATCAAG CTGACCACGG CGCGTTACTT CACGCCGGAT GGGCGCAACA TCGAGGAAGA GGGAGTAGCA CCCGACATCC GGCTGGAAAA CCTGACGGTC ACGCGTGCCG AAGGCGAGGA TGAGCGTGAC GCCCAGGCGC GTATGCAGCG CGAGCTGCAG GGCGAGGACG TGCCGGAAGA CGATGACGAC AATGGCGAGA GCCTGGCCGA GCGTGATTAC GGTCTGAGCG AGGCCCTGAA TCTGCTCAAA GGGCTGAACA TCTACAGTCA GCGCTGA
|
Protein sequence | MKKSCRTGYA LVVGLVVGVM LSVSVAVYAD RENGAQNALP VEDLQRFTEV YMRIKRNYVT EVDDKELLDN AIQGMLSGLD PHSAYLDEQD FEDMQVGTSG EFGGLGIEVG MEDGFVKVIA PIDGTPASKA GIEAGDLIIR LDGESVQGMT LSDAVSKMRG EKGSDITLTI VREGEDQPKE ITLTRDRIQV QSVRSEILED GYGYLRISNF QQRTARDVVR AVEELKEEGD LRGLVLDLRN NPGGILNGAV GVSDAFLEEG LIVYTEGRLE DSQFRYQASP GDVLGGAPMV VLVNRGSASA SEIVAGALQD HKRAVVMGQN TFGKGSVQTI LPLTENTGIK LTTARYFTPD GRNIEEEGVA PDIRLENLTV TRAEGEDERD AQARMQRELQ GEDVPEDDDD NGESLAERDY GLSEALNLLK GLNIYSQR
|
| |