Gene TK90_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_2020 
Symbol 
ID8807795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp2141915 
End bp2143201 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content62% 
IMG OID 
Productcarboxyl-terminal protease 
Protein accessionYP_003461247 
Protein GI289209181 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0858479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT CGTGTCGCAC CGGGTACGCC CTGGTGGTGG GTTTGGTGGT CGGTGTCATG 
CTTAGCGTGT CGGTGGCGGT GTATGCCGAT CGCGAGAATG GCGCACAGAA TGCCTTGCCG
GTGGAGGACC TGCAGCGGTT TACCGAGGTG TATATGCGCA TCAAGCGCAA TTACGTCACC
GAGGTGGACG ACAAGGAGCT GCTGGATAAC GCCATCCAGG GGATGCTGTC CGGGCTGGAT
CCGCATTCGG CCTACCTGGA CGAACAGGAC TTCGAGGACA TGCAGGTCGG CACCTCCGGC
GAGTTCGGCG GTCTGGGGAT CGAGGTCGGT ATGGAGGATG GCTTCGTCAA GGTCATTGCC
CCGATCGATG GCACCCCGGC GAGCAAGGCC GGCATCGAGG CGGGTGACCT GATCATTCGC
CTGGACGGTG AATCGGTGCA GGGCATGACG CTGTCCGATG CCGTCTCCAA GATGCGCGGC
GAGAAGGGCT CCGACATTAC TCTGACCATC GTCCGCGAGG GCGAGGACCA GCCGAAGGAG
ATCACCCTTA CCCGTGACCG TATCCAGGTC CAGAGTGTGC GCTCCGAGAT TCTCGAAGAC
GGGTATGGCT ACCTGCGTAT CAGCAACTTC CAGCAGCGCA CCGCGCGTGA CGTTGTGCGA
GCCGTCGAAG AGCTGAAGGA AGAGGGCGAT CTGCGCGGCC TGGTGCTGGA TCTGCGCAAC
AATCCGGGCG GCATCCTGAA TGGTGCGGTC GGTGTGTCCG ATGCTTTCCT GGAGGAAGGG
CTGATCGTTT ATACCGAGGG TCGGTTGGAG GACTCTCAGT TCCGCTATCA GGCCTCGCCG
GGTGATGTGC TCGGCGGTGC CCCGATGGTG GTACTGGTGA ACCGGGGTTC GGCCTCGGCC
TCCGAGATCG TGGCCGGCGC CCTGCAGGAT CACAAGCGCG CGGTGGTCAT GGGCCAGAAC
ACCTTTGGCA AGGGTTCGGT GCAGACAATC CTGCCGCTGA CCGAGAACAC CGGTATCAAG
CTGACCACGG CGCGTTACTT CACGCCGGAT GGGCGCAACA TCGAGGAAGA GGGAGTAGCA
CCCGACATCC GGCTGGAAAA CCTGACGGTC ACGCGTGCCG AAGGCGAGGA TGAGCGTGAC
GCCCAGGCGC GTATGCAGCG CGAGCTGCAG GGCGAGGACG TGCCGGAAGA CGATGACGAC
AATGGCGAGA GCCTGGCCGA GCGTGATTAC GGTCTGAGCG AGGCCCTGAA TCTGCTCAAA
GGGCTGAACA TCTACAGTCA GCGCTGA
 
Protein sequence
MKKSCRTGYA LVVGLVVGVM LSVSVAVYAD RENGAQNALP VEDLQRFTEV YMRIKRNYVT 
EVDDKELLDN AIQGMLSGLD PHSAYLDEQD FEDMQVGTSG EFGGLGIEVG MEDGFVKVIA
PIDGTPASKA GIEAGDLIIR LDGESVQGMT LSDAVSKMRG EKGSDITLTI VREGEDQPKE
ITLTRDRIQV QSVRSEILED GYGYLRISNF QQRTARDVVR AVEELKEEGD LRGLVLDLRN
NPGGILNGAV GVSDAFLEEG LIVYTEGRLE DSQFRYQASP GDVLGGAPMV VLVNRGSASA
SEIVAGALQD HKRAVVMGQN TFGKGSVQTI LPLTENTGIK LTTARYFTPD GRNIEEEGVA
PDIRLENLTV TRAEGEDERD AQARMQRELQ GEDVPEDDDD NGESLAERDY GLSEALNLLK
GLNIYSQR