Gene TK90_2021 details


Gene Information

Locus tag: TK90_2021
Symbol:
ID: 8807796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013889
Strand:
Start bp: 2143343
End bp: 2144482
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Peptidase M23
Protein accession: YP_003461248
Protein GI: 289209182
COG category:
COG ID:
TIGRFAM ID:
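
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2143343
END_BP = 2144482
GENE_LENGTH_BP = 1140
PROTEIN_LENGTH_AA = 379


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1140
print(expected_protein_length(GENE_LENGTH_BP))  # 379
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields of the record.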


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGGCCC CCGCGGGCAC CGTGTCGGCG TTTGAACCAG AGGCCGAACT GGAGGAGACG 
CTGGAGGCCA TCCGCGACCT GGAGCGCAGC CAGGAGGAGC GCCAGGCGGC GCTCGAGCGC
CTGGAGGACG AGCTCGAGCG GGCGGCCCGG GGCAGCAGCG AGTCGCGGCG CGAACTGCGC
GAACTGGAGG CCGAACGCGA GCAGCAGGCC GAGGTGATCG CCGAGCACGA GGCACGTGTC
GAACAGGAAG AGGATCGCCT GCGCGAAGAA CGCGTGCAGG CGGGGCGACT GCTGCGAGAC
CAGTGGCAGC GCGACCGGCA CCCGGGGCGG GTGCCGGGTA CCGGTGGCGA CGGCGAGCTG
AGCCGGCTGC ATCCGGAGAT CGCCGCGCGC TTGCGCGAGG CACGGGCCGA GGCACTGGCG
GCCCTGGGCG AGCAACTCGA GGTGTTGCGG GCCGCCCGCG ATGATCTGGA GCGCGAGCAG
GCGGTGCTGG CCGAGCAGGA GGCGGAGCTG CGCGAGGTGG TGGCCGAGCT CGAGCGCGAG
GAGGAACGGC AGCGCGCGGC GATGGACGAG CTGGAACGCG CGATCGAGGA CGAGGCGCTG
GAGCTGGCGC GCCTGGAGCG CAATGCCGAG ACGCTGGAGG AGCTGATCCG CGAGGTGGAG
CGTGATGCGG CGGAGCGCGA AGAGCGCGCG GCGCGTGGCG ACCCTCCGCC CGATCGGGGG
CCCGTACGGT CCGATGTGGC ATTTTCCGAC CTCCAGGGGG AACTCCCCAG ACCCGCCGAA
GGCTCGGTCG TCCGGCGTTT CAACGAGCCG CGTGGCAGTC GTCTGCAGTC CCGTTGGCGG
GGGACCGTTC TGGAGGTCGA CAATGGCGAG GCGGTACATG CCGTCCACTT TGGCCGCGTG
GTCTACGCCG ACTGGATGCA GGGATACGGC TTTCTGGTCA TCCTCGATCA CGGGGGCGGT
TACCTGACGC TGTACAGCAA CCTGGAGGAG ATCCTGGTCG CCGAGGGCGA GGAAATCGAA
GGCGGCGAGC GCATGGCTCT GGCCGGCGCG GGTCGCGAGG CGATCGCGCC GGGGCTGTAC
TTCGAAATTC GGCGAAATGG CGATCCGTTG AACCCTGAGG ATTGGTGGCT ATCTCAATGA
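
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted in the 10-base blocks shown above; `gc_fraction` is an illustrative helper, not a function of the source database, and the sample string is only the first line of the gene sequence.

```python
def gc_fraction(dna):
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence; paste all lines to check the whole gene.
first_line = (
    "TTGCTGGCCC CCGCGGGCAC CGTGTCGGCG "
    "TTTGAACCAG AGGCCGAACT GGAGGAGACG"
)
print(f"{gc_fraction(first_line):.0%}")
```

Run on the complete sequence, the result can be compared with the GC content listed in the Gene Information fields.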
 
Protein sequence
MLAPAGTVSA FEPEAELEET LEAIRDLERS QEERQAALER LEDELERAAR GSSESRRELR 
ELEAEREQQA EVIAEHEARV EQEEDRLREE RVQAGRLLRD QWQRDRHPGR VPGTGGDGEL
SRLHPEIAAR LREARAEALA ALGEQLEVLR AARDDLEREQ AVLAEQEAEL REVVAELERE
EERQRAAMDE LERAIEDEAL ELARLERNAE TLEELIREVE RDAAEREERA ARGDPPPDRG
PVRSDVAFSD LQGELPRPAE GSVVRRFNEP RGSRLQSRWR GTVLEVDNGE AVHAVHFGRV
VYADWMQGYG FLVILDHGGG YLTLYSNLEE ILVAEGEEIE GGERMALAGA GREAIAPGLY
FEIRRNGDPL NPEDWWLSQ
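
The gene begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG can act as an initiation codon. The sketch below translates a fragment of the gene sequence under that table. It assumes the NCBI table 11 codon assignments and start codon set, and `translate_cds` is an illustrative helper rather than the tool used to produce this record.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11, with codons
# enumerated in the conventional T, C, A, G order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as initiators; at the first position of a
# CDS they are read as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna, at_cds_start=True):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and at_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate_cds("TTGCTGGCCC CCGCGGGCAC"))  # MLAPAG
```

Passing `at_cds_start=False` translates an internal fragment without the initiator rule, which is useful for checking individual lines of the gene sequence against the protein sequence.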