Gene TK90_1334 details


Gene Information

Locus tag: TK90_1334
Symbol:
ID: 8807100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thioalkalivibrio sp. K90mix
Kingdom: Bacteria
Replicon accession: NC_013889
Strand:
Start bp: 1421611
End bp: 1422819
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: tryptophan synthase, beta subunit
Protein accession: YP_003460578
Protein GI: 289208512
COG category:
COG ID:
TIGRFAM ID:
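
The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(1421611, 1422819)
    print(n, protein_length(n))  # 1209 402
```

With the Start bp and End bp values listed here, the sketch gives 1209 bp and 402 aa, matching the Gene Length and Protein Length fields.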


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.999699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.825059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA TCGACTGGGC GGCGTTCCCG GATGCCCGCG GGCATTTCGG GCCTTATGGC 
GGGCGCTTTG TGTCCGAGAC GCTGATGGGC GCGCTGGACG AACTGCAAAA GGCGTACGAG
CAGTACATGG GCGATGCCGA TTTCATGGCG GAGTTGGACC ATGACCTGAA GCAGTTCGTC
GGGCGCCCCA ATCCGCTGTA TCACGCCGAG CGCCTGTCGA AGGAGCTCGG CGGGGCGCAG
GTGTACTTCA AGCGGGAAGA CCTGAACCAC ACCGGTGCGC ACAAGGTGAA CAACACCATC
GGCCAGGCCC TGCTGGCCAA GCGCCTGGGC AAGACCCGGA TCATCGCGGA GACCGGGGCC
GGCCAGCATG GCGTGGCGAC CGCGACCGTG GCCGCGCGTC TGGGCCTGGA ATGCGTGATC
TATATGGGCG AGGAGGACAC GCGCCGGCAG ACCCCGAACG TCTATCGGAT GCGTCTGCTC
GGGGCCGAGG TCGTGGCGGT GAAATCGGGC ACGCGCACGC TGAAGGACGC CCTGAACGAG
GCGATGCGCG ACTGGGTTAC CAACGTGGAC GACACCTTCT ACATCATCGG CACCGTTGCA
GGGCCACATC CTTATCCGGC CATGGTGCGG GACTTCCAGG CGGTGATCGG CCGCGAGGCG
CGTGCGCAGC ACCTGGAGAT GACCGGGAAG CTTCCGGACG CGCTAGTAGC TTGTGTGGGT
GGAGGATCCA ACGCCATCGG GTTGTTCCAT CCGTTCCTGG ACGACGAATC GGTGGAAATG
ATCGGCGTCG AGGCCGCGGG TGCCGGGATC GAGTCCGGCC GGCACTCCGC GCCGCTTTGT
GCCGGCCAGC CAGGGGTGCT GCATGGCAAC CGCACCTATC TGATGGAGGA CGAGCACGGG
CAGATCATCG GGACGCACTC GGTCTCCGCC GGGCTCGATT ATCCCGGAGT CGGGCCGGAA
CACGCCTGGC TGAAGGATAC GGGCCGTGCG CGCTACGTCG CCGTGACCGA CGATCAGGCG
CTGGAGGGCT TTCATCGGCT CACCCGTACC GAGGGGATCA TCCCGGCACT GGAGACCTCC
CACGCGATCG CGCATGTCCT GGAGCTGGCC CCGACCATGC GTCCGGACCA GAGCATCATC
GTCAATCTGT CGGGCCGGGG GGACAAGGAC CTCAACACCG TGGCGGAACG CGAAGGAATC
ACGCTATGA
 
Protein sequence
MSKIDWAAFP DARGHFGPYG GRFVSETLMG ALDELQKAYE QYMGDADFMA ELDHDLKQFV 
GRPNPLYHAE RLSKELGGAQ VYFKREDLNH TGAHKVNNTI GQALLAKRLG KTRIIAETGA
GQHGVATATV AARLGLECVI YMGEEDTRRQ TPNVYRMRLL GAEVVAVKSG TRTLKDALNE
AMRDWVTNVD DTFYIIGTVA GPHPYPAMVR DFQAVIGREA RAQHLEMTGK LPDALVACVG
GGSNAIGLFH PFLDDESVEM IGVEAAGAGI ESGRHSAPLC AGQPGVLHGN RTYLMEDEHG
QIIGTHSVSA GLDYPGVGPE HAWLKDTGRA RYVAVTDDQA LEGFHRLTRT EGIIPALETS
HAIAHVLELA PTMRPDQSII VNLSGRGDKD LNTVAEREGI TL
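
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any analysis. The following is a minimal Python sketch, not the database's own method, that cleans such a listing, computes GC content, and translates a coding sequence. The helper names are hypothetical. The codon assignments used are those of the standard genetic code, which to my knowledge translation table 11 shares for internal codons (the two differ in the start codons they permit).

```python
# Standard codon assignments, built in NCBI order (T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAGCAAGA TCGACTGG"))  # MSKIDW
```

Applied to the first 18 bases of the gene sequence, `translate` returns MSKIDW, which matches the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` and `translate` is one way to cross-check the GC content and Protein Length fields.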