Gene TK90_1358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_1358 
Symbol 
ID8807124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp1450876 
End bp1452075 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content66% 
IMG OID 
Producttryptophan synthase, beta subunit 
Protein accessionYP_003460600 
Protein GI289208534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.33574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTACC TGAAGGATTT TCCGAACCAG GAAGGCTTTT TCGGCGAGTT CGGGGGTGCG 
TTCCTGCCCC CCGAACTGGA ACCGCATTTT GCCGAGATCA ACCGGGCCTA CCTCGCCCTC
GCCCGCTCGG CCGATTTCCT GAACGAGCTG CGCTACATCC GCAAGCACTA TCAGGGTCGC
CCCACCCCGG TGTACTACGC CCACAACCTG AGCCGCGAGG CCGGCGCACA TATCTATCTC
AAGCGCGAAG ACCTGAACCA CTCCGGAGCA CACAAGCTGA ACCACTGCAT GGGCGAGGCC
CTGCTGGCCA AGCACATGGG CAAGCGCAAG CTGATCGCCG AGACCGGCGC CGGCCAGCAC
GGGGTCGCGC TGGCCACGGC AGCTGCCTAC TTCGGCATGG AATGCGAGAT CCACATGGGC
GAGATCGACA TCGCCAAGGA AGCGCCCAAC GTCACCCGCA TGAAGCTCAT GGGCGCACAG
GTCGTGCCGG TGTCCTTCGG CGGGCGCTCG CTCAAGGAGG CCGTGGACTC CGCCTTCCAG
TCCTACCTGT CGCAAGCCGA GCAGGCGCTG TTCGCGATCG GCTCCGTGGT GGGTCCGCAC
CCCTTCCCGC TGATGGTGCG CAACTTCCAG TCGGTGGTCG GCATCAAGGC GCGCGAGCAG
TTCATGGAGA TGACCGGCGG GGAACTGCCC GACCACGTGG TCGCCTGCGT TGGCGGCGGA
TCCAACGCGA TGGGCATGTT TGCCGGCTTC ATCGAGGACG CCGGCGTCCA GCTGAACGGG
GTCGAGCCAC TCGGACGCGG CACGACGCTG GGCGAGCACT CCGCCACCAT GACCTACGGC
AAGCCCGGCA TGATCCACGG GTTCAAGTGC ATGTTGCTGG CCGACGAGGA AGGCAACCCG
GCCCCGGTCC ACTCCATCGC CTCGGGCCTC GACTACCCCG GCGTCGGCCC GGAGCACTCC
TACCTGAAGA CCATCGAGCG CGTGGCCTAC CATGCGATCA GCGACGACGA AACGCTGGAG
GCCTTCTATC GACTGTCGCG CGCCGAGGGC ATCATTCCGG CGCTGGAGAG TGCCCATGCC
GTCGCCTGGG CGATGAAATA TGGCCGCGAG AATCCCGGCG TCACGATCCT CGCCAACCTG
TCCGGCCGGG GCGACAAGGA CATCGACTAC GTCACCCGCG AATTCGGCCA CGGCGACTAA
 
Protein sequence
MSYLKDFPNQ EGFFGEFGGA FLPPELEPHF AEINRAYLAL ARSADFLNEL RYIRKHYQGR 
PTPVYYAHNL SREAGAHIYL KREDLNHSGA HKLNHCMGEA LLAKHMGKRK LIAETGAGQH
GVALATAAAY FGMECEIHMG EIDIAKEAPN VTRMKLMGAQ VVPVSFGGRS LKEAVDSAFQ
SYLSQAEQAL FAIGSVVGPH PFPLMVRNFQ SVVGIKAREQ FMEMTGGELP DHVVACVGGG
SNAMGMFAGF IEDAGVQLNG VEPLGRGTTL GEHSATMTYG KPGMIHGFKC MLLADEEGNP
APVHSIASGL DYPGVGPEHS YLKTIERVAY HAISDDETLE AFYRLSRAEG IIPALESAHA
VAWAMKYGRE NPGVTILANL SGRGDKDIDY VTREFGHGD