Gene TK90_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTK90_0140 
Symbol 
ID8805869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThioalkalivibrio sp. K90mix 
KingdomBacteria 
Replicon accessionNC_013889 
Strand
Start bp142782 
End bp144671 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content67% 
IMG OID 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003459392 
Protein GI289207326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.579099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.193074 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA TCCCCGAGTC GTTTGCCCGC AAGACCGCCG AGGTCTCCGC GGAGGTCACC 
CGCCCGTTCC CGAACTCCGA GAAGCGTTAC GTCGAGGGTT CGCGCGCGGA CATCCGCGTG
CCGTATCGCG AGGTCGCGCA GACGCCGACG ATGACCTCCG GCACCCCCGA AGAGAACCCG
CCGATCCCGG TCTACGACAC CTCCGGCCCC TACACCGACC CCGACGCAAG CATCGACCTG
CACCGGGGGC TGGCGGACCT GCGCGGCCCC TGGATCGACG AACGCGCGGA CACCGAGTGG
CTGGACGGCC CGAGCTCCGA GTACGGCCGC GCCCGCGCCG CCGATCCGGA ACTGGCCAGC
CTGCGCTTCG GGCACATCCG CCCGCCGCGC CGCGCGCAGG CCGGCGCGAA CGTCTCGCAG
ATGCACTACG CCCGGAAGGG CATCATCACC CCGGAGATGG AGTACGTCGC GATCCGCGAG
AACAACCGCC TGCAGGAGCT GCGCGCCGAC CCGCGCTACC AGAAGCTGCT GCGCCAGCAC
GCCGGCGAGT CGTTCGGCGC CTCCATCCCG GATGAAATCA CCCCCGAGTT CGTGCGTGAC
GAGATTGCCC GCGGTCGCGC GATCATCCCC GCCAACATCA ACCACCCCGA GCTCGAGCCG
ATGATCATCG GCCGCAACTT CCGGGTGAAG GTGAACACCA ACATCGGCAA CTCCGCGGTC
ACCTCCTCGA TCGAGGAGGA GGTAGAAAAG CTTGTCTGGT CCGCGCGCTG GGGCGGCGAC
ACCCTGATGG ACCTGTCCAC CGGCAAGAAC ATCCACGAGA CCCGCGAGTG GATCCTGCGC
AACTCGCCGG TACCGATCGG CACCGTGCCG ATCTACCAGG CGCTGGAGAA GGTCGACGGC
AAGCCCGAGG ACCTGACCTG GGAGCTGTTC CGCGACACCC TGATCGAGCA GGCCGAGCAG
GGCGTGGATT ACTTCACCAT CCACGCCGGC GTGCGGTTGC AGTACGTGCC GCTGACTGCT
GACCGCGTGA CCGGCATCGT CTCCCGCGGC GGTTCGATCA TGGCCAAGTG GTGCCTGGCG
CATCACCAGG AGAACTTCCT CTACACCCAC TTTGACGAGA TCTGCGAGAT CATGAAGGCC
TTCGACGTCT CGTTCTCGCT GGGGGACGGC CTGCGCCCCG GCTGCCTGGC CGATGCCAAC
GACGCCGCCC AGTTCGGCGA GCTGGAGACC CTGGGCGAGC TGACGCAGAA GGCCTGGGCG
CACGACGTGC AGGTGATGAT CGAGGGCCCC GGCCACGTCC CGCTGCAGAA GGTGAAGGAG
AACGTCGACA AGGAGCTGGA GGACTGCTTC GAGGCGCCGT TCTACACCCT TGGCCCGCTG
GTGACCGACA TCGCCCCGGC CTACGACCAC ATCACCTCCG GTATCGGCGC GGCCAACATC
GGCTGGTACG GCACCGCGAT GCTCTGCTAC GTGACGCCGA AAGAACATCT GGGCCTGCCC
AACAAGCAGG ACGTGCGCGA CGGCATCATC ACCTACAAGA TCGCCGCCCA CGCCGCCGAC
CTGGCCAAGG GCTTCCCCGG CGCGCAGCTG CAGGACAACG CCCTGTCCAA GGCACGCTTC
GAGTTCCGCT GGGAGGACCA GTTCAACCTG GCACTGGACC CGGAGCGCGC GCGCGAATTC
CACGACGAGA CCCTGCCGAA GGAGGCGCAC AAGGTCGCCC ACTTCTGCTC CATGTGCGGC
CCGAACTTCT GCTCCATGAA GATCACCCAG GACGTGCGCG ACTATGCCGC GGCCCAGGGC
ATCAGCGCTG ACGAGGCCCT GCAGAAGGGC ATGCAGGAGA AGGCGGTGGA ATTCGTCGAG
AAGGGCGCGG AGATCTACCG CAAGACCTGA
 
Protein sequence
MSAIPESFAR KTAEVSAEVT RPFPNSEKRY VEGSRADIRV PYREVAQTPT MTSGTPEENP 
PIPVYDTSGP YTDPDASIDL HRGLADLRGP WIDERADTEW LDGPSSEYGR ARAADPELAS
LRFGHIRPPR RAQAGANVSQ MHYARKGIIT PEMEYVAIRE NNRLQELRAD PRYQKLLRQH
AGESFGASIP DEITPEFVRD EIARGRAIIP ANINHPELEP MIIGRNFRVK VNTNIGNSAV
TSSIEEEVEK LVWSARWGGD TLMDLSTGKN IHETREWILR NSPVPIGTVP IYQALEKVDG
KPEDLTWELF RDTLIEQAEQ GVDYFTIHAG VRLQYVPLTA DRVTGIVSRG GSIMAKWCLA
HHQENFLYTH FDEICEIMKA FDVSFSLGDG LRPGCLADAN DAAQFGELET LGELTQKAWA
HDVQVMIEGP GHVPLQKVKE NVDKELEDCF EAPFYTLGPL VTDIAPAYDH ITSGIGAANI
GWYGTAMLCY VTPKEHLGLP NKQDVRDGII TYKIAAHAAD LAKGFPGAQL QDNALSKARF
EFRWEDQFNL ALDPERAREF HDETLPKEAH KVAHFCSMCG PNFCSMKITQ DVRDYAAAQG
ISADEALQKG MQEKAVEFVE KGAEIYRKT