Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0172 |
Symbol | thrS |
ID | 6373826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 165416 |
End bp | 167395 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642682691 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_001958628 |
Protein GI | 189499158 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAA ACATAGATGT ACAGGCAACT GTAACCGTTA CCTTTCCCGA TGGCAGGAAT ATGTCTATTC CGTCCGGGTC TTCAGGTTAC GATATCGCAC AATCAATAGG GCACAGCCTC GCCAGGGAGG CTCTCGCGAT ACGTATCAAC GGTGAACTTG CTGATCTTGG AACCGCGGTC ACCGATGACG CCACAGTTGA AATCATCACC TTTGATCATC CGGGTGCAAC AGGCAAACAC ATATTCTGGC ACAGCGCCAG CCATATCATG GCTCAGGCTA TCGAAGAGCT TTTTCCCGGC ACGAAGTTCG GCGCCGGACC GGCTGTTGAG CAGGGCTTCT ATTACGATAT TGCCTCTGAA CACCGTTTCA ATGAAGAAGA TCTGCAAAAG ATAGAGCAGC AAATGCTTGA CATTTCTAAA CGCAGCATCG ACATCAGGCG TGAAGAGATG CCCCGAGAAA AAGCCATAGC GTTCTTTTCT GAATCCAGAA AAGATCCCTA CAAGGTGGAG ATTCTTCAGG ACACACTCAA AGAGGCCGAT TCAGTGTCGA TATACCATCA GGGAGCGTTT GCCGATCTCT GCAGCGGCCC TCACCTGCCG AACACCTCAA AGCTGAAAGC CGTCAAACTG ACAAATATTT CAGCATCTTT CTGGAGAGGA GACTCTTCCC GCGAAAGCAT GCAGAGAATC TACGGGATAG CGTTTCCTTC CGCCAAACTC CTGAAACAGC ATCTCGCCCG GTTAGAGGAA GCCAAAAAAC GGGATCATAG AAAACTGGGG GCTGAACTTG AGCTTTTTAT GCTCTCTCAG GATGTCGGCA GCGGCTTGCC GATCTGGCTG CCCAAAGGGG CGATCATTCG CAGCGAGCTC GAGGCTTTTC TGAAAGAAGA GCAGAGAAAA CGCGGCTATG TTCCTGTCTA TACTCCACAT ATCGGCAATA TCGACCTGTA CAAACGTTCG GGTCACTATC CCTACTACAG CGACTCACAG TTTCCTCCTC TTACCTACAA GGATGACCTG GGAAGAGAGG AACAGTACCT GCTCAAACCG ATGAACTGTC CTCACCATCA CCTTATTTAC AGTTCACAAT TGCGCAGCTA CCGTGATTTG CCAATCCGTA TGGCGGAATT CGGTACGGTA TACCGCCATG AACAGTCAGG TGAACTGAAC GGTCTGATCA GGGCGAGAGG CTTTACACAG GACGATTCGC ATATATACTG CCGACCAGAC CAGCTGGTTG ATGAAATCTG CGCTGCCATA GACCTGACCA AATTTGTCTT TACCACACTT GGCTTCGATG ATATAGAGGT TCGCCTCTCC CTGCATGACC CGGAGAACCA GGGGAAATAC GGCGGAACCG AGGAGGTCTG GAAACAGGCG GAAAAGGATG TCAGGGAGGC TGCTGACCGT ATGGAGATCA ACTATGTTAT CGGTATCGGC GAAGCCAGCT TTTACGGACC GAAAATTGAT TTCATTGTAC GCGACGCCCT GGGAAGAAAA TGGCAGCTCG GCACTGTCCA GGTTGATTAC GTCATGCCTG AACGGTTTGA TCTTTCCTAT ATCGGCAGTG ATGGAAAACC GCACCGTCCG GTCATTATTC ACCGAGCACC GTTTGGTTCG ATGGAACGCT TTATCGGAGT TCTCATTGAA CATACCGCAG GTAACTTCCC GTTATGGCTT GCTCCTGTTC AGGTAGCTGT TCTGCCGATT ACCGAGGAGG TTCACGCCTA TGCGGAAAGG GTTCACCAGA TGCTGATTGA CAATGGCATT CGGGCAGATC TCGATATCCG CAGCGAGAAA ATCGGCAAAA AAATACGTGA AGCAGAGGTC GGCAAAATCC CGTATATGGT TATCATCGGC CAGAAGGAAG CTGACTCGGA AGAGATTTCA TTGAGACGTC ACCGTAAAGG GGATCAAGGC TCATTGACGC TTCAGGCACT CAAAGATATG TTAGTAAAGG AAGTCCGAAA CAAATCCTGA
|
Protein sequence | MSENIDVQAT VTVTFPDGRN MSIPSGSSGY DIAQSIGHSL AREALAIRIN GELADLGTAV TDDATVEIIT FDHPGATGKH IFWHSASHIM AQAIEELFPG TKFGAGPAVE QGFYYDIASE HRFNEEDLQK IEQQMLDISK RSIDIRREEM PREKAIAFFS ESRKDPYKVE ILQDTLKEAD SVSIYHQGAF ADLCSGPHLP NTSKLKAVKL TNISASFWRG DSSRESMQRI YGIAFPSAKL LKQHLARLEE AKKRDHRKLG AELELFMLSQ DVGSGLPIWL PKGAIIRSEL EAFLKEEQRK RGYVPVYTPH IGNIDLYKRS GHYPYYSDSQ FPPLTYKDDL GREEQYLLKP MNCPHHHLIY SSQLRSYRDL PIRMAEFGTV YRHEQSGELN GLIRARGFTQ DDSHIYCRPD QLVDEICAAI DLTKFVFTTL GFDDIEVRLS LHDPENQGKY GGTEEVWKQA EKDVREAADR MEINYVIGIG EASFYGPKID FIVRDALGRK WQLGTVQVDY VMPERFDLSY IGSDGKPHRP VIIHRAPFGS MERFIGVLIE HTAGNFPLWL APVQVAVLPI TEEVHAYAER VHQMLIDNGI RADLDIRSEK IGKKIREAEV GKIPYMVIIG QKEADSEEIS LRRHRKGDQG SLTLQALKDM LVKEVRNKS
|
| |