Gene Cphamn1_0172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_0172 
SymbolthrS 
ID6373826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp165416 
End bp167395 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content50% 
IMG OID642682691 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001958628 
Protein GI189499158 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAAA ACATAGATGT ACAGGCAACT GTAACCGTTA CCTTTCCCGA TGGCAGGAAT 
ATGTCTATTC CGTCCGGGTC TTCAGGTTAC GATATCGCAC AATCAATAGG GCACAGCCTC
GCCAGGGAGG CTCTCGCGAT ACGTATCAAC GGTGAACTTG CTGATCTTGG AACCGCGGTC
ACCGATGACG CCACAGTTGA AATCATCACC TTTGATCATC CGGGTGCAAC AGGCAAACAC
ATATTCTGGC ACAGCGCCAG CCATATCATG GCTCAGGCTA TCGAAGAGCT TTTTCCCGGC
ACGAAGTTCG GCGCCGGACC GGCTGTTGAG CAGGGCTTCT ATTACGATAT TGCCTCTGAA
CACCGTTTCA ATGAAGAAGA TCTGCAAAAG ATAGAGCAGC AAATGCTTGA CATTTCTAAA
CGCAGCATCG ACATCAGGCG TGAAGAGATG CCCCGAGAAA AAGCCATAGC GTTCTTTTCT
GAATCCAGAA AAGATCCCTA CAAGGTGGAG ATTCTTCAGG ACACACTCAA AGAGGCCGAT
TCAGTGTCGA TATACCATCA GGGAGCGTTT GCCGATCTCT GCAGCGGCCC TCACCTGCCG
AACACCTCAA AGCTGAAAGC CGTCAAACTG ACAAATATTT CAGCATCTTT CTGGAGAGGA
GACTCTTCCC GCGAAAGCAT GCAGAGAATC TACGGGATAG CGTTTCCTTC CGCCAAACTC
CTGAAACAGC ATCTCGCCCG GTTAGAGGAA GCCAAAAAAC GGGATCATAG AAAACTGGGG
GCTGAACTTG AGCTTTTTAT GCTCTCTCAG GATGTCGGCA GCGGCTTGCC GATCTGGCTG
CCCAAAGGGG CGATCATTCG CAGCGAGCTC GAGGCTTTTC TGAAAGAAGA GCAGAGAAAA
CGCGGCTATG TTCCTGTCTA TACTCCACAT ATCGGCAATA TCGACCTGTA CAAACGTTCG
GGTCACTATC CCTACTACAG CGACTCACAG TTTCCTCCTC TTACCTACAA GGATGACCTG
GGAAGAGAGG AACAGTACCT GCTCAAACCG ATGAACTGTC CTCACCATCA CCTTATTTAC
AGTTCACAAT TGCGCAGCTA CCGTGATTTG CCAATCCGTA TGGCGGAATT CGGTACGGTA
TACCGCCATG AACAGTCAGG TGAACTGAAC GGTCTGATCA GGGCGAGAGG CTTTACACAG
GACGATTCGC ATATATACTG CCGACCAGAC CAGCTGGTTG ATGAAATCTG CGCTGCCATA
GACCTGACCA AATTTGTCTT TACCACACTT GGCTTCGATG ATATAGAGGT TCGCCTCTCC
CTGCATGACC CGGAGAACCA GGGGAAATAC GGCGGAACCG AGGAGGTCTG GAAACAGGCG
GAAAAGGATG TCAGGGAGGC TGCTGACCGT ATGGAGATCA ACTATGTTAT CGGTATCGGC
GAAGCCAGCT TTTACGGACC GAAAATTGAT TTCATTGTAC GCGACGCCCT GGGAAGAAAA
TGGCAGCTCG GCACTGTCCA GGTTGATTAC GTCATGCCTG AACGGTTTGA TCTTTCCTAT
ATCGGCAGTG ATGGAAAACC GCACCGTCCG GTCATTATTC ACCGAGCACC GTTTGGTTCG
ATGGAACGCT TTATCGGAGT TCTCATTGAA CATACCGCAG GTAACTTCCC GTTATGGCTT
GCTCCTGTTC AGGTAGCTGT TCTGCCGATT ACCGAGGAGG TTCACGCCTA TGCGGAAAGG
GTTCACCAGA TGCTGATTGA CAATGGCATT CGGGCAGATC TCGATATCCG CAGCGAGAAA
ATCGGCAAAA AAATACGTGA AGCAGAGGTC GGCAAAATCC CGTATATGGT TATCATCGGC
CAGAAGGAAG CTGACTCGGA AGAGATTTCA TTGAGACGTC ACCGTAAAGG GGATCAAGGC
TCATTGACGC TTCAGGCACT CAAAGATATG TTAGTAAAGG AAGTCCGAAA CAAATCCTGA
 
Protein sequence
MSENIDVQAT VTVTFPDGRN MSIPSGSSGY DIAQSIGHSL AREALAIRIN GELADLGTAV 
TDDATVEIIT FDHPGATGKH IFWHSASHIM AQAIEELFPG TKFGAGPAVE QGFYYDIASE
HRFNEEDLQK IEQQMLDISK RSIDIRREEM PREKAIAFFS ESRKDPYKVE ILQDTLKEAD
SVSIYHQGAF ADLCSGPHLP NTSKLKAVKL TNISASFWRG DSSRESMQRI YGIAFPSAKL
LKQHLARLEE AKKRDHRKLG AELELFMLSQ DVGSGLPIWL PKGAIIRSEL EAFLKEEQRK
RGYVPVYTPH IGNIDLYKRS GHYPYYSDSQ FPPLTYKDDL GREEQYLLKP MNCPHHHLIY
SSQLRSYRDL PIRMAEFGTV YRHEQSGELN GLIRARGFTQ DDSHIYCRPD QLVDEICAAI
DLTKFVFTTL GFDDIEVRLS LHDPENQGKY GGTEEVWKQA EKDVREAADR MEINYVIGIG
EASFYGPKID FIVRDALGRK WQLGTVQVDY VMPERFDLSY IGSDGKPHRP VIIHRAPFGS
MERFIGVLIE HTAGNFPLWL APVQVAVLPI TEEVHAYAER VHQMLIDNGI RADLDIRSEK
IGKKIREAEV GKIPYMVIIG QKEADSEEIS LRRHRKGDQG SLTLQALKDM LVKEVRNKS