Gene Cag_1715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1715 
SymbolthrS 
ID3746981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2227260 
End bp2229233 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content48% 
IMG OID637774252 
Productthreonyl-tRNA synthetase 
Protein accessionYP_380009 
Protein GI78189671 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATC ATAAAGAATC CACGGGTGCC ATTGCCCTTA CGCTTCCTGA TAGAAGTGTA 
CGCAACGTTG CAATGGGTAG CACGGGTTAC GATGTAGCGC TCTCCATTGG GCGTAAGTTA
GCGCAAGATG CCCTTGCTAT TAAGCTCAAT GGCGTTGTTT GCGACCTTAA CACCCTTATC
AATAGCGATG CGGCTATAGA AATTATCACC TTTACTTCGC CTGAGGGTCC TGAGATTTTT
TGGCATAGTT CCAGCCATTT AATGGCGCAA GCGATTGAGG AGCTTTTTGC GGGAAGTAAG
TTTGGGGCTG GTCCTGCTAT TGAGCAAGGC TTTTACTACG ATGTTTCTTC GGAGCACCGT
TTTCGTGAAG AGGATTTACG TGCTATTGAA GCACGTATGT TGGAAATCAG CAAGCGCGAT
AGTAGTGTGC AGCGGCAGGA GATGAGCCGT GAAGAGGCTA TTGCTTTTTT CACCTCCGTT
CGTAACGATC CTTATAAAGT TGAAATTCTT ACCGAAACGC TAAAAAATGT AGAGCGCGTT
TCGCTCTACC ATCAAGGCGA CTTTACCGAC CTTTGTACGG GTCCACATTT GCCTTCTACT
GGCAAAATAA AGGCGGTGTT GCTAACGAAC ATTTCTGCCT CTTATTGGCG TGGTGATTCG
AACCGTGAGC AAATGCAGCG TATTTATGGC ATTACTTTCC CTTCGGAAAA GTTGTTGAAA
GAGCATGTTG CTCGAATTGA GGAGGCAAAG CGGCGCGACC ATCGCAAGCT TGGTGCTGAG
CTTGAGCTTT TCTTGCTCTC TCCTGAAGTT GGTAGCGGCT TGCCGATGTG GTTGCCTAAA
GGTGCCATTA TTCGTAGTGA GTTGGAATCT TTTTTGCGCG AGGAGCAGCG TAAACGTGGT
TACGTGCCTG TTTACACGCC GCATATTGGC AACATTGAGC TATACAAGCG TTCGGGGCAC
TATCCATATT ACAGCGATTC GCAATTTCCG CCACTTACCT ATCACGATGA AGATGGTAAG
CAGGAGCAGT ATTTGCTTAA GCCAATGAAC TGCCCGCACC ATCATCTTAT TTATAGCTCT
AAAATGCGGA GTTACCGCGA TTTACCGCTT CGCTTTACGG AGTTTGGAAC GGTGTATCGC
CATGAGCAAT CGGGTGAATT AAATGGCTTG GCGCGTGCGC GTGGCTTTAC GCAAGACGAT
TCGCACATTT ATTGCCGTCC CGATCAGTTG GTGGATGAAA TTTGTAGTGC TATTGAGCTG
ACGCAATTTG TCTTTAAAAC GCTCGGCTTT GCTGAAGTAC AAACACGCCT TTCGCTGCAC
GATCCTGCTA ACCAAGCAAA ATATGGTGGT ACGGCTGAAG TGTGGGAGCA GGCGGAAAAG
GATGTTCAAG AAGCCGCTGA ACGCATGGGG GTTGATTACT TTATTGGCAT TGGTGAAGCA
AGTTTTTACG GTCCCAAAAT TGACTTTATT GTGCGCGATG CTATTGGGCG CAAATGGCAG
CTTGGCACCG TGCAGGTTGA CTACGTTATG CCCGAGCGCT TCGATTTAAC TTACGTTGGT
AGCGATGGGC AAAAGCATCG TCCCGTTGTT ATTCACCGTG CGCCATTCGG CTCAATGGAG
CGCTTTATTG GTTTGCTTAT TGAGCACACC GCAGGGAACT TCCCGCTCTG GTTGGCACCT
GTGCAAGTTG CCGTGCTGCC TATTGCTGAA GAAAATCACG ATTATGCAAC CACTGTTTAC
CGCCGTTTGC TTGCGGCAGG TATTCGTGCT GAACTTGATA CGCGAAGCGA AAAAATTAAT
CGCAAAATTC GCGATGCTGA AATGAGCAAA ACCCCTTGCA TGTTGGTAAT TGGTCAAAAA
GAGCAAGCAA ATGGTGAAGT GTCGCTTCGT CGCCATCGTC AAGGTGATGC TGGGCGCTTT
GCTACCGATG AGCTGATTGA AACGTTGAAG CAAGAGATTG CTAATCGCCA GTAA
 
Protein sequence
MSDHKESTGA IALTLPDRSV RNVAMGSTGY DVALSIGRKL AQDALAIKLN GVVCDLNTLI 
NSDAAIEIIT FTSPEGPEIF WHSSSHLMAQ AIEELFAGSK FGAGPAIEQG FYYDVSSEHR
FREEDLRAIE ARMLEISKRD SSVQRQEMSR EEAIAFFTSV RNDPYKVEIL TETLKNVERV
SLYHQGDFTD LCTGPHLPST GKIKAVLLTN ISASYWRGDS NREQMQRIYG ITFPSEKLLK
EHVARIEEAK RRDHRKLGAE LELFLLSPEV GSGLPMWLPK GAIIRSELES FLREEQRKRG
YVPVYTPHIG NIELYKRSGH YPYYSDSQFP PLTYHDEDGK QEQYLLKPMN CPHHHLIYSS
KMRSYRDLPL RFTEFGTVYR HEQSGELNGL ARARGFTQDD SHIYCRPDQL VDEICSAIEL
TQFVFKTLGF AEVQTRLSLH DPANQAKYGG TAEVWEQAEK DVQEAAERMG VDYFIGIGEA
SFYGPKIDFI VRDAIGRKWQ LGTVQVDYVM PERFDLTYVG SDGQKHRPVV IHRAPFGSME
RFIGLLIEHT AGNFPLWLAP VQVAVLPIAE ENHDYATTVY RRLLAAGIRA ELDTRSEKIN
RKIRDAEMSK TPCMLVIGQK EQANGEVSLR RHRQGDAGRF ATDELIETLK QEIANRQ