Gene TM1040_1721 details


Gene Information

Locus tag: TM1040_1721
Symbol: thrS
ID: 4075785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 1819028
End bp: 1820974
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 60%
IMG OID: 638007035
Product: threonyl-tRNA synthetase
Protein accession: YP_613716
Protein GI: 99081562
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
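
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1819028
end_bp = 1820974
gene_length_bp = 1947
protein_length_aa = 648

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue plus one stop codon
# that does not appear in the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match Gene Length
print(remainder == 0)                                # whole number of codons
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions the three fields agree with each other: the span is 1947 bp, which is 649 codons, or 648 residues once the stop codon is excluded.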


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCCAGA TTTCCCTCAC ATTCCCCGAT GGCAACGCAC GTTCCTACGA CGCAGGGATT 
ACCCCTGCCG AGGTCGCAGC CGACATCTCC ACTTCCCTTG CCAAAAAAGC GATCTCTGCC
ACCGTGGACG GTCAACACTG GGACCTGCAA TGGCCAGTGA CCCAGGATGC CGCCATCGCC
ATCCACACGA TGAAGGACGA GGCGCAGGCC AATGAGCTCA TCCGGCACGA CCTTGCGCAT
ATCATGGCGC GCGCGGTGCA GGAGATCTGG CCCGATACCA AGGTCACCAT TGGCCCGGTC
ATCGAAAATG GCTGGTATTA TGACTTTGAT CGTGCGGAAC CCTTCACCCC CGAAGATCTC
GGCACGATCG AAAAGAAGAT GAAAGAGATC ATCAACAAGC GCGAACCCGT GACCACAGAG
GTTTGGGAGC GCGAGAAGGC GATCCAGTAT TACACCGACA ACAACGAGCC CTATAAGGTC
GAGCTGATCG ACAGCATCCC CGGCGATGAG CCGCTGCGGA TGTATTGGCA TGGCGACTGG
CAGGATCTCT GCCGCGGGCC GCACCTGCAG CACACCGGCC AGGTGCCGGG CGATGCCTTC
AAGCTGATGT CGATCGCCGG CGCCTACTGG CGCGGCGACA GCGATCGCCA GATGCTCCAG
CGCATCTACG GCGTGGCCTT CACCGGCAAG GAAAAGCTCA AGGCGCATCT CACCATGCTC
GAGGAGGCCG CCAAGCGCGA CCACCGCAAG CTGGGCCGCG AGATGAACCT TTTCCACATG
CAGGAAGAAG CACCCGGCCA GGTGTTCTGG CATCCGAACG GATGGCGCAT CTACACCACG
CTTCAGGACT ACATGCGCCG CATGCAGGAT CGCGACGGCT ATGTAGAGGT CAACACGCCT
CAGGTGGTGG ACCGCAAACT CTGGGAAAAG TCCGGCCACT GGGACAAGTA TCAGGAAAAC
ATGTTCATTG TTGAGGTCGA CGAGGATCAC GCCCGCGAAA AGGCCGTGAA CGCCCTGAAG
CCGATGAACT GCCCCTGCCA CGTGCAGGTG TTCAACCAGG GGCTCAAGTC CTATCGCGAC
TTGCCTCTGC GCATGGCAGA GTTCGGCTCC TGCGCGCGCT ACGAACCCTC GGGCGCCCTG
CATGGCATCA TGCGCGTGCG CGGCTTCACG CAGGATGATG GCCACATCTT CTGCACCGAA
GATCAGATTA CATCCGAGAC CGCAAAGTTC ATCGCCTTCC TCTCCAAAGT CTATGCAGAT
CTCGGCTTTG ACAATTGGAC CATCAAGCTC TCCACCCGCC CCGAACAGCG GATCGGCTCG
GATGAGACAT GGGACAATAT GGAACAGGCC CTTGGCGATG CCTGCAAGGC CGCGGGCTAC
GACTATGAGA TCCTCGAAGG CGAAGGCGCC TTCTATGGTC CCAAACTCGA GTTCACCCTG
ACCGACGCCA TCGGCCGGAA CTGGCAATGC GGCACACTTC AGGTGGATGC CAACCTGCCC
GAACGGCTCG AAGCGAGCTT TATCGGTCAG GACGGCAGCA AGCACCGTCC GGTCATGCTG
CACCGCGCCA CCCTTGGCTC GTTCGAGCGC TTCATCGGCA TCCTGATCGA AGAGCACGCT
GGCAAGCTCC CGTTCTGGCT CGCGCCGCGT CAGGTGGTGG TCGCCTCGAT TACGTCAGAG
GCGGACGACT ATGTGAACGA AGTGGTCGAG ACCCTGCGCG CCGCCGGTGT GCGAGCCGAG
GCCGATACGC GCAATGAGAA GATCAACTAC AAGGTCCGCG AGCATTCGGT TGGCAAAGTG
CCGGTGATTC TCGCCGTTGG CCACCGCGAG GTTGAGGAGC GCACCGTCTC CGTGCGTCGT
CTTGGCGAGA AACAGACCAA GGTTGAGAGC CTCACAAATG TTACAGAGGA ACTGGCAAAG
GCCGCAACGC CGCCAGATCT TCTGTAA
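
The listing above groups bases in blocks of ten, so the spacing has to be stripped before any computation. The following is a minimal sketch of how the grouped text could be cleaned and its GC fraction computed; `clean` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the listing are used here, so the printed fraction is for that first line alone, not for the whole gene.

```python
def clean(grouped: str) -> str:
    """Drop the spaces and line breaks used to group bases in tens."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence listed above (not the full gene).
first_line = "ATGGCCCAGA TTTCCCTCAC ATTCCCCGAT GGCAACGCAC GTTCCTACGA CGCAGGGATT"
last_line = "GCCGCAACGC CGCCAGATCT TCTGTAA"

head = clean(first_line)
tail = clean(last_line)

# Basic CDS sanity checks: ATG start, and one of the three stop codons
# used by translation table 11 at the end.
starts_with_atg = head.startswith("ATG")
ends_with_stop = tail[-3:] in {"TAA", "TAG", "TGA"}

print(starts_with_atg, ends_with_stop)
print(f"GC fraction of the first line: {gc_fraction(head):.2f}")
```

To compare against the GC content reported in the record, the full sequence would need to be passed through `clean` and `gc_fraction` in the same way.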
 
Protein sequence
MAQISLTFPD GNARSYDAGI TPAEVAADIS TSLAKKAISA TVDGQHWDLQ WPVTQDAAIA 
IHTMKDEAQA NELIRHDLAH IMARAVQEIW PDTKVTIGPV IENGWYYDFD RAEPFTPEDL
GTIEKKMKEI INKREPVTTE VWEREKAIQY YTDNNEPYKV ELIDSIPGDE PLRMYWHGDW
QDLCRGPHLQ HTGQVPGDAF KLMSIAGAYW RGDSDRQMLQ RIYGVAFTGK EKLKAHLTML
EEAAKRDHRK LGREMNLFHM QEEAPGQVFW HPNGWRIYTT LQDYMRRMQD RDGYVEVNTP
QVVDRKLWEK SGHWDKYQEN MFIVEVDEDH AREKAVNALK PMNCPCHVQV FNQGLKSYRD
LPLRMAEFGS CARYEPSGAL HGIMRVRGFT QDDGHIFCTE DQITSETAKF IAFLSKVYAD
LGFDNWTIKL STRPEQRIGS DETWDNMEQA LGDACKAAGY DYEILEGEGA FYGPKLEFTL
TDAIGRNWQC GTLQVDANLP ERLEASFIGQ DGSKHRPVML HRATLGSFER FIGILIEEHA
GKLPFWLAPR QVVVASITSE ADDYVNEVVE TLRAAGVRAE ADTRNEKINY KVREHSVGKV
PVILAVGHRE VEERTVSVRR LGEKQTKVES LTNVTEELAK AATPPDLL
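
The relationship between the two listings can be illustrated by translating the start of the gene sequence and comparing it with the start of the protein sequence. This is a rough sketch, not the database's own pipeline: it builds the standard codon table, which translation table 11 shares for elongation codons, and it assumes the reading frame starts at the first base. `translate` is a hypothetical helper.

```python
# Standard genetic code, codons ordered T, C, A, G at each position.
# Translation table 11 uses the same codon-to-residue assignments;
# it differs from the standard table only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_start = "ATGGCCCAGA TTTCCCTCAC ATTCCCCGAT GGCAACGCAC GTTCCTACGA CGCAGGGATT"
protein_start = "MAQISLTFPD GNARSYDAGI"

print(translate(gene_start) == "".join(protein_start.split()))
```

The first 60 bases translate to the first 20 residues of the listed protein, which is consistent with the gene being read from its first base on the listed strand.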