Gene SeD_A2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2012 
SymbolthrS 
ID6874348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp1943508 
End bp1945436 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content51% 
IMG OID642785127 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002215793 
Protein GI198243456 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.759384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.921502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG 
ATGGATGTTG CTCTGGACAT TGGTCCTGGC CTGGCGAAAG CCACCATTGC GGGCCGTGTG
AATGGCGAGC TGGTTGATGC TTCCGATCTG ATTGAAAATG ATGCGACGCT TGCCATCATC
ACCGCAAAAG ATGAAGAGGG TCTGGAGATC ATTCGTCACT CTTGCGCGCA TCTGTTAGGT
CACGCAATCA AGCAACTTTG GCCGCACACG AAAATGGCGA TCGGCCCGGT TGTCGACAAC
GGTTTTTACT ATGACGTTGA TCTTGACCGC ACGCTAACTC AGGAAGATGT CGAAGCGCTC
GAAAAGCGGA TGCATGAGCT CGCCGAGAAA AATTACGACG TTATCAAGAA AAAGGTGAGC
TGGCATGACG CGCGCGAAAC CTTCGTGAAG CGCGGCGAGA CTTACAAAGT CGCTATTCTT
GATGAAAATA TCGCCCATGA TGATAAGCCA GGCTTGTACC ATCATGAAGA ATATGTCGAC
ATGTGTCGTG GTCCGCACGT GCCGAATATG CGTTTCTGCC ATCACTTTAA ACTGATGAAA
ACTGCTGGCG CATACTGGCG CGGTGACAGC AATAATAAGA TGCTGCAGCG TATTTACGGT
ACGGCATGGG CAGATAAAAA AGCCCTGAAC GCTTATCTGC AGCGCCTGGA AGAGGCCGCA
AAACGCGACC ACCGTAAAAT TGGTAAGCAG CTCGACCTGT ATCATATGCA GGAAGAAGCG
CCGGGCATGG TGTTCTGGCA TAACGACGGC TGGACTATCT TCCGCGAGCT GGAAGTCTTT
GTTCGTTCTA AACTCAAAGA GTACCAGTAT CAAGAAGTTA AAGGCCCGTT CATGATGGAC
CGTGTGCTGT GGGAAAAAAC CGGGCACTGG GACAACTATA AAGATGCGAT GTTCACCACA
TCCTCAGAAA ACCGCGAATA TTGCATCAAG CCGATGAACT GCCCGGGCCA CGTTCAGATC
TTTAACCAGG GTCTGAAATC CTATCGTGAT TTGCCGCTGC GTATGGCGGA ATTCGGTAGC
TGCCACCGTA ACGAGCCATC AGGCGCGCTG CATGGTCTGA TGCGCGTACG CGGCTTTACG
CAGGATGATG CGCATATCTT CTGCACCGAA GAGCAGATTC GCGATGAAGT TAACGCTTGT
ATTCGTATGG TCTACGATAT GTACAGCACC TTTGGCTTCG AGAAGATCGT CGTCAAGCTT
TCCACTCGTC CTGACAAGCG TATCGGCAGC GATGAGATGT GGGATCGTGC TGAGGCGGAT
CTGGCGGTTG CGCTGGAAGA AAATAATATC CCGTTTGAGT ATCAACTGGG TGAAGGCGCA
TTCTACGGTC CGAAAATTGA ATTTACCTTA TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTACCG TCTCGTCTGA GCGCCTCCTA TGTAGGCGAA
GACAACGAGC GCAAGGTGCC GGTAATGATT CACCGTGCGA TTCTTGGGTC GATGGAACGC
TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACATGGCT CGCGCCTGTT
CAGGTAGTCG TGATGAATAT TACCGATTCG CAGTCTGAAT ACGTTAACGA ATTGACGCAG
AAACTACAAA ATGCGGGCAT TCGTGTAAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT ACGTCGTGTC CCTTATATGT TGGTCTGTGG TGATAAAGAG
GTGGAAGCAG GCAAAGTTGC CGTTCGCACC CGCCGCGGTA AAGACCTGGG CAGTCTGGAC
GTAAATGACG TGATTGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TCAACAACTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHPVSP MDVALDIGPG LAKATIAGRV NGELVDASDL IENDATLAII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVVDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHDARETFVK RGETYKVAIL DENIAHDDKP GLYHHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGAL HGLMRVRGFT QDDAHIFCTE EQIRDEVNAC IRMVYDMYST FGFEKIVVKL
STRPDKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVVMNITDS QSEYVNELTQ KLQNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VEAGKVAVRT RRGKDLGSLD VNDVIEKLQQ EIRSRSLQQL EE