Gene SeAg_B1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B1840 
SymbolthrS 
ID6794289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp1801854 
End bp1803782 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID642776070 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002146704 
Protein GI197251550 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.02796e-09 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG 
ATGGATGTTG CTCTGGACAT TGGTCCTGGC CTGGCGAAAG CCACCATTGC GGGCCGTGTG
AATGGCGAGC TGGTTGATGC TTCCGATCTG ATTGAAAATG ATGCGACGCT TTCCATCATC
ACCGCAAAAG ATGAAGAGGG TCTGGAGATC ATTCGTCACT CTTGCGCGCA TCTGTTAGGT
CACGCTATCA AGCAACTTTG GCCGCACACG AAAATGGCGA TCGGCCCGGT TGTCGACAAC
GGTTTTTACT ATGACGTTGA TCTTGACCGC ACGCTAACTC AGGAAGATGT CGAAGCGCTC
GAAAAGCGGA TGCATGAGCT CGCCGAGAAA AATTACGACG TTATCAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC CTTCGTGAAG CGTGGTGAAA GCTACAAAGT TTCCATTCTT
GATGAAAACA TTGCTCATGA TGACAAGCCA GGCTTGTACC ATCATGAAGA ATATGTCGAC
ATGTGTCGTG GTCCGCACGT GCCGAATATG CGTTTCTGCC ATCACTTTAA ACTGATGAAA
ACTGCCGGCG CATACTGGCG CGGTGACAGC AATAATAAGA TGTTGCAGCG TATTTACGGT
ACGGCATGGG CAGATAAAAA AGCCCTGAAC GCTTATCTGC AGCGCCTGGA AGAGGCCGCA
AAACGCGACC ACCGTAAAAT TGGTAAGCAG CTCGACCTGT ATCATATGCA GGAAGAAGCG
CCGGGCATGG TGTTCTGGCA TAACGACGGC TGGACTATCT TCCGCGAGCT GGAGGTCTTT
GTTCGTTCTA AACTCAAAGA GTACCAGTAT CAAGAAGTTA AAGGCCCGTT CATGATGGAC
CGTGTGCTGT GGGAAAAAAC CGGGCACTGG GACAACTATA AAGATGCGAT GTTCACCACA
TCTTCAGAAA ACCGCGAATA TTGCATCAAG CCGATGAACT GCCCGGGCCA CGTTCAGATC
TTTAACCAGG GTCTGAAATC CTATCGTGAT TTGCCGCTGC GTATGGCGGA ATTCGGTAGC
TGCCACCGTA ACGAGCCATC AGGCGCGCTG CATGGTCTGA TGCGCGTACG CGGCTTTACG
CAGGATGATG CGCATATCTT CTGCACCGAA GAGCAGATCC GCGATGAAGT TAACGCTTGT
ATTCGTATGG TCTACGATAT GTACAGCACC TTTGGCTTCG AGAAGATCGT CGTCAAGCTT
TCCACTCGTC CTGATAAGCG TATCGGCAGC GATGAGATGT GGGATCGTGC TGAGGCGGAT
CTGGCGGTTG CGCTGGAAGA AAATAATATC CCGTTTGAGT ATCAACTGGG TGAAGGCGCA
TTCTACGGTC CGAAAATTGA ATTTACCTTA TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTACCG TCTCGTCTGA GCGCCTCCTA TGTAGGCGAA
GACAACGAGC GTAAGGTGCC GGTAATGATT CACCGTGCGA TTCTTGGGTC GATGGAACGC
TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACATGGCT CGCGCCTGTT
CAGGTAGTCG TGATGAATAT TACCGATTCG CAGTCTGAAT ACGTTAACGA ATTGACGCAG
AAACTACAAA ATGCGGGCAT TCGTGTAAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT ACGTCGTGTC CCTTATATGT TGGTCTGTGG TGATAAAGAG
GTGGAAGCAG GCAAAGTTGC CGTTCGCACC CGCCGCGGTA AAGACCTGGG CAGTCTGGAC
GTAAATGACG TGATTGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TCAACAACTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHPVSP MDVALDIGPG LAKATIAGRV NGELVDASDL IENDATLSII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVVDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFVK RGESYKVSIL DENIAHDDKP GLYHHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGAL HGLMRVRGFT QDDAHIFCTE EQIRDEVNAC IRMVYDMYST FGFEKIVVKL
STRPDKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVVMNITDS QSEYVNELTQ KLQNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VEAGKVAVRT RRGKDLGSLD VNDVIEKLQQ EIRSRSLQQL EE