Gene SNSL254_A1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1444 
SymbolthrS 
ID6483336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1411616 
End bp1413544 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content51% 
IMG OID642736836 
Productthreonyl-tRNA synthetase 
Protein accessionYP_002040590 
Protein GI194446059 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0960598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG 
ATGGATGTTG CTCTGGACAT TGGTCCTGGC CTGGCGAAAG CCACCATTGC GGGCCGTGTG
AATGGCGAGC TGGTTGATGC TTCCGATCTG ATTGAAAATG ATGCGACGCT TGCCATCATC
ACCGCAAAAG ATGAAGAGGG TCTGGAGATC ATTCGTCACT CTTGCGCGCA TCTGTTAGGT
CACGCAATCA AGCAACTTTG GCCGCACACG AAAATGGCGA TCGGCCCGGT TGTCGACAAC
GGTTTTTACT ATGACGTTGA TCTTGACCGC ACGCTAACTC AGGAAGATGT CGAAGCGCTC
GAAAAGCGGA TGCATGAGCT CGCCGAGAAA AATTACGACG TTATCAAGAA AAAGGTGAGC
TGGCATGACG CGCGCGAAAC CTTCGTGAAG CGCGGCGAGA CTTACAAAGT CGCTATTCTT
GATGAAAATA TCGCCCATGA TGATAAGCCA GGCTTGTACC ATCATGAAGA ATATGTCGAC
ATGTGTCGTG GTCCGCACGT GCCGAATATG CGTTTCTGCC ATCACTTTAA ACTGATGAAA
ACTGCCGGCG CATACTGGCG CGGTGACAGC AATAATAAGA TGCTGCAGCG TATTTACGGT
ACGGCATGGG CAGATAAAAA AGCCCTGAAC GCTTATCTGC AGCGCCTGGA AGAGGCCGCA
AAACGCGACC ACCGTAAAAT TGGTAAGCAG CTCGACCTGT ATCATATGCA GGAAGAAGCG
CCGGGCATGG TGTTCTGGCA TAACGACGGC TGGACTATCT TCCGCGAGCT GGAGGTCTTT
GTTCGTTCTA AACTCAAAGA GTACCAGTAT CAAGAAGTTA AAGGCCCGTT CATGATGGAC
CGTGTGCTGT GGGAAAAAAC CGGGCACTGG GACAACTATA AAGATGCGAT GTTCACCACA
TCCTCAGAAA ACCGCGAATA TTGCATCAAG CCGATGAACT GCCCGGGCCA CGTTCAGATC
TTTAACCAGG GTCTGAAATC CTATCGTGAT TTGCCGCTGC GTATGGCGGA ATTCGGTAGC
TGCCACCGTA ACGAGCCATC AGGCGCGCTG CATGGTCTGA TGCGCGTACG CGGCTTTACG
CAGGATGATG CGCATATCTT CTGCACCGAA GAGCAGATCC GCGATGAAGT TAACGCTTGT
ATTCGTATGG TCTACGATAT GTACAGCACC TTTGGCTTCG AGAAGATCGT CGTCAAGCTT
TCCACTCGTC CTGACAAGCG TATCGGCAGC GATGAGATGT GGGATCGTGC TGAGGCGGAT
CTGGCGGTTG CGCTGGAAGA AAATAATATC CCGTTTGAGT ATCAACTGGG TGAAGGCGCA
TTCTACGGTC CGAAAATTGA ATTTACCTTA TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTACCG TCTCGTCTGA GCGCCTCCTA TGTAGGCGAA
GACAACGAGC GTAAGGTGCC GGTAATGATT CACCGTGCGA TTCTTGGGTC GATGGAACGC
TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACATGGCT CGCGCCTGTT
CAGGTAGTCG TGATGAATAT TACCGATTCG CAGTCTGAAT ACGTTAACGA ATTGACGCAG
AAACTACAAA ATGCGGGCAT TCGTGTAAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT ACGTCGTGTC CCTTATATGT TGGTCTGTGG TGATAAAGAG
GTGGAAGCAG GCAAAGTTGC CGTTCGCACC CGCCGCGGTA AAGACCTGGG CAGTCTGGAC
GTAAATGACG TGATTGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TCAACAACTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHPVSP MDVALDIGPG LAKATIAGRV NGELVDASDL IENDATLAII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVVDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHDARETFVK RGETYKVAIL DENIAHDDKP GLYHHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGAL HGLMRVRGFT QDDAHIFCTE EQIRDEVNAC IRMVYDMYST FGFEKIVVKL
STRPDKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVVMNITDS QSEYVNELTQ KLQNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VEAGKVAVRT RRGKDLGSLD VNDVIEKLQQ EIRSRSLQQL EE