Gene SeSA_A1428 details


Gene Information

Locus tag: SeSA_A1428
Symbol: thrS
ID: 6518692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 1376170
End bp: 1378098
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 50%
IMG OID: 642746545
Product: threonyl-tRNA synthetase
Protein accession: YP_002114350
Protein GI: 194736705
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
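
The Start bp, End bp, Gene Length, and Protein Length values listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the record does not state either convention, and the function names are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(1376170, 1378098)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the record's 1929 bp and 642 aa.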


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00369354
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00283181
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG 
ATGGATGTTG CTCTGGACAT TGGTCCTGGC CTGGCGAAAG CCACCATTGC GGGCCGTGTG
AATGGCGAGC TGGTTGATGC TTCCGATCTG ATTGAAAATG ATGCGACGCT TTCCATCATC
ACCGCAAAAG ATGAAGAGGG TCTGGAGATT ATTCGTCACT CTTGCGCGCA TCTGTTAGGT
CACGCAATCA AGCAACTTTG GCCGCACACG AAAATGGCGA TCGGCCCGGT TGTCGACAAC
GGTTTTTACT ATGACGTTGA TCTTGACCGC ACGCTAACTC AGGAAGATGT CGAAGCGCTC
GAAAAGCGGA TGCATGAGCT CGCCGAGAAA AATTACGACG TTATCAAGAA GAAAGTCAGC
TGGCACGAAG CGCGTGAAAC CTTCGTGAAG CGTGGTGAAA GCTACAAAGT TTCCATTCTT
GATGAAAACA TTGCTCATGA TGACAAGCCA GGCTTGTACC ATCATGAAGA ATATGTCGAC
ATGTGTCGTG GTCCGCACGT GCCGAATATG CGTTTCTGCC ATCACTTTAA ACTGATGAAA
ACTGCTGGCG CATACTGGCG CGGTGACAGC AATAATAAGA TGCTGCAGCG TATTTACGGT
ACGGCATGGG CAGATAAAAA AGCTCTGAAC GCTTATCTGC AGCGCCTGGA AGAGGCCGCA
AAACGCGACC ACCGTAAAAT TGGTAAGCAG CTCGACCTGT ATCATATGCA GGAAGAGGCG
CCGGGCATGG TGTTCTGGCA TAACGACGGC TGGACTATCT TCCGCGAGCT GGAGGTCTTT
GTTCGTTCTA AACTCAAAGA GTACCAGTAT CAAGAAGTTA AAGGCCCGTT CATGATGGAC
CGTGTGCTGT GGGAAAAAAC CGGGCACTGG GACAACTATA AAGATGCGAT GTTCACCACG
TCCTCAGAAA ACCGCGAATA TTGCATTAAG CCAATGAACT GCCCGGGCCA CGTTCAGATC
TTTAACCAGG GTCTGAAATC CTATCGTGAT TTGCCGCTGC GTATGGCGGA ATTCGGTAGC
TGCCACCGTA ACGAGCCATC AGGCGCGCTG CATGGTCTGA TGCGCGTACG CGGCTTTACG
CAGGATGATG CGCATATCTT CTGCACAGAA GAGCAGATCC GCGATGAAGT TAACGCTTGT
ATTCGTATGG TCTACGATAT GTACAGCACC TTTGGCTTCG AGAAGATCGT CGTCAAGCTT
TCCACTCGTC CTGACAAGCG TATTGGCAGC GATGAGATGT GGGATCGTGC TGAGGCGGAT
CTGGCGGTTG CGCTGGAAGA AAATAATATC CCGTTTGAGT ATCAACTGGG TGAAGGCGCA
TTCTACGGTC CGAAAATTGA ATTTACCTTA TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTACCG TCTCGTCTGA GCGCCTCCTA TGTAGGCGAA
GACAACGAGC GTAAGGTACC GGTAATGATT CACCGTGCGA TTCTTGGGTC GATGGAACGC
TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACATGGCT TGCGCCGGTT
CAGGTAGTCG TGATGAATAT TACCGATTCG CAGTCTGAAT ACGTTAACGA ATTGACGCAG
AAACTACAAA ATGCGGGCAT TCGTGTAAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT ACGTCGTGTC CCTTATATGT TGGTCTGTGG TGATAAAGAG
GTGGAAGCAG GCAAAGTTGC CGTTCGTACC CGCCGCGGTA AAGACCTGGG CAGTCTGGAC
GTAAATGACG TGATTGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TCAACAACTG
GAGGAATAA
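
The gene sequence above is laid out in blocks of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first and last lines of the listing, not the full gene.

```python
def clean_sequence(raw):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First and last lines of the gene sequence listing (an excerpt only).
first_line = "ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG"
last_line = "GAGGAATAA"

excerpt = clean_sequence(first_line + "\n" + last_line)
# The listing opens with an ATG start codon and closes with a TAA stop codon.
print(excerpt.startswith("ATG"), excerpt.endswith("TAA"))
print(round(gc_fraction(excerpt), 3))
```

Passing the complete listing to `clean_sequence` instead of the excerpt would give the string to check against the stated gene length and GC content.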
 
Protein sequence
MPVITLPDGS QRHYDHPVSP MDVALDIGPG LAKATIAGRV NGELVDASDL IENDATLSII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVVDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHEARETFVK RGESYKVSIL DENIAHDDKP GLYHHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGAL HGLMRVRGFT QDDAHIFCTE EQIRDEVNAC IRMVYDMYST FGFEKIVVKL
STRPDKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVVMNITDS QSEYVNELTQ KLQNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VEAGKVAVRT RRGKDLGSLD VNDVIEKLQQ EIRSRSLQQL EE