Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A1428 |
Symbol | thrS |
ID | 6518692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 1376170 |
End bp | 1378098 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642746545 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_002114350 |
Protein GI | 194736705 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00369354 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00283181 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTGTTA TTACTCTTCC TGATGGCAGC CAACGCCATT ATGACCACCC TGTAAGCCCG ATGGATGTTG CTCTGGACAT TGGTCCTGGC CTGGCGAAAG CCACCATTGC GGGCCGTGTG AATGGCGAGC TGGTTGATGC TTCCGATCTG ATTGAAAATG ATGCGACGCT TTCCATCATC ACCGCAAAAG ATGAAGAGGG TCTGGAGATT ATTCGTCACT CTTGCGCGCA TCTGTTAGGT CACGCAATCA AGCAACTTTG GCCGCACACG AAAATGGCGA TCGGCCCGGT TGTCGACAAC GGTTTTTACT ATGACGTTGA TCTTGACCGC ACGCTAACTC AGGAAGATGT CGAAGCGCTC GAAAAGCGGA TGCATGAGCT CGCCGAGAAA AATTACGACG TTATCAAGAA GAAAGTCAGC TGGCACGAAG CGCGTGAAAC CTTCGTGAAG CGTGGTGAAA GCTACAAAGT TTCCATTCTT GATGAAAACA TTGCTCATGA TGACAAGCCA GGCTTGTACC ATCATGAAGA ATATGTCGAC ATGTGTCGTG GTCCGCACGT GCCGAATATG CGTTTCTGCC ATCACTTTAA ACTGATGAAA ACTGCTGGCG CATACTGGCG CGGTGACAGC AATAATAAGA TGCTGCAGCG TATTTACGGT ACGGCATGGG CAGATAAAAA AGCTCTGAAC GCTTATCTGC AGCGCCTGGA AGAGGCCGCA AAACGCGACC ACCGTAAAAT TGGTAAGCAG CTCGACCTGT ATCATATGCA GGAAGAGGCG CCGGGCATGG TGTTCTGGCA TAACGACGGC TGGACTATCT TCCGCGAGCT GGAGGTCTTT GTTCGTTCTA AACTCAAAGA GTACCAGTAT CAAGAAGTTA AAGGCCCGTT CATGATGGAC CGTGTGCTGT GGGAAAAAAC CGGGCACTGG GACAACTATA AAGATGCGAT GTTCACCACG TCCTCAGAAA ACCGCGAATA TTGCATTAAG CCAATGAACT GCCCGGGCCA CGTTCAGATC TTTAACCAGG GTCTGAAATC CTATCGTGAT TTGCCGCTGC GTATGGCGGA ATTCGGTAGC TGCCACCGTA ACGAGCCATC AGGCGCGCTG CATGGTCTGA TGCGCGTACG CGGCTTTACG CAGGATGATG CGCATATCTT CTGCACAGAA GAGCAGATCC GCGATGAAGT TAACGCTTGT ATTCGTATGG TCTACGATAT GTACAGCACC TTTGGCTTCG AGAAGATCGT CGTCAAGCTT TCCACTCGTC CTGACAAGCG TATTGGCAGC GATGAGATGT GGGATCGTGC TGAGGCGGAT CTGGCGGTTG CGCTGGAAGA AAATAATATC CCGTTTGAGT ATCAACTGGG TGAAGGCGCA TTCTACGGTC CGAAAATTGA ATTTACCTTA TATGACTGCC TCGATCGTGC ATGGCAGTGC GGTACAGTAC AGCTGGACTT CTCCTTACCG TCTCGTCTGA GCGCCTCCTA TGTAGGCGAA GACAACGAGC GTAAGGTACC GGTAATGATT CACCGTGCGA TTCTTGGGTC GATGGAACGC TTCATCGGTA TCCTGACCGA AGAGTTCGCT GGTTTCTTCC CGACATGGCT TGCGCCGGTT CAGGTAGTCG TGATGAATAT TACCGATTCG CAGTCTGAAT ACGTTAACGA ATTGACGCAG AAACTACAAA ATGCGGGCAT TCGTGTAAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT AAAATCCGCG AGCACACTTT ACGTCGTGTC CCTTATATGT TGGTCTGTGG TGATAAAGAG GTGGAAGCAG GCAAAGTTGC CGTTCGTACC CGCCGCGGTA AAGACCTGGG CAGTCTGGAC GTAAATGACG TGATTGAGAA GCTGCAACAA GAGATTCGCA GCCGCAGTCT TCAACAACTG GAGGAATAA
|
Protein sequence | MPVITLPDGS QRHYDHPVSP MDVALDIGPG LAKATIAGRV NGELVDASDL IENDATLSII TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVVDN GFYYDVDLDR TLTQEDVEAL EKRMHELAEK NYDVIKKKVS WHEARETFVK RGESYKVSIL DENIAHDDKP GLYHHEEYVD MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS CHRNEPSGAL HGLMRVRGFT QDDAHIFCTE EQIRDEVNAC IRMVYDMYST FGFEKIVVKL STRPDKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV QVVVMNITDS QSEYVNELTQ KLQNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE VEAGKVAVRT RRGKDLGSLD VNDVIEKLQQ EIRSRSLQQL EE
|
| |