Gene SbBS512_E1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1961 
SymbolthrS 
ID6272749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp1792374 
End bp1794302 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content50% 
IMG OID641726015 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001880509 
Protein GI187734126 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000224309 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTTA TAACTCTTCC TGATGGCAGC CAACGCCATT ACGACCACGC TGTAAGCCCC 
ATGGATGTTG CGCTGGACAT TGGTCCAGGT CTGGCGAAAG CCTGTATCGC AGGGCGCGTT
AATGGCGAAC TGGTTGATGC TTGCGATCTG ATTGAAAACG ACGCACAACT GTCGATCATT
ACCGCCAAAG ACGAAGAAGG TCTGGAGATC ATTCGTCACT CCTGTGCGCA CCTGTTAGGG
CATGCGATTA AACAACTTTG GCCGCATACC AAAATGGCAA TCGGCCCGGT TATTGACAAC
GGTTTTTATT ACGACGTTGA TCTTGACCGC ACGTTAACCC AGGAAGATGT CGAAGCACTC
GAGAAGCGGA TGCATGAGCT TGCTGAGAAA AACTACGACG TCATTAAGAA GAAAGTCAGC
TGGCACGCAG CGCGTGAAAC TTTCGCCAAC CGTGGGGAGA GCTACAAAGT CTCCATTCTT
GACGAAAACA TCGCCCATGA TGACAAGCCA GGTCTGTACT TCCATGAAGA ATATGTCGAT
ATGTGCCGCG GTCCGCACGT ACCGAACATG CGTTTCTGCC ATCATTTCAA ACTAATGAAA
ACGGCAGGGG CTTACTGGCG TGGCGACAGC AACAACAAAA TGTTGCAACG TATTTACGGT
ACGGCGTGGG CAGACAAAAA AGCACTTAAC GCTTACCTGC AGCGCCTGGA AGAAGCCGCG
AAACGCGACC ACCGTAAAAT CGGTAAACAG CTCGACCTGT ACCATATGCA GGAAGAAGCG
CCGGGTATGG TATTCTGGCA CAACGACGGC TGGACCATCT TCCGTGAACT GGAAGTGTTT
GTTCGTTCTA AACTGAAAGA GTACCAGTAT CAGGAAGTTA AAGGTCCGTT CATGATGGAC
CGTGTCCTGT GGGAAAAAAC CGGTCACTGG GACAACTACA AAGATGCAAT GTTCACCACA
TCTTCTGAGA ACCGTGAATA CTGCATTAAG CCGATGAACT GCCCGGGTCA CGTACAAATT
TTCAACCAGG GGCTGAAGTC TTATCGCGAT CTGCCGCTGC GTATGGCCGA GTTTGGTAGC
TGCCACCGTA ACGAGCCGTC AGGTTCGCTG CATGGCCTGA TGCGCGTGCG TGGATTTACC
CAAGATGACG CGCATATCTT CTGTACTGAA GAACAAATTC GCGATGAAGT TAACGGATGT
ATCCGTTTAG TCTATGATAT GTACAGCACT TTTGGCTTCG AGAAGATCGT CGTCAAACTC
TCCACTCGTC CTGAAAAACG TATTGGCAGC GACGAAATGT GGGATCGTGC TGAGGCGGAC
CTGGCGGTTG CGCTGGAAGA AAACAACATC CCGTTTGAAT ATCAACTGGG TGAAGGCGCT
TTCTACGGTC CGAAAATTGA ATTTACCCTG TATGACTGCC TCGATCGTGC ATGGCAGTGC
GGTACAGTAC AGCTGGACTT CTCCTTGCCG TCTCGTCTGA GCGCCTCCTA TGTGGGCGAA
GACAACGAGC GTAAGGTACC GGTAATGATT CACCGCGCAA TTCTGGGGTC GATGGAACGT
TTCATCGGTA TCCTGACCGA AGAATTCGCT GGTTTCTTCC CGACCTGGCT TGCGCCAGTT
CAGGTTGTTA TCATGAATAT TACCGATTCA CAGTCTGAAT ACGTTAACGA ATTGACGCAA
AAACTATCAA ATGCGGGCAT TCGTGTTAAA GCAGACTTGA GAAATGAGAA GATTGGCTTT
AAAATCCGCG AGCACACTTT GCGTCGCGTC CCATATATGC TGGTCTGTGG TGATAAAGAG
GTGGAATCAG GCAAAGTTGC CGTTCGCACC CGCCGTGGTA AAGACCTGGG AAGCATGGAC
GTAAATGAAG TGATCGAGAA GTTGCAACAA GAGATTCGCA GCCGCAGTCT TAAACAATTG
GAGGAATAA
 
Protein sequence
MPVITLPDGS QRHYDHAVSP MDVALDIGPG LAKACIAGRV NGELVDACDL IENDAQLSII 
TAKDEEGLEI IRHSCAHLLG HAIKQLWPHT KMAIGPVIDN GFYYDVDLDR TLTQEDVEAL
EKRMHELAEK NYDVIKKKVS WHAARETFAN RGESYKVSIL DENIAHDDKP GLYFHEEYVD
MCRGPHVPNM RFCHHFKLMK TAGAYWRGDS NNKMLQRIYG TAWADKKALN AYLQRLEEAA
KRDHRKIGKQ LDLYHMQEEA PGMVFWHNDG WTIFRELEVF VRSKLKEYQY QEVKGPFMMD
RVLWEKTGHW DNYKDAMFTT SSENREYCIK PMNCPGHVQI FNQGLKSYRD LPLRMAEFGS
CHRNEPSGSL HGLMRVRGFT QDDAHIFCTE EQIRDEVNGC IRLVYDMYST FGFEKIVVKL
STRPEKRIGS DEMWDRAEAD LAVALEENNI PFEYQLGEGA FYGPKIEFTL YDCLDRAWQC
GTVQLDFSLP SRLSASYVGE DNERKVPVMI HRAILGSMER FIGILTEEFA GFFPTWLAPV
QVVIMNITDS QSEYVNELTQ KLSNAGIRVK ADLRNEKIGF KIREHTLRRV PYMLVCGDKE
VESGKVAVRT RRGKDLGSMD VNEVIEKLQQ EIRSRSLKQL EE