Gene Acid345_4736 details


Gene Information

Locus tag: Acid345_4736
Symbol: thrS
ID: 4070674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5597947
End bp: 5599926
Gene Length: 1980 bp
Protein Length: 659 aa
Translation table: 11
GC content: 58%
IMG OID: 637986780
Product: threonyl-tRNA synthetase
Protein accession: YP_593809
Protein GI: 94971761
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0441] Threonyl-tRNA synthetase
TIGRFAM ID: [TIGR00418] threonyl-tRNA synthetase
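
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span on the replicon
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Acid345_4736.
print(cds_lengths(5597947, 5599926))  # (1980, 659)
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.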


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.556159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGACC AAATCAAAGT AAAACTCCCC GACGGAAGCG TAAAAGAAGT CTCCAAAGGC 
ACCACTGCCC TCGATATCGC AAAAGGCATC AGCCCGCGCC TGGCCGACGC CGCTTTGGCC
GCGAAGGTCG CACCGCTTCC ACACAACGGC GACCAGCCCG AAGCAAGGCT CGTGGACCTG
ACTCGTCCGC TTGAAGAAGA CAGCGAACTG AAACTCCTCA CCGATCGCGA CCCCGAAGCG
CTCGAGGTCT ATCGCCACTC GTCCGCGCAC TTGCTGGCGG CAGCCGTACT CGATCTCTTC
CCGGAAACCA AGCTCGGCCA TGGTCCCTCC ACCGAGAACG GTTTCTTCTA CGATTTCTAT
CGGCAAACGC CGTTCACCCC CGAAGACCTC GAGAAGATCG AGAAGCGCAT GCAGGAGTTG
GTGAAGGAAG ACGTGCCTTA CGCGCGTGAG TTCCTGCCCC GAGAGGAAAG TCTGGAGCGC
TTCAAGACCG AAGGCGACTT CATGAAGTGC CACTTCATCG AACAGTTCAC CAAGCCCGAT
GAAAAGATCT CGATTTATAA GACCGGCAAG TTCCTCGACT TTTGCCGCGG CCCGCACATT
CCCTCGACCG GGAAGATCAA GGCGTTCAAG CTGCTGAATA TCGCCGGCGC CTACTGGCTC
GGCGACGAGA AGAACCCGCA ACTCCAGCGC ATCTATGGCA CCTCGTTCTT TTCGAAGAAA
GACATGGACG AGTACTTCGC CAAGCTGGAA GAAGCGAAGA AGCGCGATCA TCGCGTGCTC
GGCAAGCAGC TCGATTTGTT CTCGATTCAA GAACTCGCCG GCCCCGGGCT GATCTTCTGG
CATCCGAAGG GCGGCATCAT TCGCAAGGAG ATGGAAGACT GGATGCGCGA GGAGTATCTG
AAACGCGGAT ACTCGCTCGT CGTAACTCCG CATGTGGCGC GCACCGACCT CTGGAAGATC
AGCGGCCACA CCGGTTATTA CAAGCAGAAC TTCTTCACGC CCATGGAACT CGATGATGCC
GAGTACATGC TGAAGCCGAT GAACTGCCCC GGCCATGTCC TCATTTATCG TGACCAGCTC
CGTTCCTATC GCGATCTGCC CATGCGTCTC GGTGAGATGG GAACGGTATA CCGCTACGAG
CGCTCCGGCG TGATGCACGG GTTGTTGCGT GTCCGCGGCT TTACCCAGGA CGATGCGCAC
ATCTTCTGCA CGCCCAGCCA GATTGAAGAC GAAATCAGCG GCTGTATCGA TTTCGCCATC
TCTGTCCTGC ACACCTACGG CTTCAACGAG TTCAAGGTTG AACTGAGCGA GTGGGATCCG
AATGATCGCA AGAGCTTCAT CGGAACCGAC GAGCAGTGGA ACCTCGCACA GGGCTCGCTG
AAGAAGGTGC TCGACGCGCG TGGGATTCCG TATAAGTCCA TGCCAGGCGA AGCGGCATTC
TACGGGCCGA AGATTGACGT CAAGCTCGTG GACGCCATCG GACGCCTCTG GCAGCTCTCG
ACGGTGCAGT TCGACTTCAC CTTGCCGCAG CGCTTCGAAC TTGAGTACGT GGGCGAAGAC
GGCAAGCGCC ATCAGCCGCT CATGGTGCAC CGTGCGCTCT ACGGCTCCAT TGAACGCTTC
TTCGGCGTGC TCATCGAGCA CTATGCGGGC GCGTTCCCGG TGTGGCTATC ACCAGTGCAG
ACGGTGCTGG TGCCCATCGG CGAAAAGCAC CTTGAGTATG CCAACAAGGT TGGAGACGTG
CTTAAGGCCA AGGGCATCCG CGTGGAAGTG GACGGGCGCA ACGAGAAGAT GAACGCGAAG
ATCCGCGAGC ATGCGTTGCA GAAAGTGCCG TTCATCCTCG TCGTGGGCGA CAAGGAGGCA
GAGGCCACCT CGGTGAATGT CCGCACCCGC GGCAAAGATA AGACGGAGAC GGTGCCACTC
GATTCCTTCG TGGAGCGAAT TGAGAAGCTG ATCGCCGAGA AGAAGCCTAC GCTGGATTAG
 
Protein sequence
MSDQIKVKLP DGSVKEVSKG TTALDIAKGI SPRLADAALA AKVAPLPHNG DQPEARLVDL 
TRPLEEDSEL KLLTDRDPEA LEVYRHSSAH LLAAAVLDLF PETKLGHGPS TENGFFYDFY
RQTPFTPEDL EKIEKRMQEL VKEDVPYARE FLPREESLER FKTEGDFMKC HFIEQFTKPD
EKISIYKTGK FLDFCRGPHI PSTGKIKAFK LLNIAGAYWL GDEKNPQLQR IYGTSFFSKK
DMDEYFAKLE EAKKRDHRVL GKQLDLFSIQ ELAGPGLIFW HPKGGIIRKE MEDWMREEYL
KRGYSLVVTP HVARTDLWKI SGHTGYYKQN FFTPMELDDA EYMLKPMNCP GHVLIYRDQL
RSYRDLPMRL GEMGTVYRYE RSGVMHGLLR VRGFTQDDAH IFCTPSQIED EISGCIDFAI
SVLHTYGFNE FKVELSEWDP NDRKSFIGTD EQWNLAQGSL KKVLDARGIP YKSMPGEAAF
YGPKIDVKLV DAIGRLWQLS TVQFDFTLPQ RFELEYVGED GKRHQPLMVH RALYGSIERF
FGVLIEHYAG AFPVWLSPVQ TVLVPIGEKH LEYANKVGDV LKAKGIRVEV DGRNEKMNAK
IREHALQKVP FILVVGDKEA EATSVNVRTR GKDKTETVPL DSFVERIEKL IAEKKPTLD
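
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the amino-acid assignments of translation table 11 match the standard code (the two tables differ in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built from the compact NCBI-style listing (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGTCTGACC AAATCAAAGT"))  # MSDQIK
```

Pasting the full gene sequence into `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in the record.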