Gene Information |
Locus tag | Dvul_0707 |
Symbol | thrS |
ID | 4662957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris DP4 |
Kingdom | Bacteria |
Replicon accession | NC_008751 |
Strand | + |
Start bp | 872568 |
End bp | 874502 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639818925 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_966157 |
Protein GI | 120601757 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
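The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(872568, 874502)
print(length)                  # 1935, matching "Gene Length"
print(protein_length(length))  # 644, matching "Protein Length"
```

Both results agree with the values listed in the record (1935 bp and 644 aa), which supports the inclusive-coordinate assumption for this entry.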
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00542671 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.896215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
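The field/value rows in this record can be read programmatically. The sketch below assumes each data row has the form "Field | value", and that section headings and empty rows (a field with no value, or a bare "|") should be skipped; `parse_record` is a hypothetical helper, not a tool supplied by the database.

```python
def parse_record(lines):
    """Collect 'Field | value' rows into a dict, skipping headings and empty rows."""
    record = {}
    for line in lines:
        if "|" not in line:
            continue
        key, _, value = line.partition("|")
        key = key.strip()
        value = value.strip().rstrip("|").strip()
        if key and value:  # headings have no value; blank rows have no key
            record[key] = value
    return record


rows = [
    "Plasmid Coverage information |",
    "Num covering plasmid clones | 4",
    "Plasmid unclonability p-value | 0.00542671",
    "Plasmid clonability | hitchhiker",
    "| |",
    "Fosmid Coverage information |",
    "Num covering fosmid clones | 10",
    "Fosmid unclonability p-value | 0.896215",
    "Fosmid clonability | normal",
]
coverage = parse_record(rows)
plasmid_p = float(coverage["Plasmid unclonability p-value"])
fosmid_p = float(coverage["Fosmid unclonability p-value"])
print(plasmid_p < fosmid_p)  # True
```

The sketch only reads and compares the reported values. It does not reproduce how the database assigns the "hitchhiker" or "normal" labels, since that rule is not stated in the record.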
Sequence |
Gene sequence | GTGAACGTGT CCATAGAAGG GCAGATGCTC GAAGTGGCAT CGGGCGCTTC CTGTGGTGAC GCCCTCAAGG GTGCCCTCAG CGGGAAGAAG TTCAAGAACG TGCTTGCCTG TCGCCTCGAC GGCGGCCTCG TGGACATCAC CGCCACCGTC CCCGACGGAA CGACCACCAT CGAGCCGGTC TACGCCGACT CTCCCGAAGG TCTCGACCTC ATCCGCCATT CGACGGCGCA CATCATGGCA TGTGCGGTGA AGCGTCTCTT CCCCGCCGCC AAGGTGACCA TCGGTCCCTC CATCGACAAC GGCTTCTATT ACGACTTCGA CGCCGAGCGT CCCTTCAGCC CCGAAGACTT CGAGGCCATC GAACGCGAGA TGCAGAAGAT CGTCGACGCC GCCACGCCCT TCGAACGGAG CGAGATGCCC CGTGACGAGG CCGTCGCCCT GTTCGAGGGC ATGGGCGAGA CCTACAAGGT GGAGATCATC CGCGACCTGC CCAACGACAC GGTGTCGCTG TACCGGTGCG GCGAGTTCGT CGACCTGTGC CGCGGCCCGC ACATCCCTCA TGCCGGGTTC GCCAAGGCGT TCAAGCTCAT GTCCGTGGCG GGTGCCTACT GGCGCGGTGA CGAGAAGAAC CCCATGCTCT CGCGCATTTA TGGCACGGCC TTCGCCGATG CCAAGACGCT CAAGGAGCAT CTGCACCGCA TCGAAGAGGC CAAGCGCCGC GACCACCGCA AGCTGGGGCA GCAACTCGAC CTTTTCGCCT TCCACGAGGA CGTGGCCCCC GGTATGGTCT TCTGGCATCC GAAGGGGATG CTGGTGCGCA CCATCATCGA GGACTTCCTG CGCAAGGAAC ACCTCAAGCG CCGCTACGAC ATCGTGCAGG GGCCGCAGCT GCTGCGCCGT GAACTGTGGG AGAAGTCGGG CCACTACGAC AACTACCGCG AGAACATGTA CTTCACCGAG ATAGACGAGA ACGCCTATGG CGTGAAGCCC ATGAACTGCC TCGCGCACAT GCTCATCTAT CGTAGTGCCA TCCGCAGTTA CCGTGACCTT CCGAAGCGTT TCTTCGAGCT CGGCGTCGTG CACCGGCACG AGAAGTCGGG CGTGTTGCAC GGGCTTCTGC GAGTACGCCA GTTCACGCAG GACGATGCAC ACATCATCTG CCGCCCCGAC CAGCTTGAAG ATGAAATCAT TGATGTCATC GCGCTCGTGC GCGACCTTAT GAATCTGTTC GGCTTCGACT ACAAGGTCGC CGTCTCCACC CGCCCCGAGA AGTCCATCGG CTCTGACGAG GCGTGGGAAC TGGCGACCAA CGCTCTGGTG AAAGCCGTCG AGCGCGCGGG GATTCCGTAT ACCATCAACG AGGGCGACGG CGCCTTCTAC GGCCCCAAGA TCGACGTGCG GCTCATGGAC TGCATCGGCC GCGAATGGCA GTGCTCCACC ATCCAGTGCG ATTTCACCTT GCCTGAGCGT TTCGACCTGG TGTATGTCGG TCAGGATGGC GAGCGGCATC GCCCGGTTAT GGTACACCGG GCCATACTCG GCTCGCTGGA GCGCTTCATC GGTGTGCTCA TAGAGCAGTA TGCCGGGGCC TTTCCTGCGT GGCTGGCTCC GGTTCAGGCA CGTCTGCTCA CCGTGACGGA CGCGCAGAAC GAGTTTGTCG AGTCGGCTCG CGCCGCACTC GCAAAAGCCG GTATCCGCGT CGAAGCCGAT GTGCGCAACG AGAAGCTGGG CTACAAGGTC CGTGAAGCCC AGCTTGAGAA GATACCGTAC ATCCTTGTCG TGGGAGACAA GGAGGTCGAG 
GCGGGGGGCG TCAACGTACG GTTGCGTACG GGCGAGAACC TTGGTCTCAA GAGTCTCGAC GAGGTCGTGT CGCTGCTCGA ATCGGACTGC CAGGAACCGT TTAAACGTGG AGGGATGAGC TATAGCTTCT CCTAA
|
Protein sequence | MNVSIEGQML EVASGASCGD ALKGALSGKK FKNVLACRLD GGLVDITATV PDGTTTIEPV YADSPEGLDL IRHSTAHIMA CAVKRLFPAA KVTIGPSIDN GFYYDFDAER PFSPEDFEAI EREMQKIVDA ATPFERSEMP RDEAVALFEG MGETYKVEII RDLPNDTVSL YRCGEFVDLC RGPHIPHAGF AKAFKLMSVA GAYWRGDEKN PMLSRIYGTA FADAKTLKEH LHRIEEAKRR DHRKLGQQLD LFAFHEDVAP GMVFWHPKGM LVRTIIEDFL RKEHLKRRYD IVQGPQLLRR ELWEKSGHYD NYRENMYFTE IDENAYGVKP MNCLAHMLIY RSAIRSYRDL PKRFFELGVV HRHEKSGVLH GLLRVRQFTQ DDAHIICRPD QLEDEIIDVI ALVRDLMNLF GFDYKVAVST RPEKSIGSDE AWELATNALV KAVERAGIPY TINEGDGAFY GPKIDVRLMD CIGREWQCST IQCDFTLPER FDLVYVGQDG ERHRPVMVHR AILGSLERFI GVLIEQYAGA FPAWLAPVQA RLLTVTDAQN EFVESARAAL AKAGIRVEAD VRNEKLGYKV REAQLEKIPY ILVVGDKEVE AGGVNVRLRT GENLGLKSLD EVVSLLESDC QEPFKRGGMS YSFS
|
| |
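The gene and protein sequences can be cross-checked against the "Translation table | 11" and "GC content" fields. The following is a minimal sketch using only the Python standard library. The codon assignments are those of the standard genetic code, which translation table 11 shares; the set of alternative start codons is the one listed for table 11 by NCBI. Reading the first codon as M when it is an alternative start is why the GTG that opens this gene corresponds to the leading M of the protein. The function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Codon table built in TCAG order; table 11 uses the standard codon assignments.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("GTGAACGTGT CCATAGAAGG GCAGATGCTC"))  # MNVSIEGQML
print(translate("TATAGCTTCT CCTAA"))                  # YSFS
```

The two outputs match the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the reported GC content to be checked in the same way.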