Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3319 |
Symbol | |
ID | 8743939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 3418327 |
End bp | 3421113 |
Gene Length | 2787 bp |
Protein Length | 928 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646513902 |
Product | valyl-tRNA synthetase |
Protein accession | YP_003404856 |
Protein GI | 284166577 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGG ACGCGCCCGA ATCGGAGCCG GAGAGCGATC CCGCTGATAG CGAGCCGACC CTCGAGGGCG GTTACGATCC CGAAGCGGTC GAGGATCGCT GGCAGCGGCG CTGGATCGAC GCGGACGTCT ACGCCTACGA GAGCGACGCC GAGCGCGATC CCAACACGGT CTACGCGATC GACACGCCGC CGCCGACGGT CTCGGGCAGC CTGCACATGG GCCACCTCTA CGGCTCGACC CTACAGGACT TCGCCGCGCG ATACCAGCGG ATGGCCGACG GCGACGTGCT CTTCCCCTTC GGCTACGACG ACAATGGGAT CGCCAGCGAG CGACTGACCG AGTCGGAACT GGACATCCGC CACCAGGACT ACGAGCGCCG CGAGTTCCAG GAGCTCTGCC GCGAGGTCTG TCAGGAGTAC GAGGCCGAGT TCACGGAGAA GATGCAGGAT CTCGGCACCT CGATCGACTG GAATAACACC TACAAGACGA TCGAACCTCG CGTCCAGCGG GTCTCACAGC TCTCCTTCCT CGATCTCTAC GAGAAGGGAC GGGAGTATCG CAAGAAGGCG CCCGCGATCT GGTGTCCCGA CTGCGAGACG GCCATCTCGC AGGTCGAGAT GGAAGACGAC GAGCGCGGCT CGCACTTCAA CGACATCGCG TTCGACGTGG CAAGCGACGG TGCGGACCGC GAGGAGTTCG TCATCTCCAC GACGCGACCG GAACTCATCC CCGCCTGCGT CTCCGTCTTC GTCCACCCCG ACGACGAGGA GAACCAGGAT CTGGTCGGCG AGACCGCCCG CGTCCCGATC TTCGAGCAGG AGGTGCCGAT CATCGAGGAC GAGCGCGTCG ACATGGAGAA GGGCAGCGGC GTCGTGATGT GCTGTACCTT CGGCGACCAG AACGACATCG AGTGGTACCA GGCCCACGAC CTGCCGCTGC GCGTGGCCAT CGACGAGTCC GCGACGATGA CCGACCTCGC CGGCGACTAC GAGGGAATGT CGACTGAGGA GGCCCGCGAG GCCATCGTCG AAGACCTCGA GGACGAGGGG TACCTGCGCG ACCGCTGGGA GATCTCCCAC GCGGTGCAGG TCCACGAACG CTGTGACACC GCCGTCGAGT ACCGCGTCTC CAAGCAGTGG TACGTCGAAA TTCTGGACCA CAAGGAGGAG TACCTCGAGG CCGGCCGGGA GATGGACTGG TACCCCGAGA AGATGTTCAG CCGCTACAAA CACTGGATCG AGGGCCTCGA GTGGGACTGG CTCATCTCGC GCCAGCGCGA CTCGGGGATC CCATTCCCGG TCTGGTACTG CGCGGACTGC GACCACGAGA TCATGGCCGA CCGGGAGACG CTCCCGGTCG ATCCGCTCTC GGACGAGCCG CCCGTCGATA GCTGTCCCGA GTGCGGGGCT GACGACTTCG TCGCCGAAGA GGACGTCTTC GACACGTGGG CCACCTCCTC GCTGACCCCC CTGATCAACG CGGGCTGGGA CTGGGACGAA GACGCCGAGG AGTTCCGAAT GGAGAAACCG GAGCTGTACC CGTTCGACCT CCGTCCGCAG GGCCACGACA TCATCTCGTT CTGGCTGTTC CACACCATCG TCAAGTGCTA CGAGCACACC GGCGAGGTGC CCTTCGACGC GACGATGATC AACGGCCACG TCTTAGACGA GAACCGCGAG AAGATGTCCA AGTCCCGTGG CAACGTCGTC GAACCCGACG AGGTGCTCGC CGACTTCCCC GTCGACGCCG TCCGCTTCTG GGCCGCGAGC GCGGCGGTCG GCGACGACTT CCCGTATCAG GAGAAGGACC TCACCGCGGG CGAGAAGCTC CTGCGCAAGC TCTGGAACGC CTCAAAGCTC GTCGACACGC TCGCGCCCCG CGAGCCCGAG GAACCCGCGG AGCTCGAGGC GATCGACCGC TGGCTGCTGG CGGAACTCGA CGACGCCGTC GACGATCTGA CCGCCCACCT CGAGGAGTAC GAGTTCGCGA AGGCCCGCGA TCGGCTGCGG ACCTTCTTCT GGAACACGTT CTGCGACGAC TATCTCGAGA TCGCCAAGAC GCGCGAGGAC GAGCCGTCGA CGCAGTACGC GCTGCGGACG GCCCACCGGA CCTTCCTCGA GCTGTGGGCG CCGTTCCTGC CCCACGCGAC CGAGGAGATC TGGCAGGCCG TCTACAGCGA TGATCCCGCG GACCTCGAGA ACACCAGCAT CCACGTCCGC GACTGGCCCA GCCCGCAGGG GTACGAGGCC GACCTCGAGG CCGGCGAGGC CGCCATGGAC GTCATCTCGG CGCTCCGGCG CTACAAGAGC GAGAATCAGC TGCCGCTGAA CGCCGATCTC GAGTCGGTGT CGGTCTACGG CCCCATCGAA GGCTTCGAGG ACGCCATCCA GAACGTCATG CACGTCCAGG AGCTGACCGT CCTCGAAGAA GAACCCGAGG TCACCACCGA GATCGCGTCG ATTGACCTCG ACTACTCGAC GCTCGGACCG AAGTTCGGCT CGAAGGTCGG CGAGATCGAT TCCGGAATCG AGAGCGGCGA GTACGAGATC GTGAGCGCGG AGCAACGCTC CGCGGAAGAG TCGAGTAGCG CGGAACGAAG TTCCGCGGAC AGTCGAGCGG CGCAGCCGCG AGACGACGGC GATGAGCCGC GAGACCGAGA TGACGGCGTG CTCCGCGTCG CCGGCGAGGA ACTCGAGGAC GACCTCTTCG AGGTCGAACG CGAGCGTACC TACTCGGGCG AGGGCGAGAT GATCGAGACC GAATCGGCGG TCGTCATCCT CGAATAG
|
Protein sequence | MSTDAPESEP ESDPADSEPT LEGGYDPEAV EDRWQRRWID ADVYAYESDA ERDPNTVYAI DTPPPTVSGS LHMGHLYGST LQDFAARYQR MADGDVLFPF GYDDNGIASE RLTESELDIR HQDYERREFQ ELCREVCQEY EAEFTEKMQD LGTSIDWNNT YKTIEPRVQR VSQLSFLDLY EKGREYRKKA PAIWCPDCET AISQVEMEDD ERGSHFNDIA FDVASDGADR EEFVISTTRP ELIPACVSVF VHPDDEENQD LVGETARVPI FEQEVPIIED ERVDMEKGSG VVMCCTFGDQ NDIEWYQAHD LPLRVAIDES ATMTDLAGDY EGMSTEEARE AIVEDLEDEG YLRDRWEISH AVQVHERCDT AVEYRVSKQW YVEILDHKEE YLEAGREMDW YPEKMFSRYK HWIEGLEWDW LISRQRDSGI PFPVWYCADC DHEIMADRET LPVDPLSDEP PVDSCPECGA DDFVAEEDVF DTWATSSLTP LINAGWDWDE DAEEFRMEKP ELYPFDLRPQ GHDIISFWLF HTIVKCYEHT GEVPFDATMI NGHVLDENRE KMSKSRGNVV EPDEVLADFP VDAVRFWAAS AAVGDDFPYQ EKDLTAGEKL LRKLWNASKL VDTLAPREPE EPAELEAIDR WLLAELDDAV DDLTAHLEEY EFAKARDRLR TFFWNTFCDD YLEIAKTRED EPSTQYALRT AHRTFLELWA PFLPHATEEI WQAVYSDDPA DLENTSIHVR DWPSPQGYEA DLEAGEAAMD VISALRRYKS ENQLPLNADL ESVSVYGPIE GFEDAIQNVM HVQELTVLEE EPEVTTEIAS IDLDYSTLGP KFGSKVGEID SGIESGEYEI VSAEQRSAEE SSSAERSSAD SRAAQPRDDG DEPRDRDDGV LRVAGEELED DLFEVERERT YSGEGEMIET ESAVVILE
|
| |