Gene Dret_2206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2206 
Symbol 
ID8420062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2508355 
End bp2509554 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content57% 
IMG OID645038805 
Producttyrosyl-tRNA synthetase 
Protein accessionYP_003199068 
Protein GI258406326 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0162] Tyrosyl-tRNA synthetase 
TIGRFAM ID[TIGR00234] tyrosyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAGG CCGTGGAACT GATCCGCCGG GGCAGTGAGG AGATCATTAA CGAAGAGGAA 
CTCCTCGAAC GGCTACGAGG CGACCGTCCC TTGACCATTA AGGCCGGCTT CGATCCCACG
GCCCCGGATT TGCATCTTGG GCACACGGTC TTGATCCAAA AACTCAAGCA TTTTCAAGAG
CTCGGGCACA ACGTGGTCTT TTTGATCGGA GATTTCACCG GCATGATCGG AGATCCGAGC
GGGAAGTCCG AGACGCGAAA GGCCCTGACC CGCGAAGAGG TCCGGATCAA CGCCGAGACC
TACAAACGGC AGATATTCCG GATCCTGGAC CGGGAGAAAA CCAGGATCGC CTTCAATGCA
GAGTGGATGG ACAGCTTCAG CGCAGCGGAC TTCGTCCACT TGTGCTCCCA GTATACAGTG
GCCCGGATGT TGGAACGCGA TGACTTCGCC AAACGGTACC AGGCCAACAA GCCGATCTCC
GTGCATGAGT TTTTGTACCC CCTCGTTCAG GGCTACGATT CCGTAGCTCT ACAGGCCGAT
GTCGAGCTCG GCGGCACGGA CCAGAAATTC AATCTGCTCA TGGGGCGGAC TCTCCAACGG
GAATACGGAC AGACTCCGCA GGTGATTTTG ACCATGCCTA TTTTGGAGGG GCTCGACGGT
GTCCAGAAGA TGAGCAAATC CCTGGGCAAT TATGTGGGCA TTGAAGAGCC GGCCCAGGAT
ATGTTCGGCA AACTCATGTC CATTTCGGAC GAGCTTATGT GGCGGTATTT CGAGCTGCTA
TCCGATCTGA GCCTGGAGGC CATTGCCGCC CTGCAGGCAC AGGTGGCTTC TGGTGAACTC
CACCCGAAAG CGGCCAAGGA ACAGCTCGCC TTTGAAATTA CTGAACGGTT CCACTCCACG
GCTGAGGCCG AAAAGGCTAA AGCCAATTTT GCCAACGTCT TCAGTCAGCA CCAGCTGCCT
GAGGACATGC CCGAGATCCG GATCGCCCCT GAAGAAGCCT CGGTTATTGC GACCATCGAC
CACAGCGGAC TGTGCGCCTC CCGGGCCGAA ATCAAGCGGT TGTGCAAACA GGGGGCTGTC
ACCTGCGATG GAGAGAAGGT CACGGCGTTC GGGGACATGC TTGCCCCTGG TGAGCATGTC
TTCAAGATCG GCAAGAAACG GTTCTTCAAG GCGGTGGTCA GCGAGGAGTC CCAGGCGTGA
 
Protein sequence
MQEAVELIRR GSEEIINEEE LLERLRGDRP LTIKAGFDPT APDLHLGHTV LIQKLKHFQE 
LGHNVVFLIG DFTGMIGDPS GKSETRKALT REEVRINAET YKRQIFRILD REKTRIAFNA
EWMDSFSAAD FVHLCSQYTV ARMLERDDFA KRYQANKPIS VHEFLYPLVQ GYDSVALQAD
VELGGTDQKF NLLMGRTLQR EYGQTPQVIL TMPILEGLDG VQKMSKSLGN YVGIEEPAQD
MFGKLMSISD ELMWRYFELL SDLSLEAIAA LQAQVASGEL HPKAAKEQLA FEITERFHST
AEAEKAKANF ANVFSQHQLP EDMPEIRIAP EEASVIATID HSGLCASRAE IKRLCKQGAV
TCDGEKVTAF GDMLAPGEHV FKIGKKRFFK AVVSEESQA