Gene Htur_2787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2787 
Symbol 
ID8743402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2859296 
End bp2861245 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content64% 
IMG OID646513374 
Productthreonyl-tRNA synthetase 
Protein accessionYP_003404333 
Protein GI284166054 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAC CAGATTCCCA GTCACAGGAA CAGATAACGG TCGTACTGCC CGACGGATCC 
GAACTCGAGG TCGCATCCGA CGCGACGGTC GAGGACTGCG CCTACGAAAT CGGTCCCGGT
CTCGGCCGCG ACACGGTCGC CGGCAAACTC GACGGCGAAC TTGTCGCCAA GGAAGAACCC
GTCTACGACG GCGTCGAGCT CGAGATCGTC ACCGACGGCT CCGAGGAGTA CCTCGAGGTC
ATGCGCCACT CCGCGGCACA TTGCCTCGCC CAGGCCGTCG AACGGCACTA CGACGACGTC
GACCTCGCGA TCGGGCCGCC GACGGACGAG GGCTTCTACT ACGACTTCGA TAATCTCGAC
GTCGACGAGG AGGATCTCGC GGCCCTCGAG GACGAAATCG AGGCGATCAT CGCCGAGGAC
TACGAGATCG AGCGCGAGGA TGTCTCGATC GAGGAGGCCG AGGAGCGGCT GGCCGGCGAG
CCCTACAAGC TCGAACTGCT CGAGGAGTTC GCCGACGAGA ACGACACCGT CACCTTCTAC
AAACAGGGCG AGTGGGAGGA CCTCTGTGCC GGTCCGCACG TCGACTCGAC GGGCGAAATC
GGCGTCGTCG AGCTGTTGGA AATCGCCGGC GCCTACTGGC GCGGCGACGA GGAGAATACG
ATGCAGACGC GCATCTACGG GACGGCCTTC GAAGACGAGA GCGATCTCGA AGACTTCCTC
GAGCGCAAGC AGGAAGCCGA GAAGCGCGAC CACCGCCGGA TCGGCAACGA GATGAACCTG
TTCTCGATTC AGGACGTCAC CGGCCCCGGA CTGCCGCTGT ATCACCCGCC GGGGAAGACC
GTCCTGAAGG AACTCGAGGA CTTCGTCGAG GACCTCAACA AGGACGCGGG CTACGACTAC
GTCGAGACGC CCCACGTCTT CAAAACGGAT CTCTGGCACC GCTCGGGTCA CTACGAGAAC
TACCAGGACG ACATGTTCAT CTTCGACGTC GGCGACGACG AGTTCGGCCT GAAGCCGATG
AACTGTCCCG GCCACGCCGC CATCTTCCAG GATCAGTCCT GGAGCTACCG CGACCTCCCC
ATTCGCTACG CGGAGAACGG GAAGGTCTAC CGCAAGGAGC AGCGCGGCGA ACTCTCGGGC
CTCTCGCGGG TCTGGGCCTT TACGATCGAC GACGGCCACC TGTTCATCCG CCCCGACCAG
ATCAGACAGG AGGTCGAGGA GATCATGGAC ATGATCACCG ACGTCCTCGA GACGTTCGAC
CTCGAGTACG AGATGGCTCT CGCCACCCGG CCCGAGAAGT CGGTCGGCTC CGACGAGATC
TGGGACCGCG CGGAGGAGCA ACTCGAGAAC GTCCTCGAGA ATCGCGCCCA CGACTACGAG
GTCGAGGAGG GCGACGGCGC GTTCTACGGG CCGAAGATCG ACTTCGCGTT CGAGGACGCC
ATCGGCCGCT CGTGGGACGG CCCCACGGTC CAACTCGACT TCAACATGCC CGAGCGGTTC
GACCTGAACT ACGTCGGCGA GGACAACGAG GAACACCGTC CGGTCATGAT CCACCGCGCG
CTCTACGGCA GCTACGAGCG GTTCTTCATG ATGCTCATCG AGCACTACGA GGGTCGGTTC
CCGCTGTGGC TCGCGCCCGA ACAGGTCCGC GTGCTACCCA TTTCGGACGA CAACCTCGGT
TACGCTCACC GCGTCGCCAA CGAGTTCGAC GACTTCCGCG TCGAGGTCGA CGGTCGCGAC
TCCACCTTGG AGCGGAAGAT CCGCGCGGCC CACGACGATC GGGTCCCCTA CCAGATCATC
GTCGGCGACA ACGAGGAGGA CGACGGCAAC ATCTCCGTCC GCGATCGCTT CGAGGACCAG
GAGTACGACG TCGAGATCGA GGACTTCAAG CAGCACCTCG AGGCCGAAAT CGAGGAGCAG
CGGACCCAGC CGGACTTCCT GCAGGACTGA
 
Protein sequence
MSEPDSQSQE QITVVLPDGS ELEVASDATV EDCAYEIGPG LGRDTVAGKL DGELVAKEEP 
VYDGVELEIV TDGSEEYLEV MRHSAAHCLA QAVERHYDDV DLAIGPPTDE GFYYDFDNLD
VDEEDLAALE DEIEAIIAED YEIEREDVSI EEAEERLAGE PYKLELLEEF ADENDTVTFY
KQGEWEDLCA GPHVDSTGEI GVVELLEIAG AYWRGDEENT MQTRIYGTAF EDESDLEDFL
ERKQEAEKRD HRRIGNEMNL FSIQDVTGPG LPLYHPPGKT VLKELEDFVE DLNKDAGYDY
VETPHVFKTD LWHRSGHYEN YQDDMFIFDV GDDEFGLKPM NCPGHAAIFQ DQSWSYRDLP
IRYAENGKVY RKEQRGELSG LSRVWAFTID DGHLFIRPDQ IRQEVEEIMD MITDVLETFD
LEYEMALATR PEKSVGSDEI WDRAEEQLEN VLENRAHDYE VEEGDGAFYG PKIDFAFEDA
IGRSWDGPTV QLDFNMPERF DLNYVGEDNE EHRPVMIHRA LYGSYERFFM MLIEHYEGRF
PLWLAPEQVR VLPISDDNLG YAHRVANEFD DFRVEVDGRD STLERKIRAA HDDRVPYQII
VGDNEEDDGN ISVRDRFEDQ EYDVEIEDFK QHLEAEIEEQ RTQPDFLQD