Gene Htur_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_2025 
Symbol 
ID8742624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013743 
Strand
Start bp2096452 
End bp2098038 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content69% 
IMG OID646512607 
Productprotein of unknown function DUF790 
Protein accessionYP_003403582 
Protein GI284165303 
COG category[S] Function unknown 
COG ID[COG3372] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGACCA AGGACCTGCT CCGCGTCTCG CGGGCCGGAG GCGGCTACCA CCTCCAGTTC 
GCCGATCGGG AGCACCGTCC GCTCGCCGCC CGCGTCATTG GGACGTATCA GGGCCACGTC
GGCGAATCTC GCGCGGAACT CGAGGCGGCC GTGACTGAAC TTGAACGCGG TGCGGACGAT
TTCAAGCTCG TCAGAGGGCT GTCGGCGCTG CTCGAGCGCG ACGCGACGTT CGAGACCGAC
GCCGAGATCG ATCCCGAACG CGCTCGTCGG GCTGCCTTCG AGGCCGCCGA GGCCGTCGGC
GTCGTGACCG AGGACGAGCG TGCGATGGCC CTCGTTCGCG CCGGCGAGTC GCTGGGCGTC
TCGGCCGACG ACGTCGCGGG GGCGCTGTAC GCCGACCTCG AGGAGCGGCA GGTCCTCATC
GAACTGGCGT CGCGGTGGGA GCCGGACGAG CTGGTGGCCC AGTACAACCT CTCGCTGGCA
CAGACCGCCC TTTTCGACGC AACCGAGCTT CGGGTCCGCT CGAGCGATCC GAAGGCGCTC
GTTTCGGCGA TCAAGCGACT GCGACTGATG TACGAAATTC GACGGCTGGA GAACGACGAG
GTCGGCGAAG CGCCCGATCG AGGCATCGCC GAGCGCGAGG TGATCGTCAC CGGGCCGACC
CACCTCTTCC GGGCGACCCG CCGGTACGGC ACTCGGTTTG CCCGCCTCTT GCGGACGGTC
GCGAAAGCCG AGGAGTGGCG CCTCGAGGCG ACGATCGACG ACCGCGGGAC CGAACGGACG
CTCCGTCTGT CCCACGAGGA TCCCGTCCGC GTCCCCGACG CAGAGCCAGT TGCCGAGGTC
TCCTTCGACA GCGGCGTCGA GGCCGATTTC GCCGCGCGCT TCTCGACTCT CGATCTCGAG
TGGGATCTCG TGCGCGAACC CGCGCCCCTC GCGACGGGAA CGCGGGTGAT GATCCCCGAT
TTCGCGTTCG ACTATCGTCC TGGGGGCAGC GCCCGCAGGG ACTCGTCGGA CGAGTCCGAC
GGAGGGCACA GCGAGTTCCG CGTCTACTTC GAAATCATGG GCTTCTGGAC GCCCGAGTAC
GTCGAGAAGA AACTCGCACA GCTGTCGGAC CTCGAGGACG TCGAACTGAT CGTCGCCGTC
GACGAGTCCC TCGGCGTCGG CGAGGAAATC GCGGCCCGAG ACTTCCGGGC GATCCCCTAC
TCCGGAAGCG TCCGGCTGAA GGATGTCGCC GGCGTCCTCC GGGAGTACGA GCGCCAACTC
GTCGCCGAGA GCGCCGCCGC GCTGCCGGAC GAACTGTGCC CCGACGAGGA CGTACTCTCG
CTCGAAGCGC TGGCCGGCCG GCGGGGCGTC AGCGAGGACG CGCTGGTCGA CGTCGCGTTT
CCGGACCACG TGCGGGTCGG CCGGACGCTC GTCCGGCCGG CCGTCCTCGA GTCGCTGGCG
GACGAGATCG AGGCCGGAAT GGCGCTGGCC GACGCCGAGA AGATCCTCGA AGCGGCCGGA
TTCAGCGATT CGAGTGCGAT CCTCTCGGAA CTCGGTTATC GCGTCGAGTG GGAGGGGCTG
GCCGGCGGGA CGCTCGTCGA GCGGTAG
 
Protein sequence
MLTKDLLRVS RAGGGYHLQF ADREHRPLAA RVIGTYQGHV GESRAELEAA VTELERGADD 
FKLVRGLSAL LERDATFETD AEIDPERARR AAFEAAEAVG VVTEDERAMA LVRAGESLGV
SADDVAGALY ADLEERQVLI ELASRWEPDE LVAQYNLSLA QTALFDATEL RVRSSDPKAL
VSAIKRLRLM YEIRRLENDE VGEAPDRGIA EREVIVTGPT HLFRATRRYG TRFARLLRTV
AKAEEWRLEA TIDDRGTERT LRLSHEDPVR VPDAEPVAEV SFDSGVEADF AARFSTLDLE
WDLVREPAPL ATGTRVMIPD FAFDYRPGGS ARRDSSDESD GGHSEFRVYF EIMGFWTPEY
VEKKLAQLSD LEDVELIVAV DESLGVGEEI AARDFRAIPY SGSVRLKDVA GVLREYERQL
VAESAAALPD ELCPDEDVLS LEALAGRRGV SEDALVDVAF PDHVRVGRTL VRPAVLESLA
DEIEAGMALA DAEKILEAAG FSDSSAILSE LGYRVEWEGL AGGTLVER