Gene Htur_4698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4698 
Symbol 
ID8745294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp289601 
End bp290734 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content61% 
IMG OID646515202 
ProductRieske (2Fe-2S) iron-sulphur domain protein 
Protein accessionYP_003406149 
Protein GI284172767 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.412068 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCAGT GGGACGATTC ACAGGCGAAA GCGGTGAGCG AGGACATCAC GGAGAAGTCG 
AACGCGCTGC CGGCCCGGTA CTTCACCGAC GATGACGTCT TCGAGATGGA GAAAGACAAG
GTGTTCGGCC AGTACTGGGT GTACGCCGGC CACGCCAACT GTATCAAGGA ATCGGGCCAG
TACTTCACCC GGACGATCGG TGATCGCCAA CTGATCGTCG TTCGCGGTCA CGACGGCGAG
GTCAAAGCGT TCGACAACGT CTGTGCCCAC CGCGGCTCGA AGATGGTCGA GGACACGCCG
ATGACCGACC CCGGCGATGC AAAGCGAATC AAGTGTCCGT ACCACCTCTG GACGTACGAC
CTCGACGGAG AGCTCAAAAG CACGCCCAAG AGCTTCGAAG AAGCGGGCCT GAACCCCGAC
CTCGAGGACG AAGACGTTCA GAAGTTCGAC GCCGAAGAGA ACGCCCTGAA CGATGTGCAC
GTCGACACCA TCGGCCCGCT GATCTTCGTG AACCTCAGCG AGGATCCGAT GCCGCTGGCC
GAACAGGCCG GCGTGATGAA AGACCGCCTC GAGGCGCTGC CCCTCGGGGA GTACGAACAC
GCCACCCGAA TCGTCTCGGA GGTCGAGTGC AACTGGAAGG TGTTCGCGAG CAACTACTCG
GAGTGCGACC ACTGCCAGGC CAACCACCAG GACTGGATCA AAGGCATCTC GCTCAACGAG
TCCGAACTCG AAGTCAACGA CTACCACTGG GTGCTCCACT ACACCCATGC AGAGGACGTC
GAGGACGAGA TGCGGATCCA CGACGAACAC GAGGCCCAGT TCCACTACTT CTGGCCGAAC
TTCACGGTCA ACATGTACGG CACTGCCGAC GGCTACGGCA CCTACATCAT CGATCCGATC
GACACCAACC GGTTCCAGCT CATCGCGGAC TACTACTTCC GCGACAGCGA ACTCTCCGAG
GAAGAGCGCG AGTTCGTTCG CACGAGCCGC CAGCTCCAGG AAGAGGACTT CGAATTAGTC
GAACGTCAGT GGGAAGGGCT CAGAACGGGC GCGCTCGCCC AGGCTCAGCT CGGCCCCAAC
GAACACACCG TCCACCGCTT CCACCAGCTC GCGCAGGAAG CCTACGACTC GTGA
 
Protein sequence
MTQWDDSQAK AVSEDITEKS NALPARYFTD DDVFEMEKDK VFGQYWVYAG HANCIKESGQ 
YFTRTIGDRQ LIVVRGHDGE VKAFDNVCAH RGSKMVEDTP MTDPGDAKRI KCPYHLWTYD
LDGELKSTPK SFEEAGLNPD LEDEDVQKFD AEENALNDVH VDTIGPLIFV NLSEDPMPLA
EQAGVMKDRL EALPLGEYEH ATRIVSEVEC NWKVFASNYS ECDHCQANHQ DWIKGISLNE
SELEVNDYHW VLHYTHAEDV EDEMRIHDEH EAQFHYFWPN FTVNMYGTAD GYGTYIIDPI
DTNRFQLIAD YYFRDSELSE EEREFVRTSR QLQEEDFELV ERQWEGLRTG ALAQAQLGPN
EHTVHRFHQL AQEAYDS