Gene Htur_4738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4738 
Symbol 
ID8745330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp342549 
End bp343682 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content60% 
IMG OID646515238 
ProductRieske (2Fe-2S) iron-sulphur domain protein 
Protein accessionYP_003406185 
Protein GI284172803 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.590157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACAGT GGGACGACTC ACAGACGCAA TCAGTGAGTT CAGATATCAC GGAAAAATCA 
AACGCGTTAC CGGCGAGGTA TTTCACTGAC CCGGACGTCT TCGAGATAGA GAAAGACAAG
GTGTTCGGCC AGTACTGGGT GTACGCCGGC CACGCCAACT GTATCAAGGA ATCGGGCCAG
TACTTCACCC GGACGATCGG TGATCGTCAA CTGATCGTCG TCCGCGGTCA CGACGGCGAG
GTCAAAGCGT TCGACAACGT CTGTGCCCAC CGCGGCTCGA AGATGGTCGA GGACACGCCG
ATGACCGATC CCGGCGATGC GAAGCGGATC AAGTGTCCGT ATCACCTCTG GACGTATGAC
CTCGAGGGAG AGCTCAAAAG CACGCCCAAG AGCTTCGATG AAGCCGGTCT GAACCCCGAC
CTCGAGGACG AAGACGTCCA GGAGTTCGAC GCCGAAGAGA ACGCCCTGAA CGACGTCCAC
GTCGACACCA TCGGCCCGCT GATCTTCGTG AACCTCAGTG AGGATCCGAT GCCGCTGGCC
GAGCAGGCCG GCGTGATGAA AGATCGCCTC GAGGCGCTGC CCCTTGGGGA GTACGAACAC
GCCACCCGAA TCGTCTCGGA GGTCGAGTGC AACTGGAAGG TGTTCGCGAG CAACTACTCG
GAGTGCGACC ACTGCCAGGC CAACCACCAG GACTGGATCA AGGGCATCTC GCTCAACGAC
TCCGAACTCG AGGTCAACGA CTACCACTGG GTGCTCCACT ACACGCACGC CCAGGACGTC
GACGACGAGA TGCGGATCCA CGACGAACAC GAGGCGCAGT TCCACTACTT CTGGCCAAAC
TTCACGGTTA ACATGTACGG TACCGCCGAC GGCTACGGCA CCTACATCAT CGATCCGATC
GATACCGACC GCTTCCAGCT CATCGCGGAC TACTACTTCC GCGACAGCGA GCTCTCCGAG
GAGGAGCGCG AGTTCGTTCG CACGAGCCGC CAGCTCCAGG AAGAGGACTT CGAACTGGTC
GAACGCCAGT GGGAAGGCCT CAGAACGGGC GCGCTCGCCC AGGCCCAACT CGGTCCCAAC
GAACACACCG TCCACAAGTT CCACCAGCTC GCCCAGGAGG CCTACGACTC GTGA
 
Protein sequence
MTQWDDSQTQ SVSSDITEKS NALPARYFTD PDVFEIEKDK VFGQYWVYAG HANCIKESGQ 
YFTRTIGDRQ LIVVRGHDGE VKAFDNVCAH RGSKMVEDTP MTDPGDAKRI KCPYHLWTYD
LEGELKSTPK SFDEAGLNPD LEDEDVQEFD AEENALNDVH VDTIGPLIFV NLSEDPMPLA
EQAGVMKDRL EALPLGEYEH ATRIVSEVEC NWKVFASNYS ECDHCQANHQ DWIKGISLND
SELEVNDYHW VLHYTHAQDV DDEMRIHDEH EAQFHYFWPN FTVNMYGTAD GYGTYIIDPI
DTDRFQLIAD YYFRDSELSE EEREFVRTSR QLQEEDFELV ERQWEGLRTG ALAQAQLGPN
EHTVHKFHQL AQEAYDS