Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4728 |
Symbol | |
ID | 8745416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 331098 |
End bp | 332108 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646515229 |
Product | Rieske (2Fe-2S) iron-sulphur domain protein |
Protein accession | YP_003406176 |
Protein GI | 284172794 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAT GGAACCAAAA CCAGACCGAG GCAGTGAGCG CAGACATCAC GGAGAAGACG AACGCGCTAC CGGCCCGGTA CTTCACCGAC GACGACGTCT TCGAGATGGA AAAAGAGAAG ATATTCGGGC AGTACTGGGT CTACGCCGGC CACGCTAACA GCATCAGTGA GCCCGGCCAA TACTTCACGC GGACCATCGG CGGCCGCGAC CTCATCATCG CTCGAGACGA CGACGGTGAC GTCCGCGCCG TGGAGAACTT CTCCGCTCGC GACGGCGACG CGCTCCTCGA GGACGCGCCG ATGACCGATC CCGGACGCGT CGATCCGGAC GAGCTCGCCG ACGTGGAGTC GGTGCACGTC GACAGCATCG GTCCGCTGCA GTTCGTCAAC CTGCAGGAGG ATCCGATGCC GCTGGCCGAG CAGGCCGGCG TGATGAAAGA CCGCCTCGAG GCGCTGCCCC TTGGGGAGTA CGAACACGCC ACCCGAATCG TCTCGGAGGT CGAGTGCAAC TGGAAGGTGT TCGCGAGCAA CTACTCGGAG TGCGACCACT GTCAGGCCAA CCACCAGGAC TGGATCAAGG GCATCTCGCT CAACGACTCC GAACTCGAGG TCAACGACTA CCACTGGGTG CTCCACTACA CGCACGCCCA GGACGTCGAC GACGAGATGC GGATCCACGA CGAACACGAG GCGCAGTTCC ATTACTTCTG GCCGAACTTC ACGGTCAACA TGTACGGCAC CGCCGACGGC TACGGCACCT ACATCATCGA CCCGATCGAC ACCGATCGCT TCCGACTGAT CGCGGACTAC TACTTCCGCG ACAGCGAGCT CTCCGAGGAG GAGCGCGAGT TCGTTCGCAC GAGCCGCCAG CTCCAGGAAG AGGACTTCGA ACTGGTCGAA CGTCAGTGGG AAGGGCTCAG AACGGGCGCG CTCGCCCAGG CTCAGCTCGG CCCCAACGAA CACACCGTCC ACCGCTTCCA CCAGCTCGCC CAGGAGGCCT ATAACTCATA A
|
Protein sequence | MTQWNQNQTE AVSADITEKT NALPARYFTD DDVFEMEKEK IFGQYWVYAG HANSISEPGQ YFTRTIGGRD LIIARDDDGD VRAVENFSAR DGDALLEDAP MTDPGRVDPD ELADVESVHV DSIGPLQFVN LQEDPMPLAE QAGVMKDRLE ALPLGEYEHA TRIVSEVECN WKVFASNYSE CDHCQANHQD WIKGISLNDS ELEVNDYHWV LHYTHAQDVD DEMRIHDEHE AQFHYFWPNF TVNMYGTADG YGTYIIDPID TDRFRLIADY YFRDSELSEE EREFVRTSRQ LQEEDFELVE RQWEGLRTGA LAQAQLGPNE HTVHRFHQLA QEAYNS
|
| |