Gene Information |
Locus tag | Htur_0042 |
Symbol | |
ID | 8740605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 48410 |
End bp | 50626 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646510605 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_003401616 |
Protein GI | 284163337 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
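The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values in this record, but neither is stated by it). The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (assumptions, not stated by the record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
print(feature_lengths(48410, 50626))  # (2217, 738)
```

The result matches the Gene Length (2217 bp) and Protein Length (738 aa) fields.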
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA TAGCACAGTC GGAGGTGAGC AGGGAGACCT ATCTGGGGCT CGGCGGCGTG GAATACGTGC TGTTCTACTT CCTGGTCGCA GTGACCCTTG CCGTCTTCGC TTACGGCGTC TACCGACGGT TCAGTCGGTA CACCGAAGGT GACGACGATC CGTTCGCTCG GGTCGACGGA CTGCTGAGCC GAACCGTCAA CGCGGCGAAG ATCGTCATCA CGAACGAGAA ACAGTTCAAC AGGGATCTCT ACGGCGGCCT GATGCACTCG TTTATCCTCT GGGGATTCCT GACGCTGCTC ATCGCGACGT CGATCATCGC CGTCGAGCAG TACGGAACCG AACTGTTGCT CGGGTTGACG TTCTGGGAGG GCGACTTCTA TCTCACCTAT CAGTTCATCG TCGACGCGAT GGGCCTGCTC TTCGTCGTCG GGATCGGAAT GGCCCTCTAC CGACGGTACT GGGTCCAGAA TCACCGCCTC TGGGGCCGGC ACACCTCTAA CGAGGACGAT ATCTTCATCT GGACGCTGTT CGGCCTCGGC ATCGGCGGCT TCCTGCTCGA GGGCTTTCGC ATCTACATCA GCGGCATGCC GAGCCACGAG GTCGTCAGCT TCGTCGGTTA CGGGCTGGCG ATGGCCTTCG ACGCGGTCGG CCTGCCGACG ACCGGTGCGG TGCTGGAGGG CGGCGCGCTC AATCCCGACT ACGCCGCAGT CGATCAACTC GGCTTCAACG CCGAGACGCT CCACTGGCTG ACGTGGTGGT CCCACTCGCT GCTCGCGCTC TTCTTCATCG CGTGGATCCC CCACGCCGGG AAGCCGTTCC ACATGCTCTC CTCGTTCGCG AACGTCGTCA CGCGCGACGA GAAGGCCGGT CGGCGCCTGC CTAACGTCCC GTCGGATCTG GACGCCACGA ACGCCGAGTC CATCGACGAC TTCACCTGGA AAGAGATTCT CGATCAGGAC GCCTGTACCA AGTGCGGTCG CTGTTCGTCG GTCTGCCCCG CAAAGGCCTC CAATCGCCCG CTCGATCCGC GAGACGTCAT CCTCGACCTC CGCAAGTATC GCGAGGAACT CGAGGCCGGC GGCGAGGAGA AGCCGATCAT CGCCGACGGG GGGACCTCAG TGATCAACAC GGAGACCATG GAGTCCTGTA TGGCCTGTAT GGCCTGTATG GACGCCTGTC CCGTCGAGAT CGAACACCTC CAGTCCTTTA CCCGGCTCAA CCGGCAGATG ACCGACCAGG GCGACGTCGC CCCCAGCATG CAGGACGTCT TCCAGAACGT CATGCAGAAC GGCAACACCT TCGGCGACTC GCCGCGCAAT CGCGGCGATT GGAGCGAGGG CCTCGAGTTC GACGTGCCCG ACGCCCGCGA GGAGACGGTC GACTACCTCT GGTACGTCGG CGACTTCCCG AGCTACGACG AGCGCAACAA GCAGGTCGCC CGCTCGCTGG CGACCATTCT GAAGGAGGCC GACGTCAGCT TCGGCATCCT CTTCGACGAC GAGAAGTACG ACGGCAACGA CATCCGCCGC GTCGGCGAGG AACTGCTCTA CGTCGAACTA GCCGGCCACC ACGTCGAGAC CTGGGAGGAC TGCGAGTTCG ACAAGATCGT CTGTACGGAC CCCCACTCCT ACAACACGTT CAAAAACGAG TATCCGGAGG TCAACTTCGA CGAGTTCGCC GACGACCCGA TGATGCCCTT CGAGTACGAC GAGCAGTGGA ACGAGGACGG CGAAATCGAG GTCTACCACT GGACCCAAGC CGTCGAGGAG CTAGTCCGAG AGGGCAAACT CGAGCTGTCG 
GGCACCGAAC TCGACTACAC GGTCACCTAC CACGACCCGT GCCACCTCGG CCGGTACAAC GACGAGTACG AAGCGCCCCG GGAACTCATC CGCGCCACGG GCTGTGAACT CGACGAGATG CCCCGCAACC GCGACAACTC CTTCTGCTGT GGCGGCGGCG GCGGCGGCCT CTGGATGGAC TTCGAGGAAG AGCCAAAGCC CAGCGAAGAG CGGATTCGGG AAGCCCTCGA GGACACCGAC AACGGCCCCG GCGTCGAGAA GTTCGTCGTC GCTTGCCCGA TGTGCATGAC AATGTACGAG GACGGGCGCA AGACCGGCGG CTACGAGGAC GAGATCGAGA TCGTCGACGT CGCCGAACTC ATCGTCGAAG CGATCGGCGC CGAGGAGGAA GCGAATCTCG AGGTCGCGGC GGACTGA
|
Protein sequence | MNAIAQSEVS RETYLGLGGV EYVLFYFLVA VTLAVFAYGV YRRFSRYTEG DDDPFARVDG LLSRTVNAAK IVITNEKQFN RDLYGGLMHS FILWGFLTLL IATSIIAVEQ YGTELLLGLT FWEGDFYLTY QFIVDAMGLL FVVGIGMALY RRYWVQNHRL WGRHTSNEDD IFIWTLFGLG IGGFLLEGFR IYISGMPSHE VVSFVGYGLA MAFDAVGLPT TGAVLEGGAL NPDYAAVDQL GFNAETLHWL TWWSHSLLAL FFIAWIPHAG KPFHMLSSFA NVVTRDEKAG RRLPNVPSDL DATNAESIDD FTWKEILDQD ACTKCGRCSS VCPAKASNRP LDPRDVILDL RKYREELEAG GEEKPIIADG GTSVINTETM ESCMACMACM DACPVEIEHL QSFTRLNRQM TDQGDVAPSM QDVFQNVMQN GNTFGDSPRN RGDWSEGLEF DVPDAREETV DYLWYVGDFP SYDERNKQVA RSLATILKEA DVSFGILFDD EKYDGNDIRR VGEELLYVEL AGHHVETWED CEFDKIVCTD PHSYNTFKNE YPEVNFDEFA DDPMMPFEYD EQWNEDGEIE VYHWTQAVEE LVREGKLELS GTELDYTVTY HDPCHLGRYN DEYEAPRELI RATGCELDEM PRNRDNSFCC GGGGGGLWMD FEEEPKPSEE RIREALEDTD NGPGVEKFVV ACPMCMTMYE DGRKTGGYED EIEIVDVAEL IVEAIGAEEE ANLEVAAD
|
| |
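The gene sequence and protein sequence above can be cross-checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the strand is "-") and uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Codons containing letters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 12 bases of the gene sequence in this record.
print(translate("ATGAACGCCA TAGCACAGTC GGAGGTGAGC"))  # MNAIAQSEVS
print(translate("GCGGCGGACTGA"))                      # AAD*
```

The first ten residues and the final three residues agree with the protein sequence listed above. The trailing `*` is the TGA stop codon, which is not counted in the protein length.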