Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2666 |
Symbol | |
ID | 8743280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2736722 |
End bp | 2738890 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646513254 |
Product | catalase/peroxidase HPI |
Protein accession | YP_003404214 |
Protein GI | 284165935 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTGGT CCAACCAAGA CTGGTGGCCG GATCTGCTGA GAGTAGACAT CCTCGACGAT AACACCGTGG ACGCCAGTCC GTACGGCGAG GACTTCGACT ACGCGGAGGC GTTCCAGCAA CTCGATTACG AGGCGGTGAA AGCGGACATC GAGGACGTGA TGACGACCTC TCAGGAGTGG TGGCCGGCCG ACTACGGCCA CTACGGGCCG CTTTTCATCC GGATGGCGTG GCACAGCGCG GGGACGTATC GCACGCTCGA CGGCCGCGCT GGCGCGTCCG GCGGCCTCCA GCGCCTCCCG CCGGAGAGCA GTTGGCCGGA CAACGTGAAC CTCGACAAGG CCCGCCGCCT GCTCCAGCCG GTCAAGCGGA AGTACGGCCG CAAGCTCTCG TGGGCCGACC TGATGGTCCT CGCTGGGAAC GTCGCCCTCG AGTCGATGGG CTTCGAGACG TTCGGCTTCG CCGGCGGGCG CGAGGACGCG TTCAAGTCCA ACGAGGCCGT CGAGTGGGGG CCCGAGACGG AGTGGGAGAC GACCTCGCCC GAGCGCTACC GCGACGGGGA GGTCGGCGAC CTCAAGGACC CGCTCGCGAA CACCGTGATG GGGCTCATCT ACGTCAACCC CGAGGGGCCG TACGGCGAAC CGGACCTCGA GGGCTCCGCG AAGAACATCC GCGAGGAGTT CTCTCGCATG GCGATGACCG ACGAGGAGAC CGTGGCGCTC ATCGCCGGCG GCCACACCTT CGGGAAGGTC CACGGCGCCG ACAACCCCGA CACGAATCTC GGCCCCGAGC CCGAGGCGGC CCCCATCGAC CAGCAGGGCC TCGGCTGGCA GCACGAAGGC AGCGAAGACA AGACCGGCGG ACTTGACGTC ATCACGAGCG GTATCGAAGG ACCGTGGAAC GCCTCGCCGA TCCAGTGGGA CACGGGCTAC GTCGACAACC TGCTCGATCA CGAGTGGGAA CCCCACAAGG GGCCCGGCGA CGCGTGGCAG TGGCGGGCGA AAGACGAAGC GGACCTCGAG TCCGCGCCGG ACGCCCAGGA CTCGTCGGAG ACCCAACTGC CGATGATGCT GACGACGGAC GTCGCCCTGA AGCACGACCC CGACTACCGA GAGGTCCTCG AGCGCTTCCG GGAGAACCCC AACCAGTTCC GGGAGGCGTT CGCGAAGGCG TGGTTCAAGC TCCTCCACCG CGACATGGGA CCGCCCGAGC GGTACCTCGG CCCGGAGGTC CCCGACGAGA CGTTGATCTG GCAGGATCCC GTCCCCGACG CCGACTACGA ACTGGTCGGC GAAGCGGAAA TCGACGAACT CGAGGCGGAG ATTCTCGATT CGGACCTCTC CGTCCCGCAA CTGGTCAAGA CCGCGTGGGC GTCGGCGTCG ACGTACCGCG ATAGTGACAA ACGCGGCGGC GCGAACGGCG CACGGATCCG CCTCGAACCA CAGCGAAACT GGGAGGTAAA CGAGCCCGAG GAACTGGAGA CGGTCCTTTC GACCTACGAG GAGATTCGGG GCGAGTTCAA CCGCACCCGC TCCGACGACG TGACGGTCTC GCTGGCCGAC CTCATCGTGC TGGGGGGCAA CGCGGCCGTC GAGCAGGCAG CGGCCGAGGC CGGCTACGAC GTGGACATTC CCTTCGAACC GGGCCGCACG GACGCCTCAC AGGAGCAAAC CGACGTCGAG TCCTTCGAGG CGCTCGAGCC GAAGGCCGAG GGCTTCCGGA ACTACCTCGG TGGCGAGTAC GACGACCTGT ACGACTCGCC CGAGGAGCGG CTGATAGACC ACGCGCACCT CCTGACCCTG TCGGTACCCG AGATGACGGT GCTGGCCGGA GGCATGCGCG CGCTGGGTGC GACCTACGGG GATTCCGGTC GCGGCGCCTT TACCGACGAA CCCGGCGTCC TGACGAACGA TTTCTTCGCG AACCTGCTCG ATATGGGCTA CGATTGGGAG CCGGTCTCGG AGGACAGAGA ACGCTTCGAA GTCCGCGACC GCGACACCGG CGACGTCGAG TGGGAAGCCA CCCGCTTCGA CCTCATCTTC GGCTCGAACG CTCGGCTTCG AGCGCTCGCG GACGCCTACG GTGCCGACGA CGGCGAGGAA GAATTCGTCC GTGACTTCGC GGACGCCTGG AGCAAGGTGA TGACGCTCGA TCGCTTCGAC CTCGAGTAA
|
Protein sequence | MTWSNQDWWP DLLRVDILDD NTVDASPYGE DFDYAEAFQQ LDYEAVKADI EDVMTTSQEW WPADYGHYGP LFIRMAWHSA GTYRTLDGRA GASGGLQRLP PESSWPDNVN LDKARRLLQP VKRKYGRKLS WADLMVLAGN VALESMGFET FGFAGGREDA FKSNEAVEWG PETEWETTSP ERYRDGEVGD LKDPLANTVM GLIYVNPEGP YGEPDLEGSA KNIREEFSRM AMTDEETVAL IAGGHTFGKV HGADNPDTNL GPEPEAAPID QQGLGWQHEG SEDKTGGLDV ITSGIEGPWN ASPIQWDTGY VDNLLDHEWE PHKGPGDAWQ WRAKDEADLE SAPDAQDSSE TQLPMMLTTD VALKHDPDYR EVLERFRENP NQFREAFAKA WFKLLHRDMG PPERYLGPEV PDETLIWQDP VPDADYELVG EAEIDELEAE ILDSDLSVPQ LVKTAWASAS TYRDSDKRGG ANGARIRLEP QRNWEVNEPE ELETVLSTYE EIRGEFNRTR SDDVTVSLAD LIVLGGNAAV EQAAAEAGYD VDIPFEPGRT DASQEQTDVE SFEALEPKAE GFRNYLGGEY DDLYDSPEER LIDHAHLLTL SVPEMTVLAG GMRALGATYG DSGRGAFTDE PGVLTNDFFA NLLDMGYDWE PVSEDRERFE VRDRDTGDVE WEATRFDLIF GSNARLRALA DAYGADDGEE EFVRDFADAW SKVMTLDRFD LE
|
| |