Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2658 |
Symbol | |
ID | 5734553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3412435 |
End bp | 3413676 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279800 |
Product | metallophosphoesterase |
Protein accession | YP_001545424 |
Protein GI | 159899177 |
COG category | [R] General function prediction only |
COG ID | [COG1408] Predicted phosphohydrolases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00191858 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCAA TTAATCCTTG GGCAAAACGT CTATATCAGA CTGGTCGCTG GGCCAGCAAC GTCGTTATTT GGCTGGTACT ATGGATCAGT TGTGTGGCGA TCATCTTTTT GATGATTCGC TATTTCAGCG GCGCAGAATG GCTAAGCAAT CTGCAAGGCT CAGCCCATTT TGTCGCCGAA TTTGGCATGC GCCTGTTGAT GGCTGCGCCA TTTACCCTCT GGCTTTTGTT GATGCTGCGA CCAATTCGCA CCCGTCGCTG GGTCGTCAGC CACATACTCA AATTGACTCA ACGCTTGCGC CGCCAGCCCA AGCCCCAACT TGAGCCAGCC ATTGATCAGC TAGATCGAGA AGTTCAACCT ACAACCAACC CAACTATGAG CGCAAAACCG CTCAGTCGCC GTCGATTTTT GGTTGAATCA GGACTGGTTG GTGGTGTGGT CGGTTATGCA ATGTTGATCG AGCCATATCA GATTCAGGTC CGCGAAGTTA ATTTGCCAAT CGCCAATTTG CCCGAACGTT TTCGCGGCAT GCGCATCGCC CAAATGAGCG ATTTGCATAT CAATGCCTAC ACCACCAGCG CCGATTTGGC CCGCGCTGTG GCCCAAATCA ACCAGCTCAA CCCTGATATG GTGCTGCTCA CTGGCGATTT TGTCGATTGG GATGCACGCT TTGCTGATGC CGCCACCGAG CCATTCCGCC AGCTGCGTGC ACCCGAAGGT ATTTATTCGG TGCTTGGCAA CCACGATTAC TACAGTGGCA AGATCGATAT AATCAAACAA GCCATCCAAC GCCACGATTT AGGTTTGTTG GTCAATCAGC ATACCATTTT GCGCCGTGGC GCTGATCAAT TGGTCTTGGT AGGTTTTGAT GATCCACGGC ATAATCGTAG CGGCGGGCCA CGGCTCAGCC CTGAGAGCAT CAATCCTGAA GCGGCCTTGA AGGGTACGCC GAAAAATGTT GCCCGCCTAG CAATGGTGCA TAATCCAGTA ATTGTGCCGC ATTTTGTCGC CAATTATCAG CTTGATGTGA TTCTATCGGG GCATACCCAT GGCGGCCAAT TCCAAGTGCC AATTCTCACC GACCAGCTGG TGGGCAATGC TGAATATTTT GTGCGCGGCC ATTACGATTT GGGTAAATCA CAGGTTTATG TCAACAGTGG TTTTGGTTTT ACCGGGCCGC CCTTGCGATT TCGCTCGGCT CCAGAAATTA CATTAATTAA TTTAGTTAAT GCCAAAGCCT AG
|
Protein sequence | MAAINPWAKR LYQTGRWASN VVIWLVLWIS CVAIIFLMIR YFSGAEWLSN LQGSAHFVAE FGMRLLMAAP FTLWLLLMLR PIRTRRWVVS HILKLTQRLR RQPKPQLEPA IDQLDREVQP TTNPTMSAKP LSRRRFLVES GLVGGVVGYA MLIEPYQIQV REVNLPIANL PERFRGMRIA QMSDLHINAY TTSADLARAV AQINQLNPDM VLLTGDFVDW DARFADAATE PFRQLRAPEG IYSVLGNHDY YSGKIDIIKQ AIQRHDLGLL VNQHTILRRG ADQLVLVGFD DPRHNRSGGP RLSPESINPE AALKGTPKNV ARLAMVHNPV IVPHFVANYQ LDVILSGHTH GGQFQVPILT DQLVGNAEYF VRGHYDLGKS QVYVNSGFGF TGPPLRFRSA PEITLINLVN AKA
|
| |