Gene Information |
Locus tag | Haur_2661 |
Symbol | |
ID | 5734556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3414559 |
End bp | 3416712 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279803 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001545427 |
Protein GI | 159899180 |
COG category | [R] General function prediction only |
COG ID | [COG1480] Predicted membrane-associated HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
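The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Haur_2661.
length_bp = gene_length(3414559, 3416712)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2154 717
```

Both results match the Gene Length (2154 bp) and Protein Length (717 aa) fields listed in the record.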
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00304076 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTTC GTTCATATCG CTTAAAAACC TTGCTACGTT CGTTCATCGA GCATCACCAC CATTTGGTGT TGGTGCTCTT TGGGGCCGTG CTCACGCTAA TTTTGACCTT GATTTTTACG TGGCGCTCGG CGATCAACCA AGATATTATG GTTGGTCGTC CCAGCCCACG CACAATCAAC GCCGACCGCG ATTTGACCTT TGAAAGCCCC TTGCTGACTG AGGCCAAACG CCGTGAAGCC GCCAACGATC CCCGCAACTT GGTCTATAAC GAAGATACTC AAATTCATGG TCAACAGCGT GAACAGCTGC AAGCAACCTA CAGCGTGATT AACTCGGTTC GCGAAAATCC CAGCCTCAAC CTCGATCAAC AACGCGGGCA ACTGACTGAA TTGCCTTCGC TGCCGCTCTC CGATACCCTC GCCATCACCA TTCTTGAAGC TGATGACGAT ACCTGGCAAC GGATCAAAGA TCAAACCAAT GCCTTGTATG ATCGAACGCT GCGCGAAAAT AATTATTCAA TTGATGAAAC CACCCTCGCC GAAATTAAAG TGCGCTATTT GCCTTACAAC TTGCCCAGCA GTTTAAAGCC CGATCAACGA GCGGTTGTGC TGTATTTGGT CGAACAAACC CTGCATGTTA ATCGCACCCT GAATCAAGAG GAAACTGAGC GCCGCCAACA AACAGCCCGC AATGCTGTCC AATCGGTCTC AAAAGATGTC GTCAATGGCC AAAATATTGT GCGCCAAGGC GATACGGTTT CAGCAGAACA ATACGAAACC TTGATCAAAA TGGGTCTGAT CACGCCTGAA TTAGGCTTTG ATGGCTTTAT GGGGCGCTTC TTGCTAGCGC TCTTAGTGGC GTTGGCCTTA TGTACAGCAC TTTATATCGA TCAACATAAT CTTTTGACAT GGCCACGGGC ATTGCTGGTC ATCTTGATTT TGATGGTTAT TCCGATTTTG TCTGGGCGCA TTTTCCTCAA CACATGGCTG AATTTCCCTG AAACGTTTGC TTTGGCGGTG ATTGCGATTC CGTTGGCAGC GCTATTTAAC AACAATTTGG CCTTAGTTAT TTCAGCCTTA GTTTCGATTG TGATGATGTT TTTGGGCGAA GGCGCACTCC AAGTTGGCAT GATCAGCTTT GCCGGAGCCT TGTGTGGCAT CTACGCAATT CGTCGCGCCG ACCGGGCCAT GGCCTTTATT ATGGCTGGCG TTTGGATTGC GCTGGGAGTA TTCGCCACCG CCATGATTTG GCGTTTGATT CAGCCCCAAG GCGTAACCTG GCAACAAACT ATGTTCACCT TGATTTTTAG CATGCTCAAC GGTGGCATCA CGGCCATGAT GTCGTTGACC TTGCATAACG TGCTTGGGCG GATCGCTGGC ATCGTTACGC CAATGCAATT ATTGGAGTTG GCCCACCCCA ACCAGCCGCT GCTGCGCCGC TTGATGCAAG AAGCTCCAGG CACTTATCAT CACTCAGTCG TTGTCAGCAA TTTGGCCGAA CAGGCTGCTG AACGGATCGG CGCTGATACC TTGCTAACCC GCGTGGGAGC CTACTATCAC GATATTGGCA AAATGCTGCG GCCATTCTTC TTCACCGATA ATCAATACGA TCGCTCAAAT GTCCACGATA ACCTTGATCC GCAAACCAGC GCCAAATTAA TCGCCGATCA CGTGATTGAG GGAGCTAAAA TTGCGCGGCA GCATAAGCTG CCTGAGCAAA TTGTTAATTT CATCGTTGAG CATCACGGCA CCGATGTGAT TCGCTATTTC TATCAGCAAG CCTTACAAGC CCAAGATAGC 
GTTGATATCA ACGATTATCG CTACCCTGGA CCCAAGCCAC AATCCAAGGA AACAGCGATT TTGATGCTAG CCGATGGAGT TGAGGCCACT GTGCGCTCCA AGGAGCAAGC GGGCATGCTC GTGGCTGAGC GTCACGATGA CGATGATCAA CAAGCACCCA AAGGTTGCCA AAGCATTGCC CAAGTGGTCA ACCAAAGCAT CGATATGCGC CTTGCCAGCG GCCAGCTTGA TCAATGCCCG CTCACCCAAA AAGATCTCAA CACAATTCGC CAATCGTTTG TCAAAACGCT CCAAGGGATC TATCATCCAC GGGTTGAGTA TCCCAAATTG ATGCGGGAAC CGCAAAATAA ATAA
|
Protein sequence | MIVRSYRLKT LLRSFIEHHH HLVLVLFGAV LTLILTLIFT WRSAINQDIM VGRPSPRTIN ADRDLTFESP LLTEAKRREA ANDPRNLVYN EDTQIHGQQR EQLQATYSVI NSVRENPSLN LDQQRGQLTE LPSLPLSDTL AITILEADDD TWQRIKDQTN ALYDRTLREN NYSIDETTLA EIKVRYLPYN LPSSLKPDQR AVVLYLVEQT LHVNRTLNQE ETERRQQTAR NAVQSVSKDV VNGQNIVRQG DTVSAEQYET LIKMGLITPE LGFDGFMGRF LLALLVALAL CTALYIDQHN LLTWPRALLV ILILMVIPIL SGRIFLNTWL NFPETFALAV IAIPLAALFN NNLALVISAL VSIVMMFLGE GALQVGMISF AGALCGIYAI RRADRAMAFI MAGVWIALGV FATAMIWRLI QPQGVTWQQT MFTLIFSMLN GGITAMMSLT LHNVLGRIAG IVTPMQLLEL AHPNQPLLRR LMQEAPGTYH HSVVVSNLAE QAAERIGADT LLTRVGAYYH DIGKMLRPFF FTDNQYDRSN VHDNLDPQTS AKLIADHVIE GAKIARQHKL PEQIVNFIVE HHGTDVIRYF YQQALQAQDS VDINDYRYPG PKPQSKETAI LMLADGVEAT VRSKEQAGML VAERHDDDDQ QAPKGCQSIA QVVNQSIDMR LASGQLDQCP LTQKDLNTIR QSFVKTLQGI YHPRVEYPKL MREPQNK
|
| |