Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1558 |
Symbol | |
ID | 5733445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1808805 |
End bp | 1810181 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641278697 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001544329 |
Protein GI | 159898082 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAAAA CCGTTGTGAT TAATGTGGTT GGGCTGACTC CAGCCTTGAT TGGACAATAC ACGCCACGTT TACAAGCATG GCAACAAAAA ACTGCTCAAG CCTCGATCAA ACCGATCGTT CCAGCAGTAA CCTGTAGCAT GCAGGCAACC TATCTCACGG GAAAAATGCC TAGCGAGCAT GGAATTGTTG GCAATGGCTG GTATTTTCGT GACGAATGTG AAATTAAATT TTGGCGACAA TCGAACAAAT TAGTCCAATC ACCAAAAATT TGGGAAGCGG CCAAAGCACT TGATCCTAGT TTTACTTGTG CTAATTTGTT CTGGTGGTAC AACATGTATT CATCGGTTGA TGTTGCCGTC ACGCCCCGCC CGATGTACCC CGCCGATGGC CGTAAATTAC CTGATATTTA TAGCCAGCCA GCCGAATTGC GCGATGATTT GCAACGTGAT TTGGGCCAGT TTCCGTTATT TAATTTTTGG GGACCAAATT CATCAATTGC CTCATCACGC TGGATCGCCA ACTCGGCCAA ATCTGTTGAA CAACGCTATA ATCCAACATT GAGTTTAATT TATCTGCCAC ATTTGGATTA TTGTTTTCAG CAATATGGTG CTGATATTGA GCGTTGCGCC AGCCAATTGG CTGAAATCGA TCAAGTCGTT GGCGATTTGC TGGATTTTTA TGAGCAACGC AATGCGCGGA TCGTGATTTT ATCGGAATAT GGCATTACCA GCGTTAATCG ACCAGTCGCA ATTAATCGAT TGTTGCGGGC AGCAGGCTTG ATTGCAGTGC GCGAGGAATT GGGCCGCGAA TTACTTGATG CAGGCGTGAG CAAAGCCTTT GCTGTGGCTG ATCATCAAAT TGCCCATGTT TATATCAACG ATTTAAGCTA TTTGGAGCGC GTCAAAGCTT TGCTTGAGGC TACGCCTGGG ATTGCCAAAG TGCTTGATGC CGAGGGCAAA CGCGAGTATG GGCTAGATCA TGAGCGTTCG GGCGAGTTAG TGGCGATTGC TGAGGCCGAT GCTTGGTTTA GCTATTACTA CTGGCTTGAT GATCAGCGTG CGCCAGATTT TGCGCGGGCG GTCGATATTC ATCGCAAGCC TGGCTATGAC CCAGCGGAAC TCTTTGTTGA CCCGCATTTA CGCTTGCCGA TGGCCAAAAT TGGCCTGACT TTGGCCAAGA AAAAACTTGG ATTTCGCTAT GTGATGGATG TGATTGGGCT TGATCCAAGT GTGGTGCGTG GCTCGCATGG CATTATCAGC AGTGATCCGG CGCATGCGCC GCTGTTGCTG ACCAATCAGC CTAGCCTGTT GCCCGAAGCC GCCTTGCATG CGACCGATGT TTACCAGATT TTGTGGCGAC ACTTGATCGA AGCATGA
|
Protein sequence | MHKTVVINVV GLTPALIGQY TPRLQAWQQK TAQASIKPIV PAVTCSMQAT YLTGKMPSEH GIVGNGWYFR DECEIKFWRQ SNKLVQSPKI WEAAKALDPS FTCANLFWWY NMYSSVDVAV TPRPMYPADG RKLPDIYSQP AELRDDLQRD LGQFPLFNFW GPNSSIASSR WIANSAKSVE QRYNPTLSLI YLPHLDYCFQ QYGADIERCA SQLAEIDQVV GDLLDFYEQR NARIVILSEY GITSVNRPVA INRLLRAAGL IAVREELGRE LLDAGVSKAF AVADHQIAHV YINDLSYLER VKALLEATPG IAKVLDAEGK REYGLDHERS GELVAIAEAD AWFSYYYWLD DQRAPDFARA VDIHRKPGYD PAELFVDPHL RLPMAKIGLT LAKKKLGFRY VMDVIGLDPS VVRGSHGIIS SDPAHAPLLL TNQPSLLPEA ALHATDVYQI LWRHLIEA
|
| |