Gene Information |
Locus tag | Haur_1002 |
Symbol | |
ID | 5732905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1147161 |
End bp | 1148369 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278136 |
Product | metallophosphoesterase |
Protein accession | YP_001543778 |
Protein GI | 159897531 |
COG category | [R] General function prediction only |
COG ID | [COG1408] Predicted phosphohydrolases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCAT CAGAAGCAGA TAGTTTGCCG AAAGCTCGTC CATCGCAACG GCTCAATCGA CGCAAAATTG CCAGCATTAT TATGGCTTGG ATGGCGCTGT GCTGGCTGAT TATTGGTAGC GTGTTTTATA GTGTCGTGCC TGGTGGCTGG CTCAGCATTC TAGGCTTGAT GCTCTTGAGT TACATTCCAT TGCTTTTTGT AGCACGTTCG TTTGGTGGGC CAGTTGCACC ATCGGCGCAT ATCCGCTTGT GGGGCTTTCG GCCATTTTGG TATAGCCAAT TGCTCTTGCC ATTGATGGCA ATCGGTGGCC TAATTGGCTT GATTATTGGC TTGCCTTTCC ACAGCAGTGG GTTGTTGGGT CGTGGCTTGG CCGGTGGCAT CGGGTTACTC TATTTGACGG GCATTGGTTT GGCTTATTTT GGCTCACGCC GCTTGGCTGT GCGCGAATTG ACCGCTAATT TGCCCCAATT GCCCAACGAG CTAGCTGGCT TGAAAATTGT CCAGATCTCG GATACCCACG TTGGGCCGCA TACCTCGCGC CGCCACTTGC GCAACGTCGT CGCGGCAATC GAAGCCGCCA AGCCTGACCT GATTGTCATG ACTGGCGATC AAGTTGATGA TTATGTTGAT GACGTTGAAC CATTTGCCGC AGCCTTCGGC CAACTCTCAG CCCCCTTAGG CGTGGTTGCC ATCGCTGGCA ATCACGATGT CTATGCTGGT TGGGATGGCG TGCGGGCTGG ATTAGAAGCC ATGGGCATCA AGGTTTTGGT CAATCAAGCG ACGGCATTTA ATTATCGTGG CGTGCGTTGG TGGCTGGCAG GCACTGGCGA TCCGGCAGGA ACCTACGTGG CCCAAGGTCG GGAAATTGTG GCCCCTGATA TTCCCAAAAC CTTGGCTGAT GTTCCAGCCA ATGAGTTTCA TGTGGTTTTA GCCCACAACC CAGCGCTCTG GCCCGCTTTG GCCCAACGCA ACGTGCCACT AACCTTGAGC GGCCATACCC ACTACGGCCA ATTTGCCATT CCCAAACTTG GCTGGAGCAT GGCTTCGGCC TTTTTGGAGC ATGCCATGGG TCACTATCAG CTTGAGCAAT CGCTACTCTA CATCAACCCC GGCACGAACT ATTGGGGCAT TCCCTTCCGG CTCGGCACCA AGCCCGAAGT CACAGTAATT ACATTACAAC CCAGCCAAAC CGCATCCATC GTTGGGTAA
|
Protein sequence | MAASEADSLP KARPSQRLNR RKIASIIMAW MALCWLIIGS VFYSVVPGGW LSILGLMLLS YIPLLFVARS FGGPVAPSAH IRLWGFRPFW YSQLLLPLMA IGGLIGLIIG LPFHSSGLLG RGLAGGIGLL YLTGIGLAYF GSRRLAVREL TANLPQLPNE LAGLKIVQIS DTHVGPHTSR RHLRNVVAAI EAAKPDLIVM TGDQVDDYVD DVEPFAAAFG QLSAPLGVVA IAGNHDVYAG WDGVRAGLEA MGIKVLVNQA TAFNYRGVRW WLAGTGDPAG TYVAQGREIV APDIPKTLAD VPANEFHVVL AHNPALWPAL AQRNVPLTLS GHTHYGQFAI PKLGWSMASA FLEHAMGHYQ LEQSLLYINP GTNYWGIPFR LGTKPEVTVI TLQPSQTASI VG
|
| |
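The numeric fields in this record can be cross-checked against one another. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon (both assumptions appear consistent with the values above, since the gene sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Values taken from the record above.
length = gene_length(1147161, 1148369)
print(length)                  # 1209
print(protein_length(length))  # 402
```

Passing the space-separated gene sequence from the Sequence table to `gc_percent` should give a value close to the listed GC content, since the function strips the spacing between the 10-base blocks before counting.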