Gene Information |
Locus tag | Haur_1228 |
Symbol | |
ID | 5733121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1421012 |
End bp | 1421995 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278368 |
Product | selenide, water dikinase |
Protein accession | YP_001544004 |
Protein GI | 159897757 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase |
TIGRFAM ID | [TIGR00476] selenium donor protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGCAG GTGCTCTCAG CAATTTAATT AGTAGCTTGC GTTTACCCAG CAGCCCGCAG CTTTTAGTTG GCCTTGATGT CAGCGATGAT GCGGCGGTCT ATCAACTCAA CCAACAGCAA GCGCTCGTCC AAACGGTGGA TTTCTTTCCG CCGATTGTTG ATGATCCCTA TAGTTTTGGG GCGATTGCTG CCGCCAACGC GCTAAGCGAT GTTTATGCCA TGGGCGGCAA GCCAATTTTA GCCTTGGCAA TTGCTGGATT CCCACGCGAT TTAGACCCAG CGATCATTCA GGCTATTATG CAGGGCGGCG CAGATAAAGT GGCCGAAGCC GGAGCTGTGC TAGCTGGCGG CCATACGATC ATCGATAATG AGCCAAAATA TGGTTTATGC GTGACTGGTT TAATCGATCC AGCCCTAATC ACCCGTAAAG CTCAAGCTCA ACCAGGCGAT CAACTCTACC TTACCAAAGC GCTGGGCACA GGCGTAATTA GCACTGCCAG CAAGCGCCAA ATTGCTGATC CAGCCGATTT AGCAGCAGCA ATTGAGAGCA TGCTCAAACT CAATCGCCAT GCAGCTGAGC ATATCGCAAT GCTGGGCACA ATTCGTAGTG CCACCGATAT CACTGGGTTT GGTTTGTTGG GCCATGGCTT GGAGCTAGCT CGCAATAGTG GGGTTGGGCT ACAGATTAAT AGTCGAGCCT TGCCATTGTT GTCAGGAGCT TATCGGTATG CCAAAGCCGG AATTATGCCT GGTGGCTTGC ATACCAATCG TGCTTATGTT GAGCAACAAA TGCAAGTTAA TTATGCCAAA ACAGTTGATC CAGTCCATCA AGCTTTGTTG TACGATCCTC AAACTTCAGG CGGCTTGTTA ATTGCGGTAG CGGCGGAGCA TGCCAAAGCA CTTGAACAAC AATTTGCTCA AGCTGGCGAC CCACTTTGGT CAATTGGTAC AGTCATTGAA CTGCAAGCGC TTGAAATTGT TTAA
|
Protein sequence | MGAGALSNLI SSLRLPSSPQ LLVGLDVSDD AAVYQLNQQQ ALVQTVDFFP PIVDDPYSFG AIAAANALSD VYAMGGKPIL ALAIAGFPRD LDPAIIQAIM QGGADKVAEA GAVLAGGHTI IDNEPKYGLC VTGLIDPALI TRKAQAQPGD QLYLTKALGT GVISTASKRQ IADPADLAAA IESMLKLNRH AAEHIAMLGT IRSATDITGF GLLGHGLELA RNSGVGLQIN SRALPLLSGA YRYAKAGIMP GGLHTNRAYV EQQMQVNYAK TVDPVHQALL YDPQTSGGLL IAVAAEHAKA LEQQFAQAGD PLWSIGTVIE LQALEIV
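The fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database or its tooling: the helper names (`clean`, `gene_length`, `gc_percent`, `translate`) are hypothetical. It assumes inclusive coordinates and that the gene sequence is shown in coding orientation, which fits the leading ATG and trailing TAA. The codon-to-amino-acid assignments used here are those of the standard code, which translation table 11 shares. Alternative start codons of table 11 are not handled, which does not matter for this record because it starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


length = gene_length(1421012, 1421995)
print(length)            # 984, matching "Gene Length"
print(length // 3 - 1)   # 327, matching "Protein Length" (one codon is the stop)
print(translate("ATGGGGGCAG GTGCTCTCAG CAATTTAATT"))  # MGAGALSNLI
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the protein sequence and GC content fields of the record.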