Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27426 |
Symbol | HMA1 |
ID | 5005514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 246663 |
End bp | 249656 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420935 |
Product | P-ATPase family transporter: cadmium/zinc ion; heavy metal translocating P-type ATPase family-like protein |
Protein accession | XP_001421427 |
Protein GI | 145354301 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2217] Cation transport ATPase |
TIGRFAM ID | [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC [TIGR01525] heavy metal translocating P-type ATPase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.040121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.339459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG ACGAGGACGA GTTCGTGTCG CAGGCGGGGA TGATGGCGGC GACGAGCTCC GGGGAGTTTG CACCGCGCGG ACGCACGGAG AGGGCGCTGG ATGCGATGTT TAACGCCACT GGTTTGCACG CGGTGGCGGA TTTCCTGCGC GGGAACGCGG CGGTGGCGGT GGTGTCGTGG GCGCTGTTTC TCGTCGCGGG GGCGGCGCAC GTGGCGACGC ACGTCGGCGG CGGCGGATTG GTGGAGACGA CGATGGCGGC GAACGTCAGT CGGGTGTGCA CGATTCTGGT TTACGTGTTG GCGGGGACGC CCGAGTTTGT GGACGTGACG TACGAGCTCG CGGTGGGGAA CGTGAACATT CACGTGCTCA CGACGTTGGC CGTGTTCGGC ACCGTGCTCT TGGGGTGCGC GATGGAGGGC GCGCTGCTCT TGGTGCTCTT CGCCCTGGCG CACTTCGTGG AGGATCGATT GACGCTGCAC GCTCGGGGGG ATTTGAAGGC GTTGTGGGGC ACGGTGCCCA CGACGGCGGA CGTCGTGCAG TTGCTCGCCG ATGGGACGCC GGATTTGACG ACGCTCCGAG AGATGCCCGC GGCGGACGTC GACGTTGGGA CGATGATTTT CGTCAAGGCT GGACAGCAAG TGCCGTTGGA TGGTATGGTT GTCCACGGCA GCGCGTTAGT GAGCATTCAA CACATCACCG GTGAGGCTTT GCCGGTGATG AAGAGATACG GCGATGAGAT TCCCGCCGGC GCGATGAACA CCGACGGCGT GCTCGTGGTG AAGAGCTTGA GATCGAGCGA GGAGAGCACA CCGGCGCGAA TCGCGCGACT CACCGAGGCT GCGCAGCGGC GTCGTCCGAA AGTCTCACGA TTGATCGATA GCATAGGAGA CCGCTACAGT AAGGCGATAT TGGCCATCAC CTTTATTACC ATGGCCGCGG CGCCGATGGT GCTCGGGATT CCATTTTTGG GTCGCGGCGG TGCCATGTAT CGCAGCTTTG CGTTTTTATC CTCCGCAGCG CCGTGTGCAT TGTTGATGTC GCCGCTCGTG TACGTGGCGG CCATCGGGGC CATGGCTCGG CGAGGTGTGC TCATTCGAGG CGGCTTGACT CTCGACGCTT TGGCCGAAGT CGGCGCGGTG GCGCTCGATA AGACGGGAAC CATCACGACG GGGCAAATGA GCTGCACGAA TATCACACAA TTTGAAGACG GTGCGCAACA CGACACTGCT CCGAGCGCGT CAGTCAAGGC TCTGGCGTAC GCGCTGAGCC TCGAACGCGG TTCGTCCCAC CCGATTGCCG CCGCGGTGAC GCAAGCGGCG CGGGGGATCA TTTTGCCAGA CGTCGGTCCC GTGACGGATT ACAAAGTCAT CGCGGGGAGT GGTGTGGAAG GCACGATCGA CGGCAAGCGT GCGCGATTCG GTAGCTCCGA GTTTGCGCTC GAGCTGTGTA ACACTGGCGC TGCGGAATGC TCCGCCGAGG TCATCGCGCA AGAGGGCGAA GTGCTCTCTG TGCTCGCGAT TGAGGGCGAA AGCCCGGCGC TGTTTAGGTT TAGTGATACG CTTAATTTGC AGGCGCCCGA TGCCATCGAG TCCCTGCGTA CTGGCAAGTG CCGACGGAGA TGGTGGGAGG TGGCGAGCGA CTCGACGGGT ATGGAGTTGG CCATGCTCAC GGGCGACAAC AAGACGAGCG CCATAGCCAT GGCTGAGGAG ATCAAGTTGA AACCGGAAGA CGTACACGCC GGATTGACTC CGTCTCAAAA GCTTGCACTG GTTGAATCAA TGCGAGAGCG CGTGAAATCC AAGCAATATC CTCGCGTGGC CATGGTTGGC GATGGTATCA ACGATGCCCC AGCGCTCGCC GCCGCGGATG TCGGCGTCGC CATCGCGTCG ACGCCGAGCG ATGCCGCCGC GAGTGCGGCG GATGTACTGT TGCTCACTAA AGACGAAGGA GGCATCTCTC AATTACCAGA GCTGTTTTCC ATCGCCGAGC GCACGCGACG CACTTTACGA CAAAACATCG CACTCGCCGT GGTATCAATC TTGGGGTCAG CTATTCCCGC TCTCTTCGGC GCGTTTCCGC TATGGTTGGC GGTATTGCTG CACGAAGGCG CGACCCTCAT GGTGGCGATG AACTCAGTGC GTTTGCTTGT CACGTTCGGG CGACAGCCGT TGTCCAAGAG TACTACCGTG GCCCTCACGA CGTTCACCGT GTTTTGTTGC TTGGGCGCAG CGTACTCGGT GTGCAGCGAA GCGATTGCCA TTTGGGTCAA ACATTTGCAC GTCGCACACT GGATGAGCGT AATCACGGCG TTTAAGTCTG CATGGGCTGG TCTATTAGCC GGTTGCTTGC ACACGCTCAC GGGGCCAGAC CACTTAGCGG CACTCACGCC ACTGACAGTG GGACCGAGCA GAGCGCAAAA CGCGTTGATG GGCGCGCTTT GGGGGCTCGG GCATAACACG GGTCAAATTT TATTTGGATG TATTTTCATC GCTTTGAGGG ATAAATTACC GCTCAACTTG GAAGTTATCG GTCAGTTTGG ACAAGGAATC GTCGGTTTGA CGCTCATTAT TATTGGCGCT ATTGGCTTTT GGGAAAGCAT GGGCGGTCAC TCGCACTCAC ATTCACACTC ACACTCACAC TCGCATTCGC ACTCGCATGG GAGCGATGTG CCCGCGAAGC GCGATAGCAG CTTCATCTCA TGGACGTATA TTACTGGCAC CATCCATGGA TTACAACCGG ATTCATTGTT TCTGCTTTTA CCTGCCCTGG CGCTCCCTCG CGTCGAGGCG ATTTCTTTTT TGGCGACGTT CTTCATCGGA ACCATTATCG CCATGGGGAC GTACACGTAC TGCATTGGTG CGGGTACGGC AGCGTTGGAG AAGAATAATC CCAAGTTTGT GAGCTATATC GCTCGTGGAT CGAGCGCCGT CGCGCTCGCC TTCGGTGTAT TATTTGTCAT CAGCGCCGTA TTTGGTCTTG ATTTGATTTT TTAG
|
Protein sequence | MSADEDEFVS QAGMMAATSS GEFAPRGRTE RALDAMFNAT GLHAVADFLR GNAAVAVVSW ALFLVAGAAH VATHVGGGGL VETTMAANVS RVCTILVYVL AGTPEFVDVT YELAVGNVNI HVLTTLAVFG TVLLGCAMEG ALLLVLFALA HFVEDRLTLH ARGDLKALWG TVPTTADVVQ LLADGTPDLT TLREMPAADV DVGTMIFVKA GQQVPLDGMV VHGSALVSIQ HITGEALPVM KRYGDEIPAG AMNTDGVLVV KSLRSSEEST PARIARLTEA AQRRRPKVSR LIDSIGDRYS KAILAITFIT MAAAPMVLGI PFLGRGGAMY RSFAFLSSAA PCALLMSPLV YVAAIGAMAR RGVLIRGGLT LDALAEVGAV ALDKTGTITT GQMSCTNITQ FEDGAQHDTA PSASVKALAY ALSLERGSSH PIAAAVTQAA RGIILPDVGP VTDYKVIAGS GVEGTIDGKR ARFGSSEFAL ELCNTGAAEC SAEVIAQEGE VLSVLAIEGE SPALFRFSDT LNLQAPDAIE SLRTGKCRRR WWEVASDSTG MELAMLTGDN KTSAIAMAEE IKLKPEDVHA GLTPSQKLAL VESMRERVKS KQYPRVAMVG DGINDAPALA AADVGVAIAS TPSDAAASAA DVLLLTKDEG GISQLPELFS IAERTRRTLR QNIALAVVSI LGSAIPALFG AFPLWLAVLL HEGATLMVAM NSVRLLVTFG RQPLSKSTTV ALTTFTVFCC LGAAYSVCSE AIAIWVKHLH VAHWMSVITA FKSAWAGLLA GCLHTLTGPD HLAALTPLTV GPSRAQNALM GALWGLGHNT GQILFGCIFI ALRDKLPLNL EVIGQFGQGI VGLTLIIIGA IGFWESMGGH SHSHSHSHSH SHSHSHGSDV PAKRDSSFIS WTYITGTIHG LQPDSLFLLL PALALPRVEA ISFLATFFIG TIIAMGTYTY CIGAGTAALE KNNPKFVSYI ARGSSAVALA FGVLFVISAV FGLDLIF
|
| |