Gene Information |
Locus tag | P9303_11311 |
Symbol | |
ID | 4776609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1011717 |
End bp | 1012727 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640086640 |
Product | arsenite efflux pump ACR3 |
Protein accession | YP_001017145 |
Protein GI | 124022838 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
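The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1011717, 1012727)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields.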
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0544357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCTGT TCGAGCGATA TCTCTCTGTA TGGGTTGGTC TTGCGATGGT GGCGGGTGTG GTTCTCGGCG GCCTGCTCCC TGATCTGGCG GGCTGGATTG CCTTGCTGGA GGTTGCCCGC ATCAATCTGC CGATCAGTGT TTTGGTCTGG GGGATGATTT TCCCGATGAT GCTGGCGGTG GATTTCACCG CAATCGGCGA CATCCACCAG CAGCCGCGGG GACTGCTGAT CACCGTTGCG GTGAACTGGC TGATAAAACC ACTCACCATG GCGGCACTGG CCTGGCTGTT CATCCGTGGG TTGTTCTCCG CCTGGATTCC TGAAGCGATG GGCCAGGAGT ATGTGGCCGG GATGATTCTT CTGGGGGTGG CTCCCTGTAC GGCGATGGTC TTCGTGTGGA GTCGCCTCAG CGATGGCGAC CCCAACTACA CCCTGGTGCA GGTGGCCATT AACGACATGA TCATGGTGTT CGCTTTCGTT CCGATCGCGA CCCTGCTACT CGGGGTGTCG GATGTGCTTG TGCCCTGGGC CACGATGTTC ACCGCGGTGG GGCTGTTCGT GGTGTTTCCC CTGGTTGCGG GCTGGCTGAC GCGGGTATTT CTGAGGAGTC CAGGCCGGAT CGAGCGACTG GAGGTAAGGC TCAAACCCTT TGCCATAACT GCCCTGATTG CCACCGTATT GCTGCTTTTT ATGGTGCAGG CCCAGGCCAT CCTTTCGAAG CCCCTGGCGA TCGTGATGAT CGCGGTTCCA CTTATTATCC AGACCTATCT GATTTTCTGG ATCACTGCCC GCTGGATGCA CCTTTGCGGT CAGCCACGCA CGGTTGCTGC ACCCGGAGCC ATGATTGGCG CATCCAATTT CTTCGAGCTG GCTGTTGCCG TTGCGATAAG CCTGTTTGGG TTGAATTCCG GCGCCGCCCT CGCCACCGTG GTGGGCGTTC TGGTGGAGGT TCCGGTGATG CTGTCGCTGG TGGCCATCGC CAATCGCAAC AAACGACTGT TCCCTGGCTG A
|
Protein sequence | MGLFERYLSV WVGLAMVAGV VLGGLLPDLA GWIALLEVAR INLPISVLVW GMIFPMMLAV DFTAIGDIHQ QPRGLLITVA VNWLIKPLTM AALAWLFIRG LFSAWIPEAM GQEYVAGMIL LGVAPCTAMV FVWSRLSDGD PNYTLVQVAI NDMIMVFAFV PIATLLLGVS DVLVPWATMF TAVGLFVVFP LVAGWLTRVF LRSPGRIERL EVRLKPFAIT ALIATVLLLF MVQAQAILSK PLAIVMIAVP LIIQTYLIFW ITARWMHLCG QPRTVAAPGA MIGASNFFEL AVAVAISLFG LNSGAALATV VGVLVEVPVM LSLVAIANRN KRLFPG
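The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a record could be cleaned and checked follows. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 bases of the gene are used for brevity.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
gene_prefix = "ATGGGTCTGT TCGAGCGATA TCTCTCTGTA"
print(translate(gene_prefix))  # MGLFERYLSV
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` would allow a comparison with the GC content field of the record.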