Gene PMN2A_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPMN2A_0447 
Symbol 
ID3605821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL2A 
KingdomBacteria 
Replicon accessionNC_007335 
Strand
Start bp981609 
End bp982637 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content31% 
IMG OID637687307 
ProductACR3 family arsenite transporter 
Protein accessionYP_291642 
Protein GI72382287 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.881711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTAA TAGATCGATA CCTAAGTTAT TTTATCGCTG TTTCTATGAT TCTAGGGGTC 
TCTATTGGAT CTATCTTTCC TAATTTTTCT AATTATATTT CCTCTTTAGA ACTAACAGGT
ATCAATCTAC CTATAGCTTT TTTGATATGG GGAATGATAA TTCCGATGAT GTTATCAATA
AATTTTAATT CTATTATCAA AATCAAGGAT AGGCCACAGG CGATTTTGAT TACGATAATA
GTGAATTGGT TAATCAAACC AATACTTATG ACAGGTATAG CTATATTATT TATAAATAAT
ATATTTTCCT CTTGGATTGA TGCTGCTAAA GCTTCAGAAT ATATTTCTGG GATGATTCTA
TTAGGAGTTG CTCCTTGCAC TGCAATGGTT TTTGTCTGGA GTAATCTTGT TAAAGGGAAC
TCTAACTATA CTTTGGTCCA GGTTATTATT AATGATCTTA TTTTATTATT TGCTTTTGCA
CCTATTGCTT CTTTCTTGCT TGGTGTTAAT CAAATCAAAA TACCCTTATG GACTATATTT
AACTCTGTTT TAATTTATGT ATTTATACCG CTTTTATTCT GCTTATTGAT CAAAAAAATT
GTTAATGATG CAGCAAAAAT TTATACGATA AATAATTTTT TGAAGCCAAT TTCTGGGATT
TGCTTGGTTT TAACTGTTTT ATTTTTATTT TTAGTACAAG CTAGTGAGGT TATCAATAAC
CCATTCCAAA TCTTATTAAT AGCTATCCCT TTAATCATTC AGACCTTTTT GATCTTTTTT
ATTACAGCAA TTCTTTTGAG AATATTTAAT CAAGAAAAAT CAATAGCAGG TCCAGCTTCA
ATGATTGGGG CTTCCAATTT CTTTGAATTA GCTGTTGCTA TCGCAATAAG CCTTTTTGGT
GTTAATTCAG GTGCCGCAAC TGCGACGGTT GTAGGTGTTT TAGTTGAAGT GCCGGTAATG
CTATCTTTGG TTGGCATTGT TAACAATAAT GATTATTTAT TTCCTACTCG AGCTAAAAGC
TTTCGCTGA
 
Protein sequence
MSLIDRYLSY FIAVSMILGV SIGSIFPNFS NYISSLELTG INLPIAFLIW GMIIPMMLSI 
NFNSIIKIKD RPQAILITII VNWLIKPILM TGIAILFINN IFSSWIDAAK ASEYISGMIL
LGVAPCTAMV FVWSNLVKGN SNYTLVQVII NDLILLFAFA PIASFLLGVN QIKIPLWTIF
NSVLIYVFIP LLFCLLIKKI VNDAAKIYTI NNFLKPISGI CLVLTVLFLF LVQASEVINN
PFQILLIAIP LIIQTFLIFF ITAILLRIFN QEKSIAGPAS MIGASNFFEL AVAIAISLFG
VNSGAATATV VGVLVEVPVM LSLVGIVNNN DYLFPTRAKS FR