Gene NATL1_11591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_11591 
Symbol 
ID4780546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1035570 
End bp1036598 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content31% 
IMG OID640084438 
Productarsenite efflux pump ACR3 and related permeases 
Protein accessionYP_001014982 
Protein GI124025866 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0590544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTTG TAGATCGATA CTTAAGTTAT TTTATCGCTG TTTCTATGAT TCTAGGGGTT 
TCTATTGGAT CTATCTTTCC TAATGTTTCC AATTATATTT CCTCTTTAGA ACTAACAGGT
ATCAATCTAC CTATAGCTTT TTTGATATGG GGAATGATTA TTCCGATGAT GTTATCAATA
AATTTTAATT CTATTATCAA AATCAAGGAT AGGCCACAGG CAATTTTAAT TACAGTAATA
GTGAATTGGT TAATCAAACC AATACTTATG ACAGGTATAG CTATATTATT TATAAATAAT
ATATTTTCTA CTTGGATTGA TACTGGTAAA GCATCAGAAT ATATTTCTGG AATGATTCTA
TTAGGAGTTG CTCCCTGCAC TGCAATGGTT TTTGTTTGGA GTAATCTTGT TAAAGGGAAC
TCTAACTATA CTTTGGTCCA GGTTATTATT AATGATCTTA TTTTATTATT TGCTTTTGCA
CCTATTGCTT CTTTCTTGCT TGGTGTTAAT CAAATCAAAA TACCCTTATG GACTATATTT
AACTCTGTTT TGATTTATGT ATTTATACCA CTTTTATTCT GCTTATTAAT GAAAAAAATT
GTTAATAATG CAGCAAAGAT TCATATGATA AACAATTTTT TGAAGCCCGT TTCTGGGATT
TGCTTGGTTT TAACTGTTTT ATTTTTATTT TTAGTACAAG CTAGTGAGGT TATCAATAAC
CCATTCCAAA TCTTATTAAT AGCTATACCT TTAATTATTC AGACCTTTTT GATCTTTTTT
ATTGCAGCAA TTCTTATGAG AATATTTAAT CAAGAAAAAT CAATAGCAGG TCCAGCGTCA
ATGATTGGGG CTTCCAATTT CTTTGAATTA GCTGTTGCTA TCGCAATAAG CCTTTTTGGT
GTTAATTCAG GTGCCGCAAC TGCGACGGTT GTTGGTGTTT TAGTTGAAGT GCCAGTAATG
CTATCTTTGG TTGGCATTGT TAACAATAAT GATTATTTAT TTCCTACTCG AGCTAAAAGC
TTTAACTGA
 
Protein sequence
MSFVDRYLSY FIAVSMILGV SIGSIFPNVS NYISSLELTG INLPIAFLIW GMIIPMMLSI 
NFNSIIKIKD RPQAILITVI VNWLIKPILM TGIAILFINN IFSTWIDTGK ASEYISGMIL
LGVAPCTAMV FVWSNLVKGN SNYTLVQVII NDLILLFAFA PIASFLLGVN QIKIPLWTIF
NSVLIYVFIP LLFCLLMKKI VNNAAKIHMI NNFLKPVSGI CLVLTVLFLF LVQASEVINN
PFQILLIAIP LIIQTFLIFF IAAILMRIFN QEKSIAGPAS MIGASNFFEL AVAIAISLFG
VNSGAATATV VGVLVEVPVM LSLVGIVNNN DYLFPTRAKS FN