Gene OSTLU_12872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_12872 
Symbol 
ID5003740 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp200065 
End bp201198 
Gene Length1134 bp 
Protein Length330 aa 
Translation table 
GC content61% 
IMG OID640419161 
ProductArsAB family transporter: arsenite (ArsA) 
Protein accessionXP_001419523 
Protein GI145350244 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.302636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGACG GACGCGAGCG GAAGTATTAC ATGGTCGGAG GCAAGGGCGG GGTCGGGAAG 
ACGTCGCTGT CGAGCTCGCT CGCGGTGAAG TTCGCGAGCG CCGGGCACGA GACGCTGGTG
GTGAGCACGG ACCCGGCGCA CTCGCTGAGC GATTCGTTGG CGCAGAACGT CAGGGGAGGG
CAGCCGGTGG AGGTGAACGA TACGGATGGG ATGCTGTACG CGCTGGAGAT CGATCCGGAG
AGCGCGAAGG CGGAGTTTAC GCAATTCGCG CGGGCGACGG ACATGAGCGG GGGGGCGAGA
GATTTTATGA GCTCGGTCGG TTTGGGCGGG TTCGCGGACT CGATCGCGGA TTTGAAGCTC
GGGGAGCTCT TAGACACGCC GCCGCCGGGG TTGGACGAGG CGATCGCGAT CGCCAAGGTG
TTGCAGTTTA CGAAGGATGA GAAATTTAGC AAGTTTACGC GCATCGTCTT CGACACGGCG
CCGACGGGGC ACACGCTGCG GTTGCTGTCG CTTCCGGATT TCCTCGACGC GTCGATCGGG
AAGATCGTGC GATTACGTCA AAAGCTCACG AGCGCGACGG ATGCGGTGAA GGGGATCTTC
GGCGTGGGTG AGGACAAGCA GGACGACGCG GTGGAGAAGC TCGAAAAGCT CAAGGCGCAA
GTCAAGGAAG TGCGCACGCT GTTTCGAAAC AAAGACACCA CCGAATTCAT CATCGTCACC
ATCCCCACGG TGCTAGGCGT GAGCGAATCG GGCCGCTTGT TACAAAGCCT TCGCGACGAG
GACGTGCCGT GCAAGCGGTT AATCGTCAAC CAAGTGCTCA AGGTGAACGT CGACGACTTT
AAAGCCACCG CCGCCGAGGC GCGAGACGCC CAAGACGCCC TCGTCGCGCG ATTATCCGGC
GACGACGCGG AGGCGTTACA AAAGTACGTC GATTTGAACG CCAAAGCCTT GAAAGCGGCG
CAGGCCGCGG TGACGTTTTG CAGCGTCAAG GAAAAAGACC AGACGCGCGC GTTGCAAATG
TGCGAAGAAG ACGCGGGATT AAACTCTCTC AATCGCACCG ACGCGCCGCT GTTTGACATG
GAAATTCGCG GCGTTCCGGC GTTGAAATTC TTCGGCGACC AAGTGTGGCG ATAG
 
Protein sequence
MLDGRERKYY MVGGKGGVGK TSLSSSLAVK FASAGHETLV VSTDPAHSLS DSLAQNVRGG 
QPVEVNDTDG MLYALEIDPE SAKAEFTQFA RATDMSGGAR DFMSSVGLGG FADSIADLKL
GELLDTPPPG LDEAIAIAKV LQFTKDEKFS KFTRIVFDTA PTGHTLRLLS LPDFLDASIG
KIVRLRQKLT SATDAVKGIF GVGEDKQDDA VEKLEKLKAQ VKEVRTLFRN KDTTEFIIVT
IPTVLGVSES GRLLQSLRDE DVPCKRLIVN QVLKAAVTFC SVKEKDQTRA LQMCEEDAGL
NSLNRTDAPL FDMEIRGVPA LKFFGDQVWR