Gene P9515_04101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_04101 
SymbolsolA 
ID4719598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp371236 
End bp372420 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content33% 
IMG OID640080083 
Productputative sarcosine oxidase 
Protein accessionYP_001010726 
Protein GI123965645 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0903484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATAA AGATCCCTGA ACATGTTGAT ATAGCCATAA TAGGTGGAGG TATGGCAGGC 
TTAAGTGCAG CAGCATCATT AAGTGAAATG GGAGTAAAAA ATATAGCTGT TTTTGAATCT
GAAAAACTTG CACATTCCGA AGGTAGCAGT TTTGGGGAGT CCAGAATGTA CCGCGAAATG
TATTCAGATC CTGTGTTATG CAAACTTGCT AAAGAATCAA ATAAATTATG GGCCGAACAG
GAATCAATAG CAAAATATTC TCTAAGAAAA GAACATGGAT TATTGTTTTA TGGTGAATCC
TGGGATGAAG AAACCATAGA AGGTTCTATT CCTGGAGCTC AGAAAGTGAT GGATGAGCAA
AATATACCCT ATGAATTTTT AACATCTGAA AAAATTTCTG AAAGATTTCC TATAAAACCT
AAAGAACATT TTGTTGGACT TTTTGAACCA AGTGCAGGAT CAATTTTAAG TGATAGAGCT
ATTGAAAATT GGATTAAAAT CATAAGAAGT AATGGTAATC AGATTTATGA AGGTTGCAAA
ATTAAAAGAA TAGAAGAAAA AAATAATTCT CTTGAAATAA TAGGAAATCA ATCTGTAAGT
TTTGATCAAT TAATAGTTGC ATCTGGAATG TGGACTAATG AATTATTGGA ACCAATAGGT
TTAAAACAAG ATGTAAAAAT CTGGCCAATG CTTTGGGCAC ATTATCTGGT TGACGAAGAT
TTCATAAATA GTTATCCTCA ATGGTTTTGT TTCCAAAAAT CAAGAGGAGA TGATGGTGGA
TTGTATTATG GTTTCCCCGT CTTAAGTCGA AATAAAAATA ATATACCCAG AATAAAAGTT
GGAATTGATT GGACCTCTCC AAAACTAATT GGTGGTGATG AGGGCGCAAT GAAAAAACCT
TTTATGGAAC CTCTAATAAA AATGCTGGAT GATTTCATTA TCAACAATTT CGAGGGTGTT
ATCGTTTGTG ATGAAACTTT TGTAAGTCCT TATACAATGA CAAATGATGT TAATTTTATT
CTCGATAAAC CCAAGTCAAA TATTACTGTT TTTTCTGGCG GATCTGGCCA GTCATTTAAA
TTTGCACCAA TAATTGGTAA ATGTCTTGCG GAAAAAGCAT TAAATAAAAA TTGTAGTTTT
GATATAAGTT GCTGGGAATT TGACAGATTT CTTACAACAA AATAA
 
Protein sequence
MNIKIPEHVD IAIIGGGMAG LSAAASLSEM GVKNIAVFES EKLAHSEGSS FGESRMYREM 
YSDPVLCKLA KESNKLWAEQ ESIAKYSLRK EHGLLFYGES WDEETIEGSI PGAQKVMDEQ
NIPYEFLTSE KISERFPIKP KEHFVGLFEP SAGSILSDRA IENWIKIIRS NGNQIYEGCK
IKRIEEKNNS LEIIGNQSVS FDQLIVASGM WTNELLEPIG LKQDVKIWPM LWAHYLVDED
FINSYPQWFC FQKSRGDDGG LYYGFPVLSR NKNNIPRIKV GIDWTSPKLI GGDEGAMKKP
FMEPLIKMLD DFIINNFEGV IVCDETFVSP YTMTNDVNFI LDKPKSNITV FSGGSGQSFK
FAPIIGKCLA EKALNKNCSF DISCWEFDRF LTTK