Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_04101 |
Symbol | solA |
ID | 4719598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 371236 |
End bp | 372420 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640080083 |
Product | putative sarcosine oxidase |
Protein accession | YP_001010726 |
Protein GI | 123965645 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0903484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATAA AGATCCCTGA ACATGTTGAT ATAGCCATAA TAGGTGGAGG TATGGCAGGC TTAAGTGCAG CAGCATCATT AAGTGAAATG GGAGTAAAAA ATATAGCTGT TTTTGAATCT GAAAAACTTG CACATTCCGA AGGTAGCAGT TTTGGGGAGT CCAGAATGTA CCGCGAAATG TATTCAGATC CTGTGTTATG CAAACTTGCT AAAGAATCAA ATAAATTATG GGCCGAACAG GAATCAATAG CAAAATATTC TCTAAGAAAA GAACATGGAT TATTGTTTTA TGGTGAATCC TGGGATGAAG AAACCATAGA AGGTTCTATT CCTGGAGCTC AGAAAGTGAT GGATGAGCAA AATATACCCT ATGAATTTTT AACATCTGAA AAAATTTCTG AAAGATTTCC TATAAAACCT AAAGAACATT TTGTTGGACT TTTTGAACCA AGTGCAGGAT CAATTTTAAG TGATAGAGCT ATTGAAAATT GGATTAAAAT CATAAGAAGT AATGGTAATC AGATTTATGA AGGTTGCAAA ATTAAAAGAA TAGAAGAAAA AAATAATTCT CTTGAAATAA TAGGAAATCA ATCTGTAAGT TTTGATCAAT TAATAGTTGC ATCTGGAATG TGGACTAATG AATTATTGGA ACCAATAGGT TTAAAACAAG ATGTAAAAAT CTGGCCAATG CTTTGGGCAC ATTATCTGGT TGACGAAGAT TTCATAAATA GTTATCCTCA ATGGTTTTGT TTCCAAAAAT CAAGAGGAGA TGATGGTGGA TTGTATTATG GTTTCCCCGT CTTAAGTCGA AATAAAAATA ATATACCCAG AATAAAAGTT GGAATTGATT GGACCTCTCC AAAACTAATT GGTGGTGATG AGGGCGCAAT GAAAAAACCT TTTATGGAAC CTCTAATAAA AATGCTGGAT GATTTCATTA TCAACAATTT CGAGGGTGTT ATCGTTTGTG ATGAAACTTT TGTAAGTCCT TATACAATGA CAAATGATGT TAATTTTATT CTCGATAAAC CCAAGTCAAA TATTACTGTT TTTTCTGGCG GATCTGGCCA GTCATTTAAA TTTGCACCAA TAATTGGTAA ATGTCTTGCG GAAAAAGCAT TAAATAAAAA TTGTAGTTTT GATATAAGTT GCTGGGAATT TGACAGATTT CTTACAACAA AATAA
|
Protein sequence | MNIKIPEHVD IAIIGGGMAG LSAAASLSEM GVKNIAVFES EKLAHSEGSS FGESRMYREM YSDPVLCKLA KESNKLWAEQ ESIAKYSLRK EHGLLFYGES WDEETIEGSI PGAQKVMDEQ NIPYEFLTSE KISERFPIKP KEHFVGLFEP SAGSILSDRA IENWIKIIRS NGNQIYEGCK IKRIEEKNNS LEIIGNQSVS FDQLIVASGM WTNELLEPIG LKQDVKIWPM LWAHYLVDED FINSYPQWFC FQKSRGDDGG LYYGFPVLSR NKNNIPRIKV GIDWTSPKLI GGDEGAMKKP FMEPLIKMLD DFIINNFEGV IVCDETFVSP YTMTNDVNFI LDKPKSNITV FSGGSGQSFK FAPIIGKCLA EKALNKNCSF DISCWEFDRF LTTK
|
| |