Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_0729 |
Symbol | |
ID | 3606107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | + |
Start bp | 1221589 |
End bp | 1222515 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637687592 |
Product | short-chain dehydrogenase/reductase |
Protein accession | YP_291923 |
Protein GI | 72382568 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.486158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAGAT TAAATGAAAT CAAGATGCAA GATGGAAAAA TATTCTTAAT CACTGGAGCC AATAGCGGTC TTGGTTATGA AACATCAAAA TTCCTTCTAG AAAGAGGGGC AACAGTAATC ATGTGTTGCA GAGACTTGCT CAAAGGGGAG AAAGCCAAAA AAGAACTTTT AAAATTTAAG TTTTCTGGAA AAATTGAACT AGTTGAATTA GATTTATCCG ATTTAATTAA TGTTAAAAAA TTTGCTGAAT CAATAAAAAA TACATTTGAT CACTTAGATG TCTTAATCAA TAATGCTGGA ATAATGGCTC CCCCAAAGAC TCTTAGCAAG CAAGGTTTCG AAATACAGTT TGCTGTTAAT CATCTTGCAC ACATGTTTTT AACCTTAGAG TTACTACCCA TGCTTGAAGA AAAAAATAAT TCTAGGGTTG TCACAGTAAC CTCAGGCGTT CAATATTTTG GAAAAATTCA ATGGGAGGAT CTACAAGGAA ATCTTAAATA TGATAGGTGG GCTTCATATG CGCAGAGCAA GCTTGCAAAC GTAATGTTTG GATTGGAACT CGATTCAAAA CTTAAGGAAA CCAATTCAAA AACTTCTTCA CTACTAGCTC ATCCAGGATT TGCACGTACA AATTTACAGC CAAAGTCTGT TGAGGCTAAT CAATCATGGC AAGAAGAACT TGCTTATAAA TTGATGGATC CCATGTTTCA AAGCGCGAAA ATGGGAGCAT TACCTCAAAT AACTGCCGCC ACATTAACTA GCGCTTCGGG AGGAGAACAA TATGGACCTA GGTTCAACTT CAGAGGGTTC CCGAAAATCT GTAGAAATGC TCCAAAAGCA TTAAATCAAA CTTCAAGAAA AAAATTGTGG GACATAAGCG AAAAGCTCAT AAAAGATTTT GATACCCTCT CAAAACAAAG TAAGTAA
|
Protein sequence | MVRLNEIKMQ DGKIFLITGA NSGLGYETSK FLLERGATVI MCCRDLLKGE KAKKELLKFK FSGKIELVEL DLSDLINVKK FAESIKNTFD HLDVLINNAG IMAPPKTLSK QGFEIQFAVN HLAHMFLTLE LLPMLEEKNN SRVVTVTSGV QYFGKIQWED LQGNLKYDRW ASYAQSKLAN VMFGLELDSK LKETNSKTSS LLAHPGFART NLQPKSVEAN QSWQEELAYK LMDPMFQSAK MGALPQITAA TLTSASGGEQ YGPRFNFRGF PKICRNAPKA LNQTSRKKLW DISEKLIKDF DTLSKQSK
|
| |