Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15671 |
Symbol | |
ID | 4780356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1273099 |
End bp | 1274001 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640084849 |
Product | short-chain dehydrogenase/reductase |
Protein accession | YP_001015389 |
Protein GI | 124026273 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0732777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.472072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAGAT TAAATGAAAT CAAGATGCAA GACGGAAAAG TATTCTTAAT TACCGGAGCC AATAGTGGAC TTGGCTATGA AACATCAAAA TTCCTTTTAG AAAGGGGAGC AACAGTAATC ATGTCTTGCA GAGACTTGAT CAAAGGAGAG AAAGCCAAAC AAGAACTTTT AAAATTTAAT TTTTCTGGAA AGATCGAATT AGTTGAATTA GATTTATCCG ATTTAATAAA CGTTAAAAAA TTTGCTGAAT CTATAAAAAA TAAATTTGAT TACTTAGATG TTTTAATCAA TAATGCTGGG ATAATGGCTC CACCAAAAAC TTTTAGCAAG CAAGGTTTTG AAATACAGTT TGCGGTTAAT CATCTTGCAC ATATGTTTTT AACGTTAGAA CTATTACCCA TGCTTGAAGA AAAAAATAAT TCTAGAGTTG TCACAGTAAC CTCAGGTGTC CAATATTTTG GAAAGATTCA GTGGGCAGAT TTACAAGGAA ATCTTAAATA CGATCGTTGG GCTTCATATG CGCAGAGCAA GCTTGCAAAC GTAATGTTTG GCTTAGAACT TGATTCAAAA CTTAAAGAAA GCAATTCAAA AACTTCTTCA CTACTAGCTC ATCCAGGATT TGCACGTACA AATTTACAGC CAAAGTCTGT TGAGGCTAAT CAGTCATGGC AAGAAGAACT TGCTTATAAA TTGATGGATC CCATGTTTCA AAGCGCGAAA ATGGGTGCAT TGCCTCAAAT AACTGCCGCC ACATTAACTA GTGCTTCGGG AGGAGAACAA TATGGACCTA GATTTAGCTT CAGAGGGTAT CCAAAAATAT GTAGCAATGC TCCAAAAGCA TTAAATCAAA CTTCAAGAAA AAAATTGTGG GAAATAAGCG AAAAACTTAT AAAAGGTGTT TGA
|
Protein sequence | MVRLNEIKMQ DGKVFLITGA NSGLGYETSK FLLERGATVI MSCRDLIKGE KAKQELLKFN FSGKIELVEL DLSDLINVKK FAESIKNKFD YLDVLINNAG IMAPPKTFSK QGFEIQFAVN HLAHMFLTLE LLPMLEEKNN SRVVTVTSGV QYFGKIQWAD LQGNLKYDRW ASYAQSKLAN VMFGLELDSK LKESNSKTSS LLAHPGFART NLQPKSVEAN QSWQEELAYK LMDPMFQSAK MGALPQITAA TLTSASGGEQ YGPRFSFRGY PKICSNAPKA LNQTSRKKLW EISEKLIKGV
|
| |