Gene NATL1_15671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15671 
Symbol 
ID4780356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1273099 
End bp1274001 
Gene Length903 bp 
Protein Length300 aa 
Translation table11 
GC content35% 
IMG OID640084849 
Productshort-chain dehydrogenase/reductase 
Protein accessionYP_001015389 
Protein GI124026273 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0732777 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.472072 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTAGAT TAAATGAAAT CAAGATGCAA GACGGAAAAG TATTCTTAAT TACCGGAGCC 
AATAGTGGAC TTGGCTATGA AACATCAAAA TTCCTTTTAG AAAGGGGAGC AACAGTAATC
ATGTCTTGCA GAGACTTGAT CAAAGGAGAG AAAGCCAAAC AAGAACTTTT AAAATTTAAT
TTTTCTGGAA AGATCGAATT AGTTGAATTA GATTTATCCG ATTTAATAAA CGTTAAAAAA
TTTGCTGAAT CTATAAAAAA TAAATTTGAT TACTTAGATG TTTTAATCAA TAATGCTGGG
ATAATGGCTC CACCAAAAAC TTTTAGCAAG CAAGGTTTTG AAATACAGTT TGCGGTTAAT
CATCTTGCAC ATATGTTTTT AACGTTAGAA CTATTACCCA TGCTTGAAGA AAAAAATAAT
TCTAGAGTTG TCACAGTAAC CTCAGGTGTC CAATATTTTG GAAAGATTCA GTGGGCAGAT
TTACAAGGAA ATCTTAAATA CGATCGTTGG GCTTCATATG CGCAGAGCAA GCTTGCAAAC
GTAATGTTTG GCTTAGAACT TGATTCAAAA CTTAAAGAAA GCAATTCAAA AACTTCTTCA
CTACTAGCTC ATCCAGGATT TGCACGTACA AATTTACAGC CAAAGTCTGT TGAGGCTAAT
CAGTCATGGC AAGAAGAACT TGCTTATAAA TTGATGGATC CCATGTTTCA AAGCGCGAAA
ATGGGTGCAT TGCCTCAAAT AACTGCCGCC ACATTAACTA GTGCTTCGGG AGGAGAACAA
TATGGACCTA GATTTAGCTT CAGAGGGTAT CCAAAAATAT GTAGCAATGC TCCAAAAGCA
TTAAATCAAA CTTCAAGAAA AAAATTGTGG GAAATAAGCG AAAAACTTAT AAAAGGTGTT
TGA
 
Protein sequence
MVRLNEIKMQ DGKVFLITGA NSGLGYETSK FLLERGATVI MSCRDLIKGE KAKQELLKFN 
FSGKIELVEL DLSDLINVKK FAESIKNKFD YLDVLINNAG IMAPPKTFSK QGFEIQFAVN
HLAHMFLTLE LLPMLEEKNN SRVVTVTSGV QYFGKIQWAD LQGNLKYDRW ASYAQSKLAN
VMFGLELDSK LKESNSKTSS LLAHPGFART NLQPKSVEAN QSWQEELAYK LMDPMFQSAK
MGALPQITAA TLTSASGGEQ YGPRFSFRGY PKICSNAPKA LNQTSRKKLW EISEKLIKGV