Gene NATL1_04241 details

Gene Information

Locus tag: NATL1_04241
Symbol:
ID: 4779630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 394304
End bp: 395683
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 36%
IMG OID: 640083699
Product: putative aldehyde dehydrogenase
Protein accession: YP_001014253
Protein GI: 124025137
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
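
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_protein_length` are hypothetical helpers introduced for illustration, not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 394304
end_bp = 395683
gene_length_bp = 1380
protein_length_aa = 459


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_protein_length(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the listed gene length (1380 bp) matches the coordinate span, and 1380 / 3 = 460 codons gives 459 residues once the stop codon is excluded.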


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.844488
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTAG AAAATTTCGT ACTTAATCAA TTACAAGATC TAGTTCTATC TGGCAAAACA 
AGAAATGAAA AATGGAGAAG GGCACAGCTT AAATCCTTAT CAAATTTATT AGAAAATCAT
CAGCAAGAAA TATTAAAAGC CTTAAGTCAA GATTTAGGGA AGCCAGCTAC AGAAGCGTTC
TTCGAGATTA TTGCAGTAAA GCAAGAAATA AAACTGGCGC AGAAAAGTTT ATCTAATTGG
ATGAAGACGA GACAAATCAA TGTGCCTGTC TCTCTTAAAC CAGCTCAAGC ATTGGTCCAG
CCGGATCCGT TGGGCTGCAT TTTGATAATT GGGCCATGGA ATTATCCTTT TTCGCTTACC
CTTCAACCAC TAGTAGGAGC ATTAGCCGCT GGAAACACTG CTGTTTTAAA GCCATCAGAG
CATGCTCCTA ACGTTTCAAA TCTGATAAAA AAACTTATAG AAGAATATTT TCCACCAGAG
ATCGTGCAAG TTTTTGAAGG AGATGGAAAT ATTGCTGCTG ATTTAATGAC TCGACAATTT
GATCACGTCT TTTTTACAGG TGGAGAAAAT ATAGGAAAAA AAGTAATGGA AGCCGCCTCA
AAAAACCTCA CTCCAGTAAC TTTAGAACTT GGTGGCAAAA GCCCAGCTGT TGTTATCGAT
GGTGCAAATC TAGAAGTAAC TGCAAAGAGA GTTATATGGG GAAAAAGTTT AAACGCTGGT
CAAACATGTA TTGCTCCAGA TCATTTACTG GTTGAGAATA AACTTTTTGA TTCATTAATT
TCTAATTTAA TAAATTCGAT CAATGATTTC TACGGAAATA CGCCTTTAGA TTCAAAGCAT
CTGGGGAGCA TTATTAATGA AAAGCAATTT AATAGACTTA ATAATTTACT AACACAAGCT
AAAAAGAATA ATCAGATAAT CTATGGAGGA GATAGCAATG AAAAAGAGAA AAGAATTAGC
CCTACATTGA TCAAAATTGA CAATAGAAAT GATCCTCTTA TGAAGGAAGA ACTTTTCGGC
CCATTGCTGC CTATTTTGAG TATTAAAAAT CTCGACCAAG CTATTTCAGA TTTCAAGTTA
TTACCTAAAC CCCTAGCTTT ATATCTTTTT GGAGGAGGTG AGAAAGAACA AGGCAAAGTA
CTCTCAATGA CCTCTTCAGG AGGTGTTTGT TTTAATGATG TTGTTCTACA AGCAGGGATA
CCTGAACTGC CTTTTGGAGG TGTCGGAACA AGTGGCATGG GTAAATACCA CGGTAAAGCA
GGTTTTGATA ACTTTACTCA TTACAAATCA GTCCTAAAAA GACCTTTTTG GTTAGATCTA
AACTTCAGAT ACCCTCCGTA TAAGTTAGAT TTGTCTTTAC TTAATAAATT AATAGGTTAA
 
Protein sequence
MSLENFVLNQ LQDLVLSGKT RNEKWRRAQL KSLSNLLENH QQEILKALSQ DLGKPATEAF 
FEIIAVKQEI KLAQKSLSNW MKTRQINVPV SLKPAQALVQ PDPLGCILII GPWNYPFSLT
LQPLVGALAA GNTAVLKPSE HAPNVSNLIK KLIEEYFPPE IVQVFEGDGN IAADLMTRQF
DHVFFTGGEN IGKKVMEAAS KNLTPVTLEL GGKSPAVVID GANLEVTAKR VIWGKSLNAG
QTCIAPDHLL VENKLFDSLI SNLINSINDF YGNTPLDSKH LGSIINEKQF NRLNNLLTQA
KKNNQIIYGG DSNEKEKRIS PTLIKIDNRN DPLMKEELFG PLLPILSIKN LDQAISDFKL
LPKPLALYLF GGGEKEQGKV LSMTSSGGVC FNDVVLQAGI PELPFGGVGT SGMGKYHGKA
GFDNFTHYKS VLKRPFWLDL NFRYPPYKLD LSLLNKLIG
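
Both sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage for a nucleotide string. The helper names `join_blocks` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input, so the GC value it yields is for that line alone and is not the whole-gene figure.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First printed line of the gene sequence from this record (sample input only).
first_line = "ATGTCATTAG AAAATTTCGT ACTTAATCAA TTACAAGATC TAGTTCTATC TGGCAAAACA"
seq = join_blocks(first_line)

print(len(seq))         # 60 bases: six blocks of 10
print(gc_percent(seq))  # GC percentage of this one line
```

Passing the full 23-line gene sequence through `join_blocks` should give a string whose length can be compared with the Gene Length field.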