Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_04241 |
Symbol | |
ID | 4779630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 394304 |
End bp | 395683 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640083699 |
Product | putative aldehyde dehydrogenase |
Protein accession | YP_001014253 |
Protein GI | 124025137 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.844488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATTAG AAAATTTCGT ACTTAATCAA TTACAAGATC TAGTTCTATC TGGCAAAACA AGAAATGAAA AATGGAGAAG GGCACAGCTT AAATCCTTAT CAAATTTATT AGAAAATCAT CAGCAAGAAA TATTAAAAGC CTTAAGTCAA GATTTAGGGA AGCCAGCTAC AGAAGCGTTC TTCGAGATTA TTGCAGTAAA GCAAGAAATA AAACTGGCGC AGAAAAGTTT ATCTAATTGG ATGAAGACGA GACAAATCAA TGTGCCTGTC TCTCTTAAAC CAGCTCAAGC ATTGGTCCAG CCGGATCCGT TGGGCTGCAT TTTGATAATT GGGCCATGGA ATTATCCTTT TTCGCTTACC CTTCAACCAC TAGTAGGAGC ATTAGCCGCT GGAAACACTG CTGTTTTAAA GCCATCAGAG CATGCTCCTA ACGTTTCAAA TCTGATAAAA AAACTTATAG AAGAATATTT TCCACCAGAG ATCGTGCAAG TTTTTGAAGG AGATGGAAAT ATTGCTGCTG ATTTAATGAC TCGACAATTT GATCACGTCT TTTTTACAGG TGGAGAAAAT ATAGGAAAAA AAGTAATGGA AGCCGCCTCA AAAAACCTCA CTCCAGTAAC TTTAGAACTT GGTGGCAAAA GCCCAGCTGT TGTTATCGAT GGTGCAAATC TAGAAGTAAC TGCAAAGAGA GTTATATGGG GAAAAAGTTT AAACGCTGGT CAAACATGTA TTGCTCCAGA TCATTTACTG GTTGAGAATA AACTTTTTGA TTCATTAATT TCTAATTTAA TAAATTCGAT CAATGATTTC TACGGAAATA CGCCTTTAGA TTCAAAGCAT CTGGGGAGCA TTATTAATGA AAAGCAATTT AATAGACTTA ATAATTTACT AACACAAGCT AAAAAGAATA ATCAGATAAT CTATGGAGGA GATAGCAATG AAAAAGAGAA AAGAATTAGC CCTACATTGA TCAAAATTGA CAATAGAAAT GATCCTCTTA TGAAGGAAGA ACTTTTCGGC CCATTGCTGC CTATTTTGAG TATTAAAAAT CTCGACCAAG CTATTTCAGA TTTCAAGTTA TTACCTAAAC CCCTAGCTTT ATATCTTTTT GGAGGAGGTG AGAAAGAACA AGGCAAAGTA CTCTCAATGA CCTCTTCAGG AGGTGTTTGT TTTAATGATG TTGTTCTACA AGCAGGGATA CCTGAACTGC CTTTTGGAGG TGTCGGAACA AGTGGCATGG GTAAATACCA CGGTAAAGCA GGTTTTGATA ACTTTACTCA TTACAAATCA GTCCTAAAAA GACCTTTTTG GTTAGATCTA AACTTCAGAT ACCCTCCGTA TAAGTTAGAT TTGTCTTTAC TTAATAAATT AATAGGTTAA
|
Protein sequence | MSLENFVLNQ LQDLVLSGKT RNEKWRRAQL KSLSNLLENH QQEILKALSQ DLGKPATEAF FEIIAVKQEI KLAQKSLSNW MKTRQINVPV SLKPAQALVQ PDPLGCILII GPWNYPFSLT LQPLVGALAA GNTAVLKPSE HAPNVSNLIK KLIEEYFPPE IVQVFEGDGN IAADLMTRQF DHVFFTGGEN IGKKVMEAAS KNLTPVTLEL GGKSPAVVID GANLEVTAKR VIWGKSLNAG QTCIAPDHLL VENKLFDSLI SNLINSINDF YGNTPLDSKH LGSIINEKQF NRLNNLLTQA KKNNQIIYGG DSNEKEKRIS PTLIKIDNRN DPLMKEELFG PLLPILSIKN LDQAISDFKL LPKPLALYLF GGGEKEQGKV LSMTSSGGVC FNDVVLQAGI PELPFGGVGT SGMGKYHGKA GFDNFTHYKS VLKRPFWLDL NFRYPPYKLD LSLLNKLIG
|
| |