Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_02331 |
Symbol | ndhA |
ID | 4779513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 215682 |
End bp | 216800 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640083498 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_001014062 |
Protein GI | 124024946 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.896857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATTCAG GTATAGACCT TGAAATGAGT TTTACCCAAG GTGTTCAAAA CCTTGGTTTA TCTCATGAAT TAGCACACCT TTTATGGATC CCTCTTCCAA TGCTTTTAGT ATTGGTATCA GCTGTCATTG GTGTATTAGT AACAGTTTGG CTTGAACGAA AAATTTCTGC AGCAGCTCAA CAAAGAATTG GGCCTGAATA TGCAGGTGCA CTTGGTATTC TTCAGCCAAT GGCAGACGGT TTGAAACTCT TAGTAAAAGA AGACATTATT CCAGCAAGGG CTGACAGTGT TCTTTTTACA GTTGGTCCAA TATTAGTTTT AGTGCCAGTT ATTCTATCTT GGCTAATTGT TCCCTTTGGT CAAAATCTGT TAATTAGTAA TGTAGGTATA GGAATATTTT TGTGGATTGC CCTAAGTAGC ATTCAACCAA TTGGTTTACT GATGAGTGGC TACTCTTCCA ATAATAAATA TTCTTTACTA GGTGGACTGA GAGCTGCCGC TCAATCAATT AGTTATGAAA TTCCATTAGC GCTAGCAGTT TTGGCCATAG TTATGATGAG CAATTCATTG AGCACTGTTG ACATAGTTGA GCAACAAAAT ACTGCCGGCT TACTTAGCTG GAATATATGG CGACAACCTG TAGGTTTTAT TATTTTTTGG ATCTGTGCAT TAGCTGAATG CGAGAGACTA CCTTTTGATT TACCAGAGGC AGAAGAAGAG CTTGTAGCTG GTTATCAAAC TGAGTATTCA GGTATGAAAT TTGCTCTGTT TTACTTAGCT GGATACATCA ATCTTGTTTT ATCTGCTTTA CTTGTTTCTG TTCTGTACTT GGGAGGATGG GGGTTTCCAA TCTCAATTGA TTGGTTTTCA TCATTGATAG GTCTTTCAAT TGATAATCCA TTAGTTCAAA TTATTGCTGC GTCTCTTGGA ATTGTAATGA CAATCCTTAA GGCTTATTTA CTGGTTTTTC TAGCAATCTT ATTGAGATGG ACTACTCCAA GAGTTCGGAT TGATCAACTA CTTGATTTAG GCTGGAAGTT TCTTTTACCT ATTTCTTTGG TCAATTTACT TGTAACTGCA TCACTTAAAT TGGCATTTCC GATTACATTT GGCGGATAA
|
Protein sequence | MNSGIDLEMS FTQGVQNLGL SHELAHLLWI PLPMLLVLVS AVIGVLVTVW LERKISAAAQ QRIGPEYAGA LGILQPMADG LKLLVKEDII PARADSVLFT VGPILVLVPV ILSWLIVPFG QNLLISNVGI GIFLWIALSS IQPIGLLMSG YSSNNKYSLL GGLRAAAQSI SYEIPLALAV LAIVMMSNSL STVDIVEQQN TAGLLSWNIW RQPVGFIIFW ICALAECERL PFDLPEAEEE LVAGYQTEYS GMKFALFYLA GYINLVLSAL LVSVLYLGGW GFPISIDWFS SLIGLSIDNP LVQIIAASLG IVMTILKAYL LVFLAILLRW TTPRVRIDQL LDLGWKFLLP ISLVNLLVTA SLKLAFPITF GG
|
| |