Gene Information |
Locus tag | P9303_26911 |
Symbol | ndhA |
ID | 4778149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2374037 |
End bp | 2375191 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640088214 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_001018686 |
Protein GI | 124024379 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
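The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is an untranslated stop; both assumptions are consistent with the reported 1155 bp and 384 aa. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the record above.
length = gene_length_bp(2374037, 2375191)
print(length, protein_length_aa(length))  # 1155 384
```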
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACCA CCTTTGCCAC TGCAACCTCA GGCTTGGTGA GTTCTGGCCT TGATCTGGAG TTGAGCTTTA GTCAGTCATT ACAAGGGCTG GGACTTTCCC CTCAGATGGC CCATCTCATT TGGTTGCCGC TGCCGATGCT GTTGGTGTTG ACCGCGGCGA TGGTGGGCGT TTTGGTCACC GTGTGGCTGG AGCGGAAGAT TTCGGCGGCA GTTCAGCAGC GGATCGGACC GGAATACGCA GGAGCTCTCG GCGTGTTGCA GCCGATGGCC GATGGCTTGA AGTTGCTGGT GAAGGAAGAC GTGATCCCTG TCAGGGCGGA TGGCCTGCTG TTCACTCTTG GTCCGGTCCT GGTGCTTGTG CCTGTGATCC TCTCCTGGCT GATCGTGCCT TTTGGTCAAA ATCTGCTGAT CAGCAATGTG GGGATAGGGA TTTTCCTCTG GATCTCCCTC AGCAGCATTC AGCCCATTGG CCTATTGATG AGTGGCTATG CCTCCAATAA CAAGTACTCA CTCTTGGGTG GGCTGAGGGC TGCAGCTCAA TCGATCAGTT ATGAAATCCC TCTTGCCTTG GCTGTTCTGG CAGTGGTGAT GATGAGCAAT TCGCTCAGCA CGGTCGACAT TGTTGCCCAA CAAAATGGTG CTGGCTTGCT GAGCTGGAAT GTGTGGCGGC AGCCTGTGGG CTTTTTGATC TTTTGGATCT GTGCGCTGGC CGAGTGCGAA CGACTTCCTT TCGATCTCCC TGAAGCGGAG GAAGAGCTGG TGGCGGGTTA TCAGACCGAG TACGCAGGGA TGAAGTTTGC CCTGTTTTAT CTCGGTAGTT ACATCAACCT TGTGCTTTCG TCCTTGTTGG TCGCAGTGCT CTACCTAGGG GGATGGGGCT TCCCCATCCC GGTGGAATGG TTGGCTGGTT GGCTTGGTCA GTCGGTTGAT GCTCCCTTGG TGCAGGTGAT CACTGGCTCC GTTGGCATTG TGATGACAGT GCTGAAGGCT TATCTGCTGG TGTTCTTAGC CATCTTGTTG CGCTGGACTA CTCCACGCGT ACGCATTGAT CAATTGCTTG ATTTGGGCTG GAAGTTTCTT TTGCCGATCG CGCTTGGAAA TCTGCTGATC ACCGCTGCTC TTAAGTTGGC TTTCCCTGTT GCCTTTGGCG GTTGA
|
Protein sequence | MVTTFATATS GLVSSGLDLE LSFSQSLQGL GLSPQMAHLI WLPLPMLLVL TAAMVGVLVT VWLERKISAA VQQRIGPEYA GALGVLQPMA DGLKLLVKED VIPVRADGLL FTLGPVLVLV PVILSWLIVP FGQNLLISNV GIGIFLWISL SSIQPIGLLM SGYASNNKYS LLGGLRAAAQ SISYEIPLAL AVLAVVMMSN SLSTVDIVAQ QNGAGLLSWN VWRQPVGFLI FWICALAECE RLPFDLPEAE EELVAGYQTE YAGMKFALFY LGSYINLVLS SLLVAVLYLG GWGFPIPVEW LAGWLGQSVD APLVQVITGS VGIVMTVLKA YLLVFLAILL RWTTPRVRID QLLDLGWKFL LPIALGNLLI TAALKLAFPV AFGG
|
| |
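The sequences above are displayed in space-separated blocks of ten characters, and the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand). The sketch below is not part of the original record; it shows one way to strip the block spacing, compute GC content, and translate the coding sequence. The helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares for internal codons; table 11 also permits alternative start codons that are read as Met, a case this sketch does not handle (it is not needed here because the gene starts with ATG).

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs in which start codons are allowed.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence above.
print(translate("ATGGTGACCA CCTTTGCCAC"))  # MVTTFA
```

Passing the full Gene sequence field to `translate` and `gc_percent` should allow the result to be compared with the Protein sequence and GC content fields of the record.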