Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_02581 |
Symbol | |
ID | 4778431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 270705 |
End bp | 271874 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085762 |
Product | putative NADH dehydrogenase, transport associated |
Protein accession | YP_001016278 |
Protein GI | 124021971 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCA ACCCGTCAAA CGCCAATGCC GTGGTGGTTG TGGGAGGCGG ATTTGCAGGG CTAACGACCG CGCTCGCTCT CAGCAACTGC CAACCTCGCC CCCCCATCGT GCTGATTGAA CCGCGCCAGC GGTTCGTCTT TCTCCCCTTG CTTTACGAAC TCTTGAGTGG AGAGCTTCAG GCTTGGGAAG TCGCACCTCC TTATCACTCT CTACTCAGTC AACGCGGTAT TGCCCTTCTC GAAGACCGGG TTGAAAGCAT CGACACCAAA GCAAAAACAG TCACCACCAG CTCAGGGCTC AAGCTCAACT ACGCACAACT GGTAATCAGC ACTGGCTCAG CTCCAACAGA TTTCGACATT CCAGGGGTCC GTAAACACGC CCTGATGTTT CACCGCCTAA ACGACGTTGA AGTGCTGCGA CAACGAATTA AAGAACTTCA ATTACGTAGA AACCCTCGTC AGGACCTTGT GATTGTTGGA GCGGGACCAA CTGGCGTAGA ACTGGCCTGC AAGCTGGCAG ATCTCCTTGA CGGGGCTGCT GAACTCCACC TCATTGAGCT TGGCGAACGG GTTCTGCCCA GTGCGAAAGC CTTCAACCAA GAACAAGCCG AGCGAGCACT CAGCAAACGT GGTGTTCACG TTCATTTGCT CACGCAAGTT CGATCAATCT CGACAGACCA GGTTGAACTC CTAAGCAAAC ACAAGGAACC TGCCATCAGT TCAGCAATAA CGCACAGTGG TCTTGTTTGG ACAGCCGGCA CAAGACCAGT AATACCAGCA CTCAACCCCG ACTTTGTCCT TACCGAAGCA AGACTGCCTA TCGATTCGTG CCTACAAGTG ATCGGACTCA GTGATGTGCT TGGCCTTGGC GATGCCACTT ACAACAAAGA CCACTCCTGG CCATCCACAG CCCAGGTCGC ACTTCAACAA GGGGAGATCG CGGCTCGAAA CGTGATGGCA CTACGAGCAA GCAGTCCTCT CCAGCCTTTT GAATTTAAGG ATTTTGGCGA AATGCTCAGC CTGGGAGTAG GGGAAGCCTC CCTCACCGGC ATGGGTTTCA CACTCGCAGG TCCTCTGGCT TTCCAAATAC GCCGGGGTGC CTATCTAACA AAACTACCTG GATTGTCTCT AGGCCTGCGC TCAGGCGGAG CCTGGCTGCT CGGACATTGA
|
Protein sequence | MIPNPSNANA VVVVGGGFAG LTTALALSNC QPRPPIVLIE PRQRFVFLPL LYELLSGELQ AWEVAPPYHS LLSQRGIALL EDRVESIDTK AKTVTTSSGL KLNYAQLVIS TGSAPTDFDI PGVRKHALMF HRLNDVEVLR QRIKELQLRR NPRQDLVIVG AGPTGVELAC KLADLLDGAA ELHLIELGER VLPSAKAFNQ EQAERALSKR GVHVHLLTQV RSISTDQVEL LSKHKEPAIS SAITHSGLVW TAGTRPVIPA LNPDFVLTEA RLPIDSCLQV IGLSDVLGLG DATYNKDHSW PSTAQVALQQ GEIAARNVMA LRASSPLQPF EFKDFGEMLS LGVGEASLTG MGFTLAGPLA FQIRRGAYLT KLPGLSLGLR SGGAWLLGH
|
| |