Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_15361 |
Symbol | |
ID | 4777712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1337689 |
End bp | 1338906 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640087045 |
Product | NAD binding site |
Protein accession | YP_001017545 |
Protein GI | 124023238 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.298365 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACTGTTT TGGTAGCAGG TGCAGGGCCA GCAGGAGCCC GGCTTGCACA ACGACTTGCT AGCCAGGGGA TTGCGGTAAC TCTTGTTGAA CGCCTAAGAG AAGCCAATCA GAATGCCTTT TCTAGTGCTG CCATCCCTAT CCAGGCGGTG TCTGATCTCT GCATTCCTGC TGAAGCAATT GCCAGCCATT GGAATGGCTG GCAATTGGTG GATCCAGATG GAATTCAACA TCAATGGTGG TCTCAACATG ACCTTGGTGT GGTTCTCAAC TTCGGCTCGC TACGTCATCA GCTCTGGCAG CAAGCTATAG AAGCAGGTGT TGAATTTTTA TTGGGTTGGC GTGTTAACTC CGTTCTTACG GTCAATGACG GCGCAAACGT TGAATTGATT GGTCCAAATG GCTTGTGTCA GAGAAGGAGA GTGAGCTGGG TGGTAGATGC CACAGGCCAT CGCCGCCTTT TATTGGGTTC AACTGCAGCA TCCAAGCCTT CCGATAGCGA TTGCATGCTT GAAGGGGCTG GCGTTGAGTG GATCCTGCAG GGTGATAAAA AAACAACGGC TCTCTGGCGT GACCGGGTTT GTTTTTTTTT GGGATGCCAG TGGATTCAAC ATGGTTATGG CTGGATTTTT CCCATGGCTG GTAATCAGTT GAAGGTAGGA GTTTGCCGTC TACCTCCACC AATGAAGGAG TGCCTCGAGC CGATGAGCAG CATCTTGAAC CGCCTGCTTA TCAAGAACCA GCTTGATGAG CTACCAGTTA TCGATAGACA TGGCGGAATC TTGCGCAGCA GCTTAAGGCG CAGTGAAGCT CACGTTGTTG GCAGGATTGT TGGAGTTGGC GACGCTATTA GTACTGCCAA TTTGCTTGGC GGAGAAGGCA TCCGTCATGC TCTCGTTAGT GCAGAGGTGC TTACTCCTTT GCTTGTTGAA GCCTGCTGTC ATCCATCCAG CTCAAATCAA GACAAAGACA ACAAGGTCTT GATGCAATTT GAACGGCTCC TGCGTCTTCG TTTGGGTTGG CGCTGGAATC TTTCAGGTCG CTTAGCTAAG CGAACATGGT GGGGTTTGTG CGATCAAAAA GGCGATCGAC GCTTGGCACG ATTGATTACT GGGCTGTCAA TGCGAGTCAG CGCTGAAGAT CTGAGCCGTT TGTTGTTCGA ATACCGCTTT GAGCGCTACG GACTCCGTCT TCTTCCTTAT TTAATGGGCT GGCGATGA
|
Protein sequence | MTVLVAGAGP AGARLAQRLA SQGIAVTLVE RLREANQNAF SSAAIPIQAV SDLCIPAEAI ASHWNGWQLV DPDGIQHQWW SQHDLGVVLN FGSLRHQLWQ QAIEAGVEFL LGWRVNSVLT VNDGANVELI GPNGLCQRRR VSWVVDATGH RRLLLGSTAA SKPSDSDCML EGAGVEWILQ GDKKTTALWR DRVCFFLGCQ WIQHGYGWIF PMAGNQLKVG VCRLPPPMKE CLEPMSSILN RLLIKNQLDE LPVIDRHGGI LRSSLRRSEA HVVGRIVGVG DAISTANLLG GEGIRHALVS AEVLTPLLVE ACCHPSSSNQ DKDNKVLMQF ERLLRLRLGW RWNLSGRLAK RTWWGLCDQK GDRRLARLIT GLSMRVSAED LSRLLFEYRF ERYGLRLLPY LMGWR
|
| |