Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_18731 |
Symbol | |
ID | 4777651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1634743 |
End bp | 1635900 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640087382 |
Product | NAD binding site |
Protein accession | YP_001017880 |
Protein GI | 124023573 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.413657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAGT CGTGGGATGT ACTGGTGGTT GGGGCCGGTC CTGCAGGGGG GCTAGCCGCC CTGGATTGCG CAAGGCGAGG ATTAAGGGTG TTACTGGTGG AAAAACGCGC TTTCCCCCGC TGGAAAGTGT GTGGTTGCTG CTTTAACAAA CAGGCTCAGG CGACTCTCGC ATCTCTGGGA CAGAACGATC TAATCATTGA TCGCGGCGGC GTACGACTGC AAACCCTTCG CCTCGGCCTG AATGGTCGTC AAACATCTCT AGCCATTCCC GATGGTTTTG CTCTCTCCAG GGAACAGTTC GATCAGGCAC TGATGGACGC GGTCGTTGAA GCTGGAGCTT CCGTTCGCTG TCAAATGAGC GCCGGGGTCG AAGAAGTCCA ACCAGGCTGG CGGATTGTGC GGCTGAAGGA TCAGCGCAGC GGTCAGCAGA ACCTCGTCAG GGCGCGTGTC GTGCTTGTCG CAGCTGGGCT TGCCCAGCGT TGCTTACCCG AGCAGGATGC TGGCATCACC AGGATCCGTA GTCGTTCCAG GGTTGGAGCC GGTTGTGTTC TTGCTGATGA TGAGAACCAC TACACCGCAG GCGCCATTCA CATGGCGATC GGTGAACGTG GTTATGTGGG TCTGGTGCGC CGAGAGGACG GTTTACTCAA TGTGGCAGCC GCTTTCGATC GGCAGGCGCT AGCCCATGGG CAAGGAGCGG CCGGAGCTAC CCAGGATGTG CTTATGCAGG CTGGTTTCCC ACCACCTGCG GCTCTGAGAC AGGGGCAATG GCAGCTGACG CCAGCTCTTA GTCGCGGCGC TGAGGTTGTT GCCGGAGAGC GCTTTCTGGT GATGGGTGAC GCAGCGGGTT ATGTGGAGCC ATTTACAGGA GAAGGAATTG CCTGGGCCCT CACTGCAGGT GCCGTGGTGG CACCATTTGT TCAAGAAGGC CTCCTGCGCT GGAGCCATGA TCTGGAGAAG CGCTGGACGC GAGAGCTGAA GCTGCGGATC GGTCGTCGCC AGCGGATCTG TCGCACTCTG GCCATGGTGC TGAGGCAGCC AAAGCCGACG AGGGCATTGT TTGAACTGAG CAGCCGCTGG CCGGCACTGT CTGAAACGAT TGTTGCCAGC CTGAACCATG TGACCCTCCC CTCCGCCGGA AGTCAGCAAT GCCTCTGA
|
Protein sequence | MQESWDVLVV GAGPAGGLAA LDCARRGLRV LLVEKRAFPR WKVCGCCFNK QAQATLASLG QNDLIIDRGG VRLQTLRLGL NGRQTSLAIP DGFALSREQF DQALMDAVVE AGASVRCQMS AGVEEVQPGW RIVRLKDQRS GQQNLVRARV VLVAAGLAQR CLPEQDAGIT RIRSRSRVGA GCVLADDENH YTAGAIHMAI GERGYVGLVR REDGLLNVAA AFDRQALAHG QGAAGATQDV LMQAGFPPPA ALRQGQWQLT PALSRGAEVV AGERFLVMGD AAGYVEPFTG EGIAWALTAG AVVAPFVQEG LLRWSHDLEK RWTRELKLRI GRRQRICRTL AMVLRQPKPT RALFELSSRW PALSETIVAS LNHVTLPSAG SQQCL
|
| |