Gene Information |
Locus tag | P9303_17471 |
Symbol | |
ID | 4778116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1527832 |
End bp | 1528857 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640087254 |
Product | putative mRNA binding protein |
Protein accession | YP_001017754 |
Protein GI | 124023447 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
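The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1527832, 1528857)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the computed values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields.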
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATTGGCA AAGAACGCTG CCTTAGACCT CCATCCTGCA GGGCGAAGGG CCTTCAGTTG CTGATCGAAC GCGCTTCAAT ATGCGCGTTT GATGATTCAG CTGTTTTGAA GATTCTGATC ATGGGGGGGA CCCGATTTGT CGGCAAGCCT CTGGTTACTC GACTTCAGGC CCAAGGCCAT GCGCTCACGT TGTTCACTCG TGGCCGCCAT TCTTTGCCAG ATGGTGTGGA ACATCTCAGT GGTGATCGAA CCACCACTGA GGGGCTGAGT CGTCTTCAAG GCCGAAGCTT CGATGTCATC GTCGACAGCT CAGGGCGCAA GCTTGAAGAC AGTCAAAGGG TGGTGGCCTG TACAGGAGAG CCAAAGCATC GTTTCCTCTA TGTCAGTTCC GCTGGCGTCT ATGCGGATTC CGAACACTGG CCACTGAATG AGGAGAGTGC CACCGACCCG AACAGTCGTC ATGCCGGCAA GGCTCAGACC GAATCATGGC TGCTTCAGCA AGGAATTCCC TTTACCAGTT TCCGACCTAC TTATATCTAT GGTCCTGGTA ATTACAACCC GATTGAACGT TGGTTTTTCG ATCGTATCGT CCATAACCGA CCGGTTCCGT TGCCACGAGA TGGCACCACC ATCACCCAAT TGGGGCATGT TGTTGATCTG GCTGATGCCA TGGTTCGTTC CCTTGAGGTG GAGACAGCGA CGAATCGCAT TTACAACTGT TCCAGCAAGC GTGGTATCAC CTTCAGGGGC TTGATTGCAG CGGCAGCAAG GGCTTGTGGC AAAGATCCAA ATACCGTTGA GCTTCGTTCT TTTGATCCTT CAGGCCTGAA TCCCAAAGCT CGTAAGGCCT TCCCGCTGAG GCTGAGTCAT TTCCTTACCG ATATCACCAG GGTGGAGCGG GAATTGGCCT GGCAACCACG CTTTGACCTT GAGACTGGCC TCGAAGATAG CTACTGCAAC GACTACTCCT TGAAGCCAAC GGCTGAACCA GATTTCAGTG CCGATCAATC CTTGATCGGG GTTTGA
Protein sequence | MIGKERCLRP PSCRAKGLQL LIERASICAF DDSAVLKILI MGGTRFVGKP LVTRLQAQGH ALTLFTRGRH SLPDGVEHLS GDRTTTEGLS RLQGRSFDVI VDSSGRKLED SQRVVACTGE PKHRFLYVSS AGVYADSEHW PLNEESATDP NSRHAGKAQT ESWLLQQGIP FTSFRPTYIY GPGNYNPIER WFFDRIVHNR PVPLPRDGTT ITQLGHVVDL ADAMVRSLEV ETATNRIYNC SSKRGITFRG LIAAAARACG KDPNTVELRS FDPSGLNPKA RKAFPLRLSH FLTDITRVER ELAWQPRFDL ETGLEDSYCN DYSLKPTAEP DFSADQSLIG V
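The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the sequence is pasted in the 10-base blocks shown above, and it uses the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid mapping and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 and drop trailing stops."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore an incomplete final codon
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    # Only trailing stop symbols are removed; internal stops stay as "*".
    return protein.rstrip("*")


# First seven codons of the gene sequence in this record.
print(translate("ATGATTGGCA AAGAACGCTG C"))
```

Passing the full Gene sequence field to `translate` and `gc_content` gives values that can be compared against the Protein sequence and GC content fields.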