Gene Information |
Locus tag | P9303_01731 |
Symbol | |
ID | 4776900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 193567 |
End bp | 194607 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640085672 |
Product | putative carbohydrate kinase |
Protein accession | YP_001016193 |
Protein GI | 124021886 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
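The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (consistent with the reported 1041 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
start_bp, end_bp = 193567, 194607
length_bp = gene_length_bp(start_bp, end_bp)
print(length_bp)                      # 1041
print(protein_length_aa(length_bp))   # 346
```

Both results agree with the Gene Length and Protein Length fields of the record.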
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGGA CCAATCCGGT TCACAGCAAC GAGCCTGAGG TGATTTGCTT GGGCGAGGCG TTGGTGGATC GTCTTGGACC TCTGGGGGGA GATCCAGTTG TGGATCAGCC TGTGGAAGAT TGCCTAGGGG GAGCCCCTGC CAACGTGGCT TGTGGCTTAG CTCGGCTTGG TAGCAAGGTG GCTTTTTTAG GACGTCTTGG AGACGATTCC ATCGGGGCCA GGTTTCGAGA ATTGTTTGAT ACTCGAGGCG TGAACCTTGC TGGTCTGCAG ACCGATCTGC GTCGGCCAAG TCGCATTGTG TTGGTGCGTC GTGATCTCGA TGGGGAGAGG GTTTTCCAGG GTTTTGCAGG CGACCGCGGA GACGGTTTTG CTGATCAGGC TCTGTCTTTA GATGAGTTGG CGGCCAGCTG GCCTTTGCTG GTAGGTAAGG CGAGCTGGTT GTTGATCGGC TCCATTCCTT TGGCCACACC AGCTTCCGCC CAAGCCTTGC TTTGGTGCGT TGAGCAGGCA CAAACAGCAG GGATTGAGAT AGCCCTGGAT GTCAACTGGC GTTCAACCTT CTGGGATCCT GGCCGTTCTC CCGATAGTGG CCCTGATGAG AAGGCCTTGC AGGCGATTGC GCCGCTTCTT GAGCGTGCTT CTCTGCTCAA ACTGGCTAGG GAAGAGGCCG TTTGGTTCTT TGACACTGAC GACCCTGCAG TGATTGCGCG ATCCCTTCCG CAGCAGCCGG ATGTGGTTGT GACCGATGGG GCGCGTCCGG TGCGGTGGTG GATAGGGGGT TGTGTTGGTG AGCTTGCAGC GCTTTCTCCC CCTTCAGTTG TCGACACCAC GGGTGCTGGC GATGCTTTCA CTGCAGGATT GCTGCATCAA TTGTTGATGG ATGTCTCATC TCAACGAGAT TCGATCAAGG TCCGGGAGAT GGTTCGTTTT GCTGCAGCCT GTGGTGCGCT TGTTTGTGGA GGGGCTGGTG GTATCGATCC GCAGCCGTCT CAGATGCAAG TGGAGGAGTT TTTGGGGTCG GTCGCAGGTG ACGTGAACTG A
Protein sequence | MTGTNPVHSN EPEVICLGEA LVDRLGPLGG DPVVDQPVED CLGGAPANVA CGLARLGSKV AFLGRLGDDS IGARFRELFD TRGVNLAGLQ TDLRRPSRIV LVRRDLDGER VFQGFAGDRG DGFADQALSL DELAASWPLL VGKASWLLIG SIPLATPASA QALLWCVEQA QTAGIEIALD VNWRSTFWDP GRSPDSGPDE KALQAIAPLL ERASLLKLAR EEAVWFFDTD DPAVIARSLP QQPDVVVTDG ARPVRWWIGG CVGELAALSP PSVVDTTGAG DAFTAGLLHQ LLMDVSSQRD SIKVREMVRF AAACGALVCG GAGGIDPQPS QMQVEEFLGS VAGDVN
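The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the space-separated 10-base blocks shown above are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which codons may act as start codons; this gene starts with ATG, so that difference is not exercised here). The helper names are hypothetical. A small GC-percentage helper is included for comparison with the GC content field.

```python
# Codon table built in the conventional TCAG order; the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
print(translate("ATGACGGGGA CCAATCCGGT TCACAGCAAC"))  # MTGTNPVHSN
```

Passing the full Gene sequence field to `translate` should give the Protein sequence field, and `gc_percent` on the same input can be compared with the reported GC content.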