Gene Information |
Locus tag | A9601_18741 |
Symbol | sqdB |
ID | 4718612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1608540 |
End bp | 1609733 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640079608 |
Product | sulfolipid (UDP-sulfoquinovose) biosynthesis protein |
Protein accession | YP_001010264 |
Protein GI | 123969406 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
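The length fields above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1608540
END_BP = 1609733


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1194 397
```

Both values agree with the Gene Length (1194 bp) and Protein Length (397 aa) rows.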
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.110238 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAGTTA TTGTTCTAGG TGGAGATGGT TTTTGCGGTT GGCCTTGTGC GGTGAATTTA GCAGAGCAAA ATCATGATGT AATTATTGTC GACAATTTAA GTCGTAGAAA AATTGATATT GATCTAGAGG TAGAATCTTT AACTCCAATT TCTTCCATAA CAGAACGACT TTCTGCATGG GAAGAGACTG GAGGCAAGCC TATGAGATTT CTTAACATGG ATATCTCTAA GCAATATCAA AAATTACTCA ATTTGCTCAT TGATGAAAAA CCAGATTCCG TGATCCATTT TGCAGAACAA AGAGCAGCAC CATACTCGAT GAAATCGAGT TTTACCAAAA GATATACAGT AGATAATAAT GTTAATGGCA CCCACAACTT GCTTGCTGCG ATAGTAGAGA GTAATTTAGA TATTCATGTT GTTCATTTAG GAACAATGGG AGTCTACGGA TATGGATCAC ATAGAGGTGC AACAATTCCA GAAGGTTATC TAAAAGTTGA AGTTCCACAA CCTGATGGAA GCCGCTTTGA AGAAGAAATA TTACACCCTG CAAGCCCAGG TAGTGTCTAC CACATGACTA AAACTTTAGA TCAATTATTA TTTCTTTACT ACAACAAAAA TGATCTTGTA AGGATCACTG ATCTACACCA AGGCATTGTT TGGGGAACAA ATACAGAAGC AACTTTAAAA GATCCTAGAT TGACAAACAG ATTTGACTAT GACGGAGATT ATGGAACTGT TTTAAACAGA TTTCTAATGC AAGCTGCAAT TGGATATCCA TTAAGTGTTC ATGGGACAGG AGGGCAAACA AGAGCATTTA TACATATAAA AGACTCTGTC AAATGCGTAC AACTTGCTCT TGAAAATCCT CCAAAATCTG GAGAAAGAGT CAAAATCTTT AATCAAATGA CTGAGAGTCA TCAAGTTGGA GAACTAGCTA AAAAAGTTGC TTCTCTAACT GGAGCTGAAA TCAATTATTT ACCAAATCCA AGGAATGAAG CAGTAGAAAA TGATCTAATT GTTGATAATA AATGCTTTAT AGAATTAGGT TTAAACCCAA CTACTCTTGA TAATGGCTTA TTAGAAGAAG TTGTTGAAGT TGCTAAAAAA TACTCCAATA GATGTGATCT TAATCGCATA CCTTGTGTTT CATCCTGGAC AAAAAAACAA GCTGAGGCTA TTAAGACTAA TTAA
|
Protein sequence | MKVIVLGGDG FCGWPCAVNL AEQNHDVIIV DNLSRRKIDI DLEVESLTPI SSITERLSAW EETGGKPMRF LNMDISKQYQ KLLNLLIDEK PDSVIHFAEQ RAAPYSMKSS FTKRYTVDNN VNGTHNLLAA IVESNLDIHV VHLGTMGVYG YGSHRGATIP EGYLKVEVPQ PDGSRFEEEI LHPASPGSVY HMTKTLDQLL FLYYNKNDLV RITDLHQGIV WGTNTEATLK DPRLTNRFDY DGDYGTVLNR FLMQAAIGYP LSVHGTGGQT RAFIHIKDSV KCVQLALENP PKSGERVKIF NQMTESHQVG ELAKKVASLT GAEINYLPNP RNEAVENDLI VDNKCFIELG LNPTTLDNGL LEEVVEVAKK YSNRCDLNRI PCVSSWTKKQ AEAIKTN
|
| |
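The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino-acid string for table 11, strips the 10-base spacing used above, and assumes the first codon is a valid initiator that is read as methionine (here GTG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, listed in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str, initiator_as_met: bool = True) -> str:
    """Translate a coding sequence up to (not including) the first stop codon.

    Assumption: when initiator_as_met is True, the first codon is a valid
    start codon and is emitted as M regardless of its usual meaning.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and initiator_as_met:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGAAAGTTA TTGTTCTAGG TGGAGATGGT"))  # MKVIVLGGDG
```

Passing the full gene sequence to `translate` should yield the full protein sequence if the record is self-consistent, and `gc_fraction` on the same input can be compared with the GC content row.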