Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21351 |
Symbol | sqdB |
ID | 4780871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1791502 |
End bp | 1792695 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085432 |
Product | sulfolipid (UDP-sulfoquinovose) biosynthesis protein |
Protein accession | YP_001015955 |
Protein GI | 124026840 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAGTTT TCGTTCTTGG TGGTGACGGC TTCTGCGGAT GGCCTTGTGC AGTAAATCTT GCTGACAAAG GTCATGATGT ATTCATCGTG GATAATCTGA GTCGTCGAAA AATCGATATA GACCTTGAAG TTGAATCCTT AACTCCAATT ACAAGTATTG GAGAAAGGAT TAAAGCTTGG TCTGAGATAG GCGGAAAACC TATTCAATTT ATTCATTTAG ACCTTGCCTC TGAATATCAA AAGCTTTTAG ATCTGTTAAT CGAGGAAAAG CCGGATTCAA TAATTCATTT TGCTGAACAA CGTGCTGCTC CATATTCCAT GAAAAGCAGC GCAACCAAGA GATATACAGT TGATAACAAT GTCAATGGTA CTCATAATCT CCTTGCCGCA ATTGTTGAAT CTCAATTAGA TATTCACATT GTTCATCTTG GGACAATGGG AGTTTATGGC TATGGATCTC ATAGAGGCGC GACCATTCCT GAAGGTTACT TAAAAGTGGA AGTACCCCAA CCAGATGGCA GTCGTTTTGA GGAAGAAATC CTTCATCCAG CAAGTCCTGG CAGCGTTTAT CACATGACAA AAACGCTTGA TCAATTGCTC TTTCTTTACT ACAACAAAAA TGACCAGATC AGAATCACTG ATCTTCATCA AGGAATTGTA TGGGGAACGA ATACTGATGT CACAAGTCGT GACCCAAGAT TGACTAATCG ATTTGACTAT GACGGTGATT ATGGAACAGT GTTAAATCGA TTTTTAATGC AGGCGGCCAT TGGATACCCA TTAACTGTTC ATGGTACTGG TGGACAAACA AGAGCTTTTA TACATATTCA AGATTCAGTA AAATGTGTTC AATTGGCACT CGAGAACCCT CCTGAGAAAG GTGAAAGGGT CAAAATTTTC AACCAAATGA CGGAAAGTCA CCAAGTTGGG GAATTAGCTA AAAAAGTAGC CTCTCTCACT GGAGCAAAAA TTAATTATCT GCCAAACCCA AGAAACGAAG CGGTTGAAAA TGATTTGATA GTTGATAATA GATGTTTTAT TGAGCTTGGA TTAGATCCGA CAACTCTCGA TAATGGACTT TTAGAAGAAG TGGTTAATGT CGCGAAAAAA TTCTCCAATC GCTGTGATTT AAAACGAATA CCTTGTGTAT CTGCATGGAC ATCAACCCAA GCAAAAGCAA TCCATAAATC CTAA
|
Protein sequence | MKVFVLGGDG FCGWPCAVNL ADKGHDVFIV DNLSRRKIDI DLEVESLTPI TSIGERIKAW SEIGGKPIQF IHLDLASEYQ KLLDLLIEEK PDSIIHFAEQ RAAPYSMKSS ATKRYTVDNN VNGTHNLLAA IVESQLDIHI VHLGTMGVYG YGSHRGATIP EGYLKVEVPQ PDGSRFEEEI LHPASPGSVY HMTKTLDQLL FLYYNKNDQI RITDLHQGIV WGTNTDVTSR DPRLTNRFDY DGDYGTVLNR FLMQAAIGYP LTVHGTGGQT RAFIHIQDSV KCVQLALENP PEKGERVKIF NQMTESHQVG ELAKKVASLT GAKINYLPNP RNEAVENDLI VDNRCFIELG LDPTTLDNGL LEEVVNVAKK FSNRCDLKRI PCVSAWTSTQ AKAIHKS
|
| |