Gene A9601_18741 details


Gene Information

Locus tag: A9601_18741
Symbol: sqdB
ID: 4718612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1608540
End bp: 1609733
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 35%
IMG OID: 640079608
Product: sulfolipid (UDP-sulfoquinovose) biosynthesis protein
Protein accession: YP_001010264
Protein GI: 123969406
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
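
The coordinate and length fields above are related by simple arithmetic, which can be useful as a sanity check when parsing records like this one. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1608540
END_BP = 1609733
GENE_LENGTH_BP = 1194
PROTEIN_LENGTH_AA = 397


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1194
    print(expected_protein_length(GENE_LENGTH_BP))  # 397
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields reported above.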


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.110238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGTTA TTGTTCTAGG TGGAGATGGT TTTTGCGGTT GGCCTTGTGC GGTGAATTTA 
GCAGAGCAAA ATCATGATGT AATTATTGTC GACAATTTAA GTCGTAGAAA AATTGATATT
GATCTAGAGG TAGAATCTTT AACTCCAATT TCTTCCATAA CAGAACGACT TTCTGCATGG
GAAGAGACTG GAGGCAAGCC TATGAGATTT CTTAACATGG ATATCTCTAA GCAATATCAA
AAATTACTCA ATTTGCTCAT TGATGAAAAA CCAGATTCCG TGATCCATTT TGCAGAACAA
AGAGCAGCAC CATACTCGAT GAAATCGAGT TTTACCAAAA GATATACAGT AGATAATAAT
GTTAATGGCA CCCACAACTT GCTTGCTGCG ATAGTAGAGA GTAATTTAGA TATTCATGTT
GTTCATTTAG GAACAATGGG AGTCTACGGA TATGGATCAC ATAGAGGTGC AACAATTCCA
GAAGGTTATC TAAAAGTTGA AGTTCCACAA CCTGATGGAA GCCGCTTTGA AGAAGAAATA
TTACACCCTG CAAGCCCAGG TAGTGTCTAC CACATGACTA AAACTTTAGA TCAATTATTA
TTTCTTTACT ACAACAAAAA TGATCTTGTA AGGATCACTG ATCTACACCA AGGCATTGTT
TGGGGAACAA ATACAGAAGC AACTTTAAAA GATCCTAGAT TGACAAACAG ATTTGACTAT
GACGGAGATT ATGGAACTGT TTTAAACAGA TTTCTAATGC AAGCTGCAAT TGGATATCCA
TTAAGTGTTC ATGGGACAGG AGGGCAAACA AGAGCATTTA TACATATAAA AGACTCTGTC
AAATGCGTAC AACTTGCTCT TGAAAATCCT CCAAAATCTG GAGAAAGAGT CAAAATCTTT
AATCAAATGA CTGAGAGTCA TCAAGTTGGA GAACTAGCTA AAAAAGTTGC TTCTCTAACT
GGAGCTGAAA TCAATTATTT ACCAAATCCA AGGAATGAAG CAGTAGAAAA TGATCTAATT
GTTGATAATA AATGCTTTAT AGAATTAGGT TTAAACCCAA CTACTCTTGA TAATGGCTTA
TTAGAAGAAG TTGTTGAAGT TGCTAAAAAA TACTCCAATA GATGTGATCT TAATCGCATA
CCTTGTGTTT CATCCTGGAC AAAAAAACAA GCTGAGGCTA TTAAGACTAA TTAA
 
Protein sequence
MKVIVLGGDG FCGWPCAVNL AEQNHDVIIV DNLSRRKIDI DLEVESLTPI SSITERLSAW 
EETGGKPMRF LNMDISKQYQ KLLNLLIDEK PDSVIHFAEQ RAAPYSMKSS FTKRYTVDNN
VNGTHNLLAA IVESNLDIHV VHLGTMGVYG YGSHRGATIP EGYLKVEVPQ PDGSRFEEEI
LHPASPGSVY HMTKTLDQLL FLYYNKNDLV RITDLHQGIV WGTNTEATLK DPRLTNRFDY
DGDYGTVLNR FLMQAAIGYP LSVHGTGGQT RAFIHIKDSV KCVQLALENP PKSGERVKIF
NQMTESHQVG ELAKKVASLT GAEINYLPNP RNEAVENDLI VDNKCFIELG LNPTTLDNGL
LEEVVEVAKK YSNRCDLNRI PCVSSWTKKQ AEAIKTN
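
The gene sequence begins with GTG while the protein begins with M, which is consistent with the record's translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be translated and its GC content computed in plain Python. The codon table string and the set of start codons reflect my understanding of NCBI translation table 11 and should be checked against the NCBI genetic code listing; all function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiation codons for table 11; an initiating codon is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("GTGAAAGTTA TTGTTCTAGG TGGAGATGGT"))  # MKVIVLGGDG
```

Passing the full gene sequence to `translate` and `gc_content` should allow comparison with the protein sequence and the GC content field listed in this record.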