Gene P9303_21451 details


Gene Information

Locus tag: P9303_21451
Symbol:
ID: 4777324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1908072
End bp: 1909250
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 640087653
Product: putative glycosyl transferase, group 1
Protein accession: YP_001018145
Protein GI: 124023838
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
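
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1908072
end_bp = 1909250
gene_length_bp = 1179
protein_length_aa = 392

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 1179 0 392
```

Under these assumptions the span matches the listed gene length (1179 bp), and 393 codons minus the stop codon give the listed 392 aa.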


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTCACA TTGCCTGGCT AGGCAAAAAA ACACCGTTCT GCGGCAACGT CACCTACGGT 
CTCAACACGA CTGAGGCCCT AAGACAACGC GGCCATCAGA CCAGTTTTAT TCACTTCGAC
AATCCAGGCG GCCTTAGCAA CGGCGAGAGC GCACTGTTGG CCAATGACCC AGAAGTGAGC
CTGCCATACC TGGTGAAGTC ACAGGTTTAC ACCATTCCCT TCCCAGGAGC GCAGCGAGAA
CTCAGGGAAT CATTAGAAAG ACTGCAACCC GATCTCGTTC ACGCCAGCCT CACCCTTTCC
CCCCTCGACT TCCGGCTACC AGAACTCTGC GAGCAAATCG GTGTGCCATT GGTGGCAACT
TTCCACCCAC CATTTGATAG CGGCATGCGC CACCTCACGG CTGGCACACA ACAGCTCACG
TATCAGCTCT ACGCCCCTGC CCTGGCTCGC TATGACAAGG TGATCGTCTT CTCCGAGCTG
CAAGCTGAGG TTCTTACCAA ACTTGGAGTA CAGGAACAAC GACTCGCCGT GATCCCCAAT
GGCGTCGATC CTGAATGTTG GGCACCAACA AGTCCCCAAT GCACCAACCC AATGCAGCAA
GAGGTGCTTG GACGTCTGGG AAATGAACGA ATTTTCCTCT ACATGGGACG CATCGCAGCA
GAAAAAAATG TGGAGGCATT GCTGCGCGCT TGGCGGCTTG TAGAGACCAA GGGCTGCCGA
CTGGTCATCG TTGGCGATGG CCCTCTGCGT TCGACCCTGC AAAACAACTC AACCCCAACA
AAAGAAAACG ACGTGCTCTG GTGGGGCTAT GAGTCAGATC TCAATACCAA GGTGGCCCTA
CTGCAATGCG CTGAAGTCTT CCTCCTGCCA AGCCTGGTCG AAGGTCTGTC TCTGGCACTG
CTAGAGGCCA TGGCAACAGG TACAGCCTGC GTGGCCACTG ACGCCGGGGC TGATGGGGAA
GTGTTGGATG GCGGTGCGGG CATCGTATTA AGCACACAGG GTGTCACCAG CCAATTACGC
ACCCTGCTGC CGGTGCTCCG CGATCAGCCT GTACTAACAG CCGAACTGGG TCGCCGCGCC
CGTATGCGCG TGCTGGAGCG ATACACCATC ACCCGCAACA TCGACGACCT GGAAACGCTC
TACCGCGGCT TATTAGGGGC GACAAAGATG GCGGCCTAA
 
Protein sequence
MAHIAWLGKK TPFCGNVTYG LNTTEALRQR GHQTSFIHFD NPGGLSNGES ALLANDPEVS 
LPYLVKSQVY TIPFPGAQRE LRESLERLQP DLVHASLTLS PLDFRLPELC EQIGVPLVAT
FHPPFDSGMR HLTAGTQQLT YQLYAPALAR YDKVIVFSEL QAEVLTKLGV QEQRLAVIPN
GVDPECWAPT SPQCTNPMQQ EVLGRLGNER IFLYMGRIAA EKNVEALLRA WRLVETKGCR
LVIVGDGPLR STLQNNSTPT KENDVLWWGY ESDLNTKVAL LQCAEVFLLP SLVEGLSLAL
LEAMATGTAC VATDAGADGE VLDGGAGIVL STQGVTSQLR TLLPVLRDQP VLTAELGRRA
RMRVLERYTI TRNIDDLETL YRGLLGATKM AA
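
The protein sequence corresponds to the gene sequence translated with translation table 11 (the NCBI bacterial, archaeal and plant plastid code). Note that the gene begins with GTG, which table 11 permits as an initiation codon and which is read as methionine at the first position. The following is a minimal sketch of that translation, not the tool used to produce this record. The function name `translate_cds` is hypothetical, and the code assumes a complete, in-frame CDS given on its coding strand.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Initiation codons permitted by NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in tens.
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")

    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon still initiates with methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGGCTCACA TTGCCTGGCT AGGCAAAAAA ACACCGTTCT GCGGCAACGT CACCTACGGT"
print(translate_cds(first_line))  # MAHIAWLGKKTPFCGNVTYG
```

Passing the full gene sequence to the same function should reproduce the protein sequence listed above, provided the sequence is copied without alteration.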