Gene P9211_17851 details

Gene Information

Locus tag: P9211_17851
Symbol:
ID: 5731605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1614323
End bp: 1615636
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 37%
IMG OID: 641286171
Product: glycosyl transferase family protein
Protein accession: YP_001551670
Protein GI: 159904326
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
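
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1614323
end_bp = 1615636
reported_gene_length = 1314      # bp
reported_protein_length = 437    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1314 0 437
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.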


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTTTG CAGCTACAAA TGGCAGAAAT CGCCGCGGAA AGGCCAGCTT ATTCTTGATT 
TGCTGCGTTT TATTAGGCTT ATGTCCTTAT TTAATTCCCG CATCAATAAG CTTATTGCCA
GCATTAATAT TGGCTGTATT GCTAGGAATT TATGGTTTGT CGATTGTTTT AAGAGAGATA
GATAATCGTT TTGATCAATC TAAATTGGTT GCTTTTACTC CAGAGGAATA CCAATCTCTC
CCAATGGTAG ATGTTGTGGT TTCAGCTAGA GATGAAGAGA ATGTTGTTGA GAGATTAGTG
GAACGTCTAA TTTCTATTAG ATATCCAAAG GATAAAATAA CTAGAATGAT TATTGATGAT
GGAAGTAAAG ATAAGACGTC CATTTTATTG AATCAATTAA CCCAAACTTT TCAAGAAATT
CAAGTCTTAA ATCGATCAAG ATCTTCCGGA GGAGGCAAGT CTGGCGCACT TAATTATGCG
CTATCTAAAC TGAATGGTAA ATGGATTTTT ATTCTTGATG CAGATGCACA GTTTAATGAT
GATATTTTGT TGAGGATTAT TCCTTTTGCG GAGAAATATG GTTTATCTGC CGTTCAATTA
AGAAAGGCAG TTATAAACTC AGGAAAGAAT TTATTGACTC ATTGCCAGTC TATGGAAATG
GCAATGGATG CTTTTATCCA GCAAGGAAGA ATCTTCGTAG GAGGAGTGGG TGAATTAAGA
GGAAATGGTC AGCTTATAGA GAGAAATATA TTGAATAAAT GTGGAGGCTT TAATGAAGAT
ACTTTGACTG ATGATCTTGA TTTGAGTTTT AGGCTATTGA TTGTTGGCGC TAATGTCGGT
TTGCTTTGGA ATCCTCCTAT TCAGGAAGAA GCAGTTGAAT CTTTAGGGTC TTTGTGGCGA
CAAAGAAATA GATGGGCAGA AGGTGGATTA CAACGTTTTT TTGACTATTG GTCTTTTCTC
TTTTCCGGAA GGCTCGGCTT TGTCAAAAAA CTTGACTTAG GATGTTTCTT CACTCTTCAA
TATGTATTGC CTGTCGTATC TTCTGTAGAC TTGTTAATTG CCACTTGGAC TCACTCATTT
CCCTTGTACT GGCCTTTATC ATGTATTGCT TTGAGTGTAT CTGGAGTGGC CTATTTTCGA
GGTTGCAGAA GAAAATCTCA AGGCCCAGAC TTACCTTCTC CTAAATTACT TAGATTATTA
ATATCTGTCA TTTACCTGAT TCACTGGTTT ATTGTTATTC CTTGGGTAAC AATAAAAATG
GCAGTCTTAC CTAAGAAATT AGTTTGGAAC AAGACTACTC ACCAAGGTAA TTAA
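
For readers who want to recompute the GC content field from the gene sequence, a minimal sketch follows. The helper name `gc_content` is hypothetical. The function assumes the input contains only the bases A, C, G and T plus the whitespace used to group the sequence into blocks of ten, and it is demonstrated on the first two blocks only.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGGCCTTTG CAGCTACAAA"
print(f"{gc_content(first_blocks):.2f}")  # 0.45
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content reported in the record.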
 
Protein sequence
MAFAATNGRN RRGKASLFLI CCVLLGLCPY LIPASISLLP ALILAVLLGI YGLSIVLREI 
DNRFDQSKLV AFTPEEYQSL PMVDVVVSAR DEENVVERLV ERLISIRYPK DKITRMIIDD
GSKDKTSILL NQLTQTFQEI QVLNRSRSSG GGKSGALNYA LSKLNGKWIF ILDADAQFND
DILLRIIPFA EKYGLSAVQL RKAVINSGKN LLTHCQSMEM AMDAFIQQGR IFVGGVGELR
GNGQLIERNI LNKCGGFNED TLTDDLDLSF RLLIVGANVG LLWNPPIQEE AVESLGSLWR
QRNRWAEGGL QRFFDYWSFL FSGRLGFVKK LDLGCFFTLQ YVLPVVSSVD LLIATWTHSF
PLYWPLSCIA LSVSGVAYFR GCRRKSQGPD LPSPKLLRLL ISVIYLIHWF IVIPWVTIKM
AVLPKKLVWN KTTHQGN
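
The protein sequence can be related to the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the usual TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs from it in the permitted start codons, which this sketch does not model. The function name `translate` is hypothetical, and the input is assumed to be the forward-reading CDS starting at its first codon.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCCTTTG CAGCTACAAA TGGCAGAAAT"))  # MAFAATNGRN
```

The output matches the first ten residues of the protein sequence above.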