Gene P9211_12651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_12651 
Symbol 
ID5730018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1141018 
End bp1142106 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content39% 
IMG OID641285634 
Productglycosyl transferase group 1 
Protein accessionYP_001551150 
Protein GI159903806 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00148492 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACAGTTA AAGCATTAAA GCTTTTATTA GTAAGCACTC CGATTGGTTT TTTAGGTAGT 
GGTCAAGGGG GTGGTGTTGA ATTGACTTTG ATCTCTTTAG TTAAAGGTCT TTTAGAACTT
GGGCATGAAG TTGTTTTAAT TGCTCCAAAT GGTTCAGAAC TTCCAAAAGA ATGCGAAGGC
GTTGAAATTA GATATGTTTC AGGATTTGAT CAACCAAGTT GGCAACATCA AGATTATGGT
TCTCCAGTAA TCATTCCAGC TAACGGAGTA CTTCCAATGC TTTGGGAAAA GGCACTTGAT
GTTGGGAAAG AAGATTTTGA TGCGATTTTG AATTTTGGCT ATGACTGGTT GCCTTTATGG
CTAACACCTC ATGTTGAGCC AAGATTATTT CATTTGATAA GTATGGGAGC TGTTTCAAAA
GTGATTAAGG ATCAGATAAT GAAACTATCT AAGACACAAC ATTCTCGACT TGCTTTTCAT
ACTCATGTTC AGGCTTCAGA TTATGAATTG AGTGATACTC CAACAGTTGT AGGCAATGGC
TTTGATCTTA ATCAATATAA ATTCCAGTCA AAGTCTGATG GACCTTTAGG TTGGGCTGGA
AGAGTTGCTC CTGAAAAAGG ATTAGAAGAT GCAGTGGCTG TAGCGGCTCA TTTTGATGAC
ACTTTGTTGG TATGGGGACT GATTGAAGAT GAAAATTATG CCAGAACAAT TGAAGAATCT
TACCCGCCTG GAACAATTGA TTGGAGAGGC TTCCTTAAAA CAAATGAGTT TCAGGACCAA
CTTGGTAAGT GTAGGGCTTT AATTAATACT CCAAAATGGA ATGAAGCTTA TGGAAATGTT
GTTGTAGAGG CTTTGGCTTG TGGTGTTCCA GTGGTTGCCT ATAAACGAGG CGGCCCAGGT
GAATTAATCC AATCCGGCAG TACAGGCTGG TTAGTCGCTC CTGATGATTT GGATGCCTTA
ATCTCTGCCA CATATCGAGT AAATGAAATT GATCGACTTA AGTGTAGAAA TTGGGCTATA
GACTCAGCTT CTTCCAAAGG ATTTGCTAAA CGAATCACCT CTTGGGTCCA GAAAGATATT
ATGGAATAA
 
Protein sequence
MTVKALKLLL VSTPIGFLGS GQGGGVELTL ISLVKGLLEL GHEVVLIAPN GSELPKECEG 
VEIRYVSGFD QPSWQHQDYG SPVIIPANGV LPMLWEKALD VGKEDFDAIL NFGYDWLPLW
LTPHVEPRLF HLISMGAVSK VIKDQIMKLS KTQHSRLAFH THVQASDYEL SDTPTVVGNG
FDLNQYKFQS KSDGPLGWAG RVAPEKGLED AVAVAAHFDD TLLVWGLIED ENYARTIEES
YPPGTIDWRG FLKTNEFQDQ LGKCRALINT PKWNEAYGNV VVEALACGVP VVAYKRGGPG
ELIQSGSTGW LVAPDDLDAL ISATYRVNEI DRLKCRNWAI DSASSKGFAK RITSWVQKDI
ME