Gene P9303_13331 details


Gene Information

Locus tag: P9303_13331
Symbol: stpA
ID: 4778777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1141619
End bp: 1142824
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 49%
IMG OID: 640086841
Product: putative glucosylglycerolphosphate phosphatase
Protein accession: YP_001017345
Protein GI: 124023038
COG category:
COG ID:
TIGRFAM ID: [TIGR02399] glucosylglycerol 3-phosphatase
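
The coordinate and length fields above are consistent with each other, and the following minimal sketch shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(1141619, 1142824)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1206 401
```

Both values match the Gene Length and Protein Length fields of this record.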


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.788398
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCACA TCGACCTCGA TCAGCTTTTG GCTGAGATGG TCAGTACTGA AGACCTCTTG 
ATCGTTCAGG ATCTTGACGG CGTTTGTATC CCCCTTGTCA AGGATCCTCT GACGAGAGTT
TTGGATCCTG CCTATGTATG GGCTGCCAAA AGGCTTGAGG GGTCTTTCTC TGTACTGACC
AATGGAGAGC ATGGCGGACA TCGTGGAGTC AATTGTGTAG TCGAGAGGGC TTTGGGTGAT
CCCCAGCTGC CGGCTAAGCA GGGCCTCTAC TTACCGGGAT TAGCCGCAGG CGGTGTGCAA
CTGCAGAATT GTTATGGCGA GATCAGTCAT CCCGGCATTA GTGATAAAGA AATTGCCTTT
CTTGCTGCAC TGCCTAGCCG AATGCAGACC TTGCTTGAAC AGCGTCTCCC TGCATTGCTA
CCCCAGCTCA CCTCTGATGA GATCCAAACC CTCGCAAAGA TGTCAGTTCT TGATACAGAG
CTATCGCCCA CAATTCTCTT AAATGGCTTG TTTAGCCTGA CTCCTGACGA TGTCGGCATT
CAGCAATCCC TGCAAATTAT GTTGCAGGAG TTGATGAATG AATTGATTAA TAGTGCAATA
AGTGCTGGCT TACCTAATTC GTTTTTTCTG CATATTGCCC CCAACATGGG CTGTGATGGA
CAACGGGAGA GGCTCAAGCC TGCTGCCCCT GGCGATGTAG GCACCACTGA TATCCAGTTC
ATGCTCAAAG GTGCTGTCAA GGAAGCCGGA CTATTGGTTT TGATTAACAA GCACATCGCT
AAATACAAAG GCAAAGCTCC TCTCGGCAAA GACTTTGATG TGCGTTCAGC ACCTAAGACT
CATCAGGGAT TGTTGGATCT CTGCCGCAAA CATATTCCTG TTGATCAGAT GCCACTTTTG
ATGGGTGTGG GTGATACGGT TACATCCAAT CCATCTCCTG ATGGAACTGG ATGGTTACGT
GGCGGAAGCG ACCGCGGTTT TCTTACCTTG TTACAGGATT TAGGTAGAAT TTATAACCGT
ACCAATCGAG TGGTTCTTGT CGATAGTAGT GGCGGTGAAG TATACCGACC CAGCTTGGTG
GATGAACGAT TACAAGGGAT CAGTGATCCT GAGGATCCCT TGCATTTTGA TGTACTGGTT
CCTAGCGGCC CCAGCACATA CGTGGCTTGG TTTAGGTCAC TCGCTGAACG ACGTTCAGCT
CGTTGA
 
Protein sequence
MGHIDLDQLL AEMVSTEDLL IVQDLDGVCI PLVKDPLTRV LDPAYVWAAK RLEGSFSVLT 
NGEHGGHRGV NCVVERALGD PQLPAKQGLY LPGLAAGGVQ LQNCYGEISH PGISDKEIAF
LAALPSRMQT LLEQRLPALL PQLTSDEIQT LAKMSVLDTE LSPTILLNGL FSLTPDDVGI
QQSLQIMLQE LMNELINSAI SAGLPNSFFL HIAPNMGCDG QRERLKPAAP GDVGTTDIQF
MLKGAVKEAG LLVLINKHIA KYKGKAPLGK DFDVRSAPKT HQGLLDLCRK HIPVDQMPLL
MGVGDTVTSN PSPDGTGWLR GGSDRGFLTL LQDLGRIYNR TNRVVLVDSS GGEVYRPSLV
DERLQGISDP EDPLHFDVLV PSGPSTYVAW FRSLAERRSA R
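
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first row of the gene sequence is used as sample input, so the printed value describes that 60-base row only and not the whole gene (listed above as 49%).

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGGCCACA TCGACCTCGA TCAGCTTTTG GCTGAGATGG TCAGTACTGA AGACCTCTTG"
seq = join_blocks(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence block to `join_blocks` instead of `first_row` should yield a 1206-base string if the record is internally consistent.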