Gene P9211_10671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10671 
Symbol 
ID5730338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp963757 
End bp964899 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content40% 
IMG OID641285434 
Producthypothetical protein 
Protein accessionYP_001550952 
Protein GI159903608 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.43083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGGGG CTTTTTGGTG GCATATAGGG CTTGCATGGG GACTCCAATT AGCCGCTCAA 
AAAGCAAAGC GAAATGGTGA CGAGAAAGTC TGGAAAGAGA TTAGAAATAG CCTTGAAGAT
AAAAGTTACT TACAGGAGGG GCCTGATCTA CTTAAAAAAT ATGATTCAAG AGGAATGGTA
AATCAATGGT TTGAAGAATC AAATAAAGCA AACCCAATCT GGAAGACTCA TAAACCGCTG
AAAACTTGGC TCCAACAACC TATCTTGCTA ATTGGAGGAT GGTGGGACCC TCACCTAAGA
GGGATTCTTG ATCTTTATGA AAAATCACTC ATAGTAGGTG GTCAACCAGA TCTGCATATT
GGACCAGCTT CTCATTTGCG ATGGTGGGAA GAAGTACAGC AAATTCATTT AGATTTCTTC
AACAAATATC TTCAACCAAG CAATACTCTT AAGGTCCCAT CTAGCAGACA ACAAAAACTT
TGGAATATTA CAAGCAAAAA ATGGTTTGAC CTACAACCTA TAGACACTAG AAATAGGATT
TGGCATCTAA GTACTGGAGG GAATGCATGC ATAGATTCAA CCGATGGAGA ACTTACTCAA
TTCGGCAAAG GGCAGGGGGA ACTCTCTATT GTTCATGATC CATGGAGACC CGTTTCTTCA
ATAGGAGGTC ACCTAAGTCC AGACCCTGGT ATTGCAAATA GAGCCGAAAT TGACAAGAGA
AACGATGTAG CAACTTTTAC CTCAAAACCA CTTGAAGAAA GAATTCAACT AAAAGGAGTT
CCTAAGCTTG AAATTATTGC AATGGCTGAT AGGAATGGAT TTGATCTATG CGTTGCAATT
TCAATTATTC AGCAAAATTC AAAAGAAGTA CTGCAGATTT CTACAGGAGT ACTTCGTCTA
GTTGGGAATA AAGCCAAAAG TACACTCAAA AGAAATGTGA CGCTGCAGCC ATTATTCGCA
GATATTCATA AAGGAGATCG CCTTAGATTG TCAATATCTG GAGCAGCTTG GCCAGCTATT
GCTATAAACC CAGGAGACCC AAGTTATAAC TGTGGGTCTC CATCTCCATA TTGTCTAGTA
ACGACAATCT CCCTAGAACT TTCTCAGGCC AAGCTAGAGA TCTGTCCACT CTTCTCAAAA
TAA
 
Protein sequence
MGGAFWWHIG LAWGLQLAAQ KAKRNGDEKV WKEIRNSLED KSYLQEGPDL LKKYDSRGMV 
NQWFEESNKA NPIWKTHKPL KTWLQQPILL IGGWWDPHLR GILDLYEKSL IVGGQPDLHI
GPASHLRWWE EVQQIHLDFF NKYLQPSNTL KVPSSRQQKL WNITSKKWFD LQPIDTRNRI
WHLSTGGNAC IDSTDGELTQ FGKGQGELSI VHDPWRPVSS IGGHLSPDPG IANRAEIDKR
NDVATFTSKP LEERIQLKGV PKLEIIAMAD RNGFDLCVAI SIIQQNSKEV LQISTGVLRL
VGNKAKSTLK RNVTLQPLFA DIHKGDRLRL SISGAAWPAI AINPGDPSYN CGSPSPYCLV
TTISLELSQA KLEICPLFSK