Gene P9211_01281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01281 
Symbol 
ID5730848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp125945 
End bp127576 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content39% 
IMG OID641284471 
ProductBeta-glucosidase-related glycosidase 
Protein accessionYP_001550013 
Protein GI159902669 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.742847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTCGT TTGATCGCCA GGTTCTTCGA CGCAAGGTTT CTGAAATCTT TGTTATACGC 
GCTAGTGGAC ATTCGTTGGA TGCATTGAGG GAATATCCCA ATTGGGAATT GACCAATCAT
CGATTACAGC AATTTCTCGA AGAAGGGGTG GGTGGTGTGA TTTTGTATGG AGGTTCGATT
GAGGAAATCA CAAATCGGTG TGCACAACTT CGAATGTGGG CAGGAAAGCC GATTTTTTTA
TGTGCTGATG TGGAGGAGGG AGTAGGACAA CGATTTCAAG GAGGGACTTG GTTGATCCCT
CCAATGGCTT TAGGAAGGAT TTATCTTAAA GAGCCTGAAT ATGCAATTTC ACTAGCTGAG
CACTATGGAG CTTTAATTGG TTATGAGTCA GTTATTTGTG GATTGAATTG GGTATTAGCT
CCAGTTTGTG ATGTTAATAG TAACCCTCTC AACCCTGTTA TCAATATGAG GGCATGGAGT
GATAATCCTC AGACTGTTGC AGATCTTGCA TGTGCTTTTC ATAGGGGCCT AACTTCTCAA
GGAGTACTGG GATGCGCTAA ACATTTCCCA GGTCATGGCG ATACTAAAGT TGATTCACAT
TTAGAATTGC CAGTTTTAGA TAATGATCTT TCTCGCTTAG CTGAAATAGA ACTTCCACCT
TTTCAGGCTT TAATTCAACA AGGAGTGAGC AGTATTATGA GCGCTCACTT GATTTTAAAT
AGGGTTGATT GTAATTACCC AGTCACTTTC TCAAAAAGAA TTTTGACGGA TCTTTTAAGA
AAGAAAATGT GTTTCGAAGG CATGATTGTC ACCGATGCTT TGGTAATGCG AGCTATATCT
AAGACCTTCA GCAGTGGTTC TGCTGCTGTA ATGGCATTTG AAGCTGGCGC TGATTTGATT
TTGATGCCTC AAAACCCTTC TGAAGCAATA GATGCAATTG TGGAGTCTTT AATTTCTGGA
AGATTACCTA TTTCAAGATT GGAAGATTCT TTACAAAGAC GTCAACTTGC ACTTGCACAG
TTGAACAGTG AAAAACCTGC GACCTCTTGC GAAAAGAATG CTTTTGAAAA TCAAAAAGTT
TCCTTTTTTG CTGAAAAACT TATTGACATT TCTATAGAGT CTAGAAATAC CTTGATAATT
GATAATTATA AATCACTCAT CAATTTAATT CGTGTAGATA ATTTGTACTC TAATCCAATT
TTGAATCATT CATCTCCTGC TCTTGTGATT CCTGAGCAGT ATGGCTTTAG AAATGTAATT
ACTCACCCTT CAGGGATTTC ACCTTGGCAA AATAATGTGA AAGAGCCGCT TGCTTTGGAA
AAGTTTTCAG ATAGCGCTTT TCTTCTCCAG CTTTTTATCA GAGGTAATCC ATTTCAAGGA
GATGAGCCTC TACAAGAGCC TTGGATCTCA GTCATTATGC AATTGCAGCG TTCTAAGCGA
TTAGCGGGTT TGATACTGTA TGGCGATTCC TTTTTATGGA ATGACCTTCA AAATGTTTTG
GAACCTACAG TTCCGTTTGT TTTTAGTCCA GGCCAAATGC CATTGGCTCA GGAAAAAGCA
TTGCAATGTT TATTGGACAG CAAAAAAATG AAAGTTGATA TCTCACCATC TCAATGGGAG
TTTATAAATT AA
 
Protein sequence
MPSFDRQVLR RKVSEIFVIR ASGHSLDALR EYPNWELTNH RLQQFLEEGV GGVILYGGSI 
EEITNRCAQL RMWAGKPIFL CADVEEGVGQ RFQGGTWLIP PMALGRIYLK EPEYAISLAE
HYGALIGYES VICGLNWVLA PVCDVNSNPL NPVINMRAWS DNPQTVADLA CAFHRGLTSQ
GVLGCAKHFP GHGDTKVDSH LELPVLDNDL SRLAEIELPP FQALIQQGVS SIMSAHLILN
RVDCNYPVTF SKRILTDLLR KKMCFEGMIV TDALVMRAIS KTFSSGSAAV MAFEAGADLI
LMPQNPSEAI DAIVESLISG RLPISRLEDS LQRRQLALAQ LNSEKPATSC EKNAFENQKV
SFFAEKLIDI SIESRNTLII DNYKSLINLI RVDNLYSNPI LNHSSPALVI PEQYGFRNVI
THPSGISPWQ NNVKEPLALE KFSDSAFLLQ LFIRGNPFQG DEPLQEPWIS VIMQLQRSKR
LAGLILYGDS FLWNDLQNVL EPTVPFVFSP GQMPLAQEKA LQCLLDSKKM KVDISPSQWE
FIN