Gene P9303_26161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_26161 
Symbol 
ID4776241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2308162 
End bp2309811 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content57% 
IMG OID640088138 
Productbeta-N-acetylglucosaminidase 
Protein accessionYP_001018611 
Protein GI124024304 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.407902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACCCGC CGAGCCAGAG CCCCCTTCGA CGTCAAGTCG CTGAGTTGCT AGTGGTGCGA 
GCAAGTGGTC ATGCCAGCGA TGATCAACGC CGTTATCCAA AATGGGAACT AAGCAACGCT
GAGCTAAAAC GTCTGCTCGC TGAAGGTGTG GGCGGTGTGA TTTTGCTGGG CGGCACAAGC
ACCGAAATAA GCCATCGCTG CAAGAGACTT AGACAATGGG CGAAAGCACC CCTCCTGTTA
TGTGCCGATG TGGAAGAGGG TGTCGGTCAA CGCTTTGAAG GCGGTACTTG GCTCGTACCG
CCCATGGCCC TTGGACGGCT CTATCAAGAA GACCAACGTC GTGCTGTCAA TCTGGCCGAG
CGCTACGGCC GCTGCACAGG ACACCAAGCT CGACGCTGCG GACTCAACTG GGTGTTAGCA
CCCGTCTGCG ATGTCAACAA CAACCCAGCC AACCCTGTCA TCAATGTGCG GGCATGGGGA
GAAGACACTG CCACCGTCTC TGCTTTGGCC TGTGCTTTCC AACAAGGCCT TGCAGCAGAA
GGTGTGCTGG GATGCGCCAA ACACTTCCCA GGTCATGGCA ATACGGGAAT GGATTCCCAT
CTGCAGTTGC CTGTACTGGA TGACAACCTC CGCCAGCTTA TGGAGCTTGA ACTGGTGCCT
TTCCAGGCCG TCATGAAAGC CGGCATCGAC AGCATCATGA CGGCCCATCT ATTGATGAGA
AACCTCGACG CCTCCTGTCC GGCGACCCTG TCTCCAGCTG TTCTCCAAGA CCTTTTACGC
CGCCAACTCA AGTTCGAGGG CTTGGTCGTG ACCGATGCCC TGGTCATGCG AGCCATCACT
CAGTCCTATA GCGCTGGCGA AGCCGCCGTG ATGGCCTTTG CCGCCGGTGC TGACCTGATT
TTAATGCCCG AGAACGCAGA TGACGCTATC GAGGCCCTAT GCGAGGCCCT CCAGTCGGGC
CAAATTCCAA TGCAACGTTT GCACGCCTCC CAAGAGCGCC GGCGAGAAGC CTTACAAAAG
GTTGGTGTCT CCACTGCAAA GCTGGCCCTC AAAGACAGCA CAAGCATTGA CAAACCCCTG
GAACGAGACG AAGACCGTGC TCTCGCCAGC GAACTGGTGA CTGCATCGCT AAAAATCCAC
CATCCAGGAC CGGTCACCCC AACTGAGTCA GGCATCAATC TCCTGCGGGT GGATGGAGTC
TTGCCCTGCT CGGTACTCAC AGCCACGGCC CCAGCACTCG TACTGCCCTC TGAAGCTGGA
TTTCAAAGTT TGCTCTCTCA CCCCCTAGGG ATCTCTCCCT GGCAAGACGA CCCAGATCAG
CCTTTAGCTC TAGAACGATT GGGCATCGGC CCAGTAATGC TGCAACTTTT CTTACGCGGG
AACCCCTTCC GTGGCGACCA GGATCGCCAC GAGCCATGGG TGGCAACCGT CAAGCAACTC
CAACAGCAAA AACGCCTCGC AGGCCTAGTT GTCTACGGCA GTCCTTACAT CTGGGATGAG
CTGCTTGAAG TGTTAAATAT CGGCATCCCC GCGGCTTACA GCCCTGGTCA AATGCCTGAA
GCGCAGCGCC AGGTTCTTAC CTGCCTGCTA CAACCTGCGC AGGTGCAATC AAGTGCCCAA
ACGCCGCTGT TCCAAGACTT CACTGACTGA
 
Protein sequence
MNPPSQSPLR RQVAELLVVR ASGHASDDQR RYPKWELSNA ELKRLLAEGV GGVILLGGTS 
TEISHRCKRL RQWAKAPLLL CADVEEGVGQ RFEGGTWLVP PMALGRLYQE DQRRAVNLAE
RYGRCTGHQA RRCGLNWVLA PVCDVNNNPA NPVINVRAWG EDTATVSALA CAFQQGLAAE
GVLGCAKHFP GHGNTGMDSH LQLPVLDDNL RQLMELELVP FQAVMKAGID SIMTAHLLMR
NLDASCPATL SPAVLQDLLR RQLKFEGLVV TDALVMRAIT QSYSAGEAAV MAFAAGADLI
LMPENADDAI EALCEALQSG QIPMQRLHAS QERRREALQK VGVSTAKLAL KDSTSIDKPL
ERDEDRALAS ELVTASLKIH HPGPVTPTES GINLLRVDGV LPCSVLTATA PALVLPSEAG
FQSLLSHPLG ISPWQDDPDQ PLALERLGIG PVMLQLFLRG NPFRGDQDRH EPWVATVKQL
QQQKRLAGLV VYGSPYIWDE LLEVLNIGIP AAYSPGQMPE AQRQVLTCLL QPAQVQSSAQ
TPLFQDFTD