Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_26161 |
Symbol | |
ID | 4776241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2308162 |
End bp | 2309811 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640088138 |
Product | beta-N-acetylglucosaminidase |
Protein accession | YP_001018611 |
Protein GI | 124024304 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.407902 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCCGC CGAGCCAGAG CCCCCTTCGA CGTCAAGTCG CTGAGTTGCT AGTGGTGCGA GCAAGTGGTC ATGCCAGCGA TGATCAACGC CGTTATCCAA AATGGGAACT AAGCAACGCT GAGCTAAAAC GTCTGCTCGC TGAAGGTGTG GGCGGTGTGA TTTTGCTGGG CGGCACAAGC ACCGAAATAA GCCATCGCTG CAAGAGACTT AGACAATGGG CGAAAGCACC CCTCCTGTTA TGTGCCGATG TGGAAGAGGG TGTCGGTCAA CGCTTTGAAG GCGGTACTTG GCTCGTACCG CCCATGGCCC TTGGACGGCT CTATCAAGAA GACCAACGTC GTGCTGTCAA TCTGGCCGAG CGCTACGGCC GCTGCACAGG ACACCAAGCT CGACGCTGCG GACTCAACTG GGTGTTAGCA CCCGTCTGCG ATGTCAACAA CAACCCAGCC AACCCTGTCA TCAATGTGCG GGCATGGGGA GAAGACACTG CCACCGTCTC TGCTTTGGCC TGTGCTTTCC AACAAGGCCT TGCAGCAGAA GGTGTGCTGG GATGCGCCAA ACACTTCCCA GGTCATGGCA ATACGGGAAT GGATTCCCAT CTGCAGTTGC CTGTACTGGA TGACAACCTC CGCCAGCTTA TGGAGCTTGA ACTGGTGCCT TTCCAGGCCG TCATGAAAGC CGGCATCGAC AGCATCATGA CGGCCCATCT ATTGATGAGA AACCTCGACG CCTCCTGTCC GGCGACCCTG TCTCCAGCTG TTCTCCAAGA CCTTTTACGC CGCCAACTCA AGTTCGAGGG CTTGGTCGTG ACCGATGCCC TGGTCATGCG AGCCATCACT CAGTCCTATA GCGCTGGCGA AGCCGCCGTG ATGGCCTTTG CCGCCGGTGC TGACCTGATT TTAATGCCCG AGAACGCAGA TGACGCTATC GAGGCCCTAT GCGAGGCCCT CCAGTCGGGC CAAATTCCAA TGCAACGTTT GCACGCCTCC CAAGAGCGCC GGCGAGAAGC CTTACAAAAG GTTGGTGTCT CCACTGCAAA GCTGGCCCTC AAAGACAGCA CAAGCATTGA CAAACCCCTG GAACGAGACG AAGACCGTGC TCTCGCCAGC GAACTGGTGA CTGCATCGCT AAAAATCCAC CATCCAGGAC CGGTCACCCC AACTGAGTCA GGCATCAATC TCCTGCGGGT GGATGGAGTC TTGCCCTGCT CGGTACTCAC AGCCACGGCC CCAGCACTCG TACTGCCCTC TGAAGCTGGA TTTCAAAGTT TGCTCTCTCA CCCCCTAGGG ATCTCTCCCT GGCAAGACGA CCCAGATCAG CCTTTAGCTC TAGAACGATT GGGCATCGGC CCAGTAATGC TGCAACTTTT CTTACGCGGG AACCCCTTCC GTGGCGACCA GGATCGCCAC GAGCCATGGG TGGCAACCGT CAAGCAACTC CAACAGCAAA AACGCCTCGC AGGCCTAGTT GTCTACGGCA GTCCTTACAT CTGGGATGAG CTGCTTGAAG TGTTAAATAT CGGCATCCCC GCGGCTTACA GCCCTGGTCA AATGCCTGAA GCGCAGCGCC AGGTTCTTAC CTGCCTGCTA CAACCTGCGC AGGTGCAATC AAGTGCCCAA ACGCCGCTGT TCCAAGACTT CACTGACTGA
|
Protein sequence | MNPPSQSPLR RQVAELLVVR ASGHASDDQR RYPKWELSNA ELKRLLAEGV GGVILLGGTS TEISHRCKRL RQWAKAPLLL CADVEEGVGQ RFEGGTWLVP PMALGRLYQE DQRRAVNLAE RYGRCTGHQA RRCGLNWVLA PVCDVNNNPA NPVINVRAWG EDTATVSALA CAFQQGLAAE GVLGCAKHFP GHGNTGMDSH LQLPVLDDNL RQLMELELVP FQAVMKAGID SIMTAHLLMR NLDASCPATL SPAVLQDLLR RQLKFEGLVV TDALVMRAIT QSYSAGEAAV MAFAAGADLI LMPENADDAI EALCEALQSG QIPMQRLHAS QERRREALQK VGVSTAKLAL KDSTSIDKPL ERDEDRALAS ELVTASLKIH HPGPVTPTES GINLLRVDGV LPCSVLTATA PALVLPSEAG FQSLLSHPLG ISPWQDDPDQ PLALERLGIG PVMLQLFLRG NPFRGDQDRH EPWVATVKQL QQQKRLAGLV VYGSPYIWDE LLEVLNIGIP AAYSPGQMPE AQRQVLTCLL QPAQVQSSAQ TPLFQDFTD
|
| |