Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24641 |
Symbol | |
ID | 4776087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2161743 |
End bp | 2164757 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087984 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001018460 |
Protein GI | 124024153 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAGC AGCTTCAAGC CCGACGATTG GCGTGGATTG AAACTTTTCG AAGTCGCTCT CGTTGGGATC TCAAGCCAGG TTGGTATCGC GTAGGGGATC GTGATCGGCG GTCTCTGCTT CAAGATGATT GGGCTCAAAT CCATCGACCA GACTGGGTTG GCAGGGGTTT ATTGGTCTGG CCTCGAGGCG GACTATGGCT GCAATTGCAG CAGGAGATTC TTTGTCCTGA AGACTGGCGA CAAGCTCAAG CAAGTTGTTT GCGGCTGGTC TTGGGTTGGT GGGCTGATCA GATGCGGCTG TGGGTGGATG GTGTGTTGGT GCATGAGGGG GATCTGTTCG ATACCACTTG TCGGTGGAGC CTTCCTGAAT CGATCGGTTG GGAGCGACCT CTAACACTCT TGCTGGAGTT GAGGAGTCCT AGCCATGACG ATGGCGCGTT GATCAACAGT GAGTTGGTGC TGGAACCCAG GGCCGGTGCT GTGGATCTAG ATCAGGTTCT GCTGCCTGAA GCGCTGGCGC TCTATTTAGA GGTTGCAGAG GTGTTGCCGG AGGCTTGGCT TGAGCTGGAT CCGCACAGTC AGGAGGCAAC CGTGGCTGTC GACCAGCAAC TTCGCAATGC AGCGCGGCCG CCGGGGCTGA TCCATTGGAT CGGCCATGCC CATCTTGATT TGGCCTGGCT GTGGTCAGTG GCTGACACTT GGCAGGCGGC AGAGCGCACG TTCCGCTCTG TGCTGCACCT GATGCAGCGA TTTCCTGATT TCCATTTCAG CCACTCCTCG CCAGCTCTCT ATCAGTGGGT TGAACATCAC CGTCCAACTC TTTTTGCTGC GCTTCAGGAT GCAAGTCGGG CAGGTCGTTG GGAGCCGATC AATGGCCCTT GGGTCGAGAC TGATTGTGTG CTGGTGAGCA CTGCTTCGCT TTGGCGACAG TTCACGCTTG GACAGCAGTA CAGCCGATCA GCATTTCCGG AGTGGTCCCA TCACTTGGCC TGGCTTCCAG ACAGCTTTGG CTTTGCCGCT GGTTTGCCAG CGGTAGCTAC CCAGACAGGA ATCCGTTGGT TCTGCACCCA TAAACTGGCC TGGAATGCCA GTAATCCCTT CCCGCATCGA CTGTTTCGTT GGTGCAGTCG TGGCCAGGCT GAGGTGCTAG CGCTGATGCT TCCGCCAATT GGCACTGATG GGAATCCGAT GGCCATGCTT CGCGAGCAGC GGTCTTGGCA GGCGTCTACC GCTGTTGAAG AGGCTTTGTG GGTTCCGGGA GTTGGCGACC ACGGAGGTGG CCCCACTGCA GAAATGCTGG AGCAGCTGCA GCTTTGGGAG GATCATCCTC AGGCTTTGCC CCAGAAGCCT GGCACAGTAA GGTCTTACCT TGAGACGCTT GAGGCCCATA TTGAGACTTT GCCAGTGTGG CGTGATGAGC TCTATTTGGA GCTTCATCGC GGTTGTGCGA CGACCCGTCC CGACCAGAAG CGTCAAAACC GACACCTTGA AAGGCTTTTG AGGGAAGCAG ACCTGGCTAT AGCTTTGCTG GCTTGCCACC TTGGCGATAG GGCTGACATT CCTGGAACCT CTGCATCTTC ATTGCCAGAT TGGCGGCCTT TGCTCTTTCA GCAATTTCAC GACATCCTTC CAGGCACATC GATTCCTGAA GTGTTTGAGC AGGCTAAGCC GGTCTGGCGA TCGTCATGTC GTGAATCAAG GCGGAGCCGC GATCAGCATC TGCAGCAGCT TTTTCGCAGT GGGGTGTCCT GCTCCGATCG ATTCGATCCA GAGTCTTCTG CTGGGTGCTG GGTCTGGATG GGTCTACAGC CACTACAGCG TTGGTCGCCT GTGCTGAGAT TGCCACGAGA TCAATGGGGC AGCGCCGGTA GGAAGCTTCC ATGCCAAGAG GCCAAGGGAG GCGGAACCTG GGTGCAGTTG CCTGTGCAGC AAGGGGTGAC AGCAGTTCCG CTTCAACGTT CTGTCAGTAA GGCCTTAGAT GTTGTTGACG GCTTACCTGT TCGTGGGGCT GTGCAGATTC AGCCCCTTGA TCAAGGCGGT TGGCGGATTG GTAATGGACT TTTGGAAGCT GATTTTGGTG CCGAAGGCTT GCAGCAGCTT TGGGATCGGA ACGGGACACC TCAGTTGGCA GGACCTCTGA TCTGGGGGCG TTTTCGCGAT CGTGGCGAAT TTTGGGACGC CTGGGATCTG GCGACCGATT ATCGCCAGCA TCCATTGGAT CTGAACTGGA ATGGTTCGAT CGAGCTGGTG GAGCGCGGAC CTCTTGTAGC GCGCTTGGTC TTGCGAGGTT GGGCTGGGAG CAGTGCTCTG CGCCTTGATG TGCAGCTGAG AGCTGATTGC CCCTGGCTCG AGCTGCGTCT AGGCGTGGAT TGGCGTCAAA GTCATGAGCT ACTGCGTTTG GAGGTTCCCC TGGCTTGCTC AGCGGTGCGA TGGGCGGCGG ATACCAGCGG TGGAGTCATT GAACGTCCTG CTGAAGCTAT GACGGCTAGG GAAAAAGCAC GTTGGGAGGT CCCTTTGATC TCCTGGTTGG CGAGCGAGTC GGCGGCGCCG GGCGGTGGCC TTGCGGTGTT ATTGGATGGG CCTCAGGGCG TTCAGGTGTG CCCAGACAGG CTTGGTGTTT CTCTGCTTAG AGGGGCTACC TGGCCAGATC CCTCGGCTGA TCGGGGTTGG CATCGTCAGC AATTAGCACT GATGCCTGTT CCCGATGGTT GGAGTCGGGA GGCTGTTCCT CAGGCAGCGT TGGCTTTTCG AGAGCCGGGC TGGTTGGGGC CTTCTGATCT CAAGGTGTCA TGGTCAGGAT TGCCGTCGCT GCCTTCAAAG CTGGTTCCAT TGGCTGTGAC GTATGCCGAA GTCAGGCAGC TGAAGCTCAG CGTGCTTAAT GCCGGTGCAA CACGCCAGTC TTGGGCCGTA GGGAAGACTT GGCGGGTTGG TAGCGCTCAC GAGTCCTGTC TCGGCGAAGC GGTAGAGCTG AAGCCTGGAG AGTTGGCTGA ACTATTGCTT GAGCGTGTCG ATTGA
|
Protein sequence | MQEQLQARRL AWIETFRSRS RWDLKPGWYR VGDRDRRSLL QDDWAQIHRP DWVGRGLLVW PRGGLWLQLQ QEILCPEDWR QAQASCLRLV LGWWADQMRL WVDGVLVHEG DLFDTTCRWS LPESIGWERP LTLLLELRSP SHDDGALINS ELVLEPRAGA VDLDQVLLPE ALALYLEVAE VLPEAWLELD PHSQEATVAV DQQLRNAARP PGLIHWIGHA HLDLAWLWSV ADTWQAAERT FRSVLHLMQR FPDFHFSHSS PALYQWVEHH RPTLFAALQD ASRAGRWEPI NGPWVETDCV LVSTASLWRQ FTLGQQYSRS AFPEWSHHLA WLPDSFGFAA GLPAVATQTG IRWFCTHKLA WNASNPFPHR LFRWCSRGQA EVLALMLPPI GTDGNPMAML REQRSWQAST AVEEALWVPG VGDHGGGPTA EMLEQLQLWE DHPQALPQKP GTVRSYLETL EAHIETLPVW RDELYLELHR GCATTRPDQK RQNRHLERLL READLAIALL ACHLGDRADI PGTSASSLPD WRPLLFQQFH DILPGTSIPE VFEQAKPVWR SSCRESRRSR DQHLQQLFRS GVSCSDRFDP ESSAGCWVWM GLQPLQRWSP VLRLPRDQWG SAGRKLPCQE AKGGGTWVQL PVQQGVTAVP LQRSVSKALD VVDGLPVRGA VQIQPLDQGG WRIGNGLLEA DFGAEGLQQL WDRNGTPQLA GPLIWGRFRD RGEFWDAWDL ATDYRQHPLD LNWNGSIELV ERGPLVARLV LRGWAGSSAL RLDVQLRADC PWLELRLGVD WRQSHELLRL EVPLACSAVR WAADTSGGVI ERPAEAMTAR EKARWEVPLI SWLASESAAP GGGLAVLLDG PQGVQVCPDR LGVSLLRGAT WPDPSADRGW HRQQLALMPV PDGWSREAVP QAALAFREPG WLGPSDLKVS WSGLPSLPSK LVPLAVTYAE VRQLKLSVLN AGATRQSWAV GKTWRVGSAH ESCLGEAVEL KPGELAELLL ERVD
|
| |