Gene P9303_24641 details

Gene Information

Locus tag: P9303_24641
Symbol:
ID: 4776087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2161743
End bp: 2164757
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 57%
IMG OID: 640087984
Product: glycoside hydrolase family protein
Protein accession: YP_001018460
Protein GI: 124024153
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0383] Alpha-mannosidase
TIGRFAM ID:
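
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2161743, 2164757)
print(length)                     # 3015
print(protein_length_aa(length))  # 1004
```

Both results match the Gene Length and Protein Length fields listed above.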


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAGC AGCTTCAAGC CCGACGATTG GCGTGGATTG AAACTTTTCG AAGTCGCTCT 
CGTTGGGATC TCAAGCCAGG TTGGTATCGC GTAGGGGATC GTGATCGGCG GTCTCTGCTT
CAAGATGATT GGGCTCAAAT CCATCGACCA GACTGGGTTG GCAGGGGTTT ATTGGTCTGG
CCTCGAGGCG GACTATGGCT GCAATTGCAG CAGGAGATTC TTTGTCCTGA AGACTGGCGA
CAAGCTCAAG CAAGTTGTTT GCGGCTGGTC TTGGGTTGGT GGGCTGATCA GATGCGGCTG
TGGGTGGATG GTGTGTTGGT GCATGAGGGG GATCTGTTCG ATACCACTTG TCGGTGGAGC
CTTCCTGAAT CGATCGGTTG GGAGCGACCT CTAACACTCT TGCTGGAGTT GAGGAGTCCT
AGCCATGACG ATGGCGCGTT GATCAACAGT GAGTTGGTGC TGGAACCCAG GGCCGGTGCT
GTGGATCTAG ATCAGGTTCT GCTGCCTGAA GCGCTGGCGC TCTATTTAGA GGTTGCAGAG
GTGTTGCCGG AGGCTTGGCT TGAGCTGGAT CCGCACAGTC AGGAGGCAAC CGTGGCTGTC
GACCAGCAAC TTCGCAATGC AGCGCGGCCG CCGGGGCTGA TCCATTGGAT CGGCCATGCC
CATCTTGATT TGGCCTGGCT GTGGTCAGTG GCTGACACTT GGCAGGCGGC AGAGCGCACG
TTCCGCTCTG TGCTGCACCT GATGCAGCGA TTTCCTGATT TCCATTTCAG CCACTCCTCG
CCAGCTCTCT ATCAGTGGGT TGAACATCAC CGTCCAACTC TTTTTGCTGC GCTTCAGGAT
GCAAGTCGGG CAGGTCGTTG GGAGCCGATC AATGGCCCTT GGGTCGAGAC TGATTGTGTG
CTGGTGAGCA CTGCTTCGCT TTGGCGACAG TTCACGCTTG GACAGCAGTA CAGCCGATCA
GCATTTCCGG AGTGGTCCCA TCACTTGGCC TGGCTTCCAG ACAGCTTTGG CTTTGCCGCT
GGTTTGCCAG CGGTAGCTAC CCAGACAGGA ATCCGTTGGT TCTGCACCCA TAAACTGGCC
TGGAATGCCA GTAATCCCTT CCCGCATCGA CTGTTTCGTT GGTGCAGTCG TGGCCAGGCT
GAGGTGCTAG CGCTGATGCT TCCGCCAATT GGCACTGATG GGAATCCGAT GGCCATGCTT
CGCGAGCAGC GGTCTTGGCA GGCGTCTACC GCTGTTGAAG AGGCTTTGTG GGTTCCGGGA
GTTGGCGACC ACGGAGGTGG CCCCACTGCA GAAATGCTGG AGCAGCTGCA GCTTTGGGAG
GATCATCCTC AGGCTTTGCC CCAGAAGCCT GGCACAGTAA GGTCTTACCT TGAGACGCTT
GAGGCCCATA TTGAGACTTT GCCAGTGTGG CGTGATGAGC TCTATTTGGA GCTTCATCGC
GGTTGTGCGA CGACCCGTCC CGACCAGAAG CGTCAAAACC GACACCTTGA AAGGCTTTTG
AGGGAAGCAG ACCTGGCTAT AGCTTTGCTG GCTTGCCACC TTGGCGATAG GGCTGACATT
CCTGGAACCT CTGCATCTTC ATTGCCAGAT TGGCGGCCTT TGCTCTTTCA GCAATTTCAC
GACATCCTTC CAGGCACATC GATTCCTGAA GTGTTTGAGC AGGCTAAGCC GGTCTGGCGA
TCGTCATGTC GTGAATCAAG GCGGAGCCGC GATCAGCATC TGCAGCAGCT TTTTCGCAGT
GGGGTGTCCT GCTCCGATCG ATTCGATCCA GAGTCTTCTG CTGGGTGCTG GGTCTGGATG
GGTCTACAGC CACTACAGCG TTGGTCGCCT GTGCTGAGAT TGCCACGAGA TCAATGGGGC
AGCGCCGGTA GGAAGCTTCC ATGCCAAGAG GCCAAGGGAG GCGGAACCTG GGTGCAGTTG
CCTGTGCAGC AAGGGGTGAC AGCAGTTCCG CTTCAACGTT CTGTCAGTAA GGCCTTAGAT
GTTGTTGACG GCTTACCTGT TCGTGGGGCT GTGCAGATTC AGCCCCTTGA TCAAGGCGGT
TGGCGGATTG GTAATGGACT TTTGGAAGCT GATTTTGGTG CCGAAGGCTT GCAGCAGCTT
TGGGATCGGA ACGGGACACC TCAGTTGGCA GGACCTCTGA TCTGGGGGCG TTTTCGCGAT
CGTGGCGAAT TTTGGGACGC CTGGGATCTG GCGACCGATT ATCGCCAGCA TCCATTGGAT
CTGAACTGGA ATGGTTCGAT CGAGCTGGTG GAGCGCGGAC CTCTTGTAGC GCGCTTGGTC
TTGCGAGGTT GGGCTGGGAG CAGTGCTCTG CGCCTTGATG TGCAGCTGAG AGCTGATTGC
CCCTGGCTCG AGCTGCGTCT AGGCGTGGAT TGGCGTCAAA GTCATGAGCT ACTGCGTTTG
GAGGTTCCCC TGGCTTGCTC AGCGGTGCGA TGGGCGGCGG ATACCAGCGG TGGAGTCATT
GAACGTCCTG CTGAAGCTAT GACGGCTAGG GAAAAAGCAC GTTGGGAGGT CCCTTTGATC
TCCTGGTTGG CGAGCGAGTC GGCGGCGCCG GGCGGTGGCC TTGCGGTGTT ATTGGATGGG
CCTCAGGGCG TTCAGGTGTG CCCAGACAGG CTTGGTGTTT CTCTGCTTAG AGGGGCTACC
TGGCCAGATC CCTCGGCTGA TCGGGGTTGG CATCGTCAGC AATTAGCACT GATGCCTGTT
CCCGATGGTT GGAGTCGGGA GGCTGTTCCT CAGGCAGCGT TGGCTTTTCG AGAGCCGGGC
TGGTTGGGGC CTTCTGATCT CAAGGTGTCA TGGTCAGGAT TGCCGTCGCT GCCTTCAAAG
CTGGTTCCAT TGGCTGTGAC GTATGCCGAA GTCAGGCAGC TGAAGCTCAG CGTGCTTAAT
GCCGGTGCAA CACGCCAGTC TTGGGCCGTA GGGAAGACTT GGCGGGTTGG TAGCGCTCAC
GAGTCCTGTC TCGGCGAAGC GGTAGAGCTG AAGCCTGGAG AGTTGGCTGA ACTATTGCTT
GAGCGTGTCG ATTGA
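
The listing above is printed in groups of 10 bases, so the spaces and line breaks have to be removed before the sequence can be measured. A minimal sketch of that clean-up, together with a GC percentage calculation, is shown here. The helper names are hypothetical, and only the first line of the listing is used as sample input; pasting the full listing in its place would let the Gene Length and GC content fields be checked.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here as sample input only.
sample = "ATGCAGGAGC AGCTTCAAGC CCGACGATTG GCGTGGATTG AAACTTTTCG AAGTCGCTCT"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```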
 
Protein sequence
MQEQLQARRL AWIETFRSRS RWDLKPGWYR VGDRDRRSLL QDDWAQIHRP DWVGRGLLVW 
PRGGLWLQLQ QEILCPEDWR QAQASCLRLV LGWWADQMRL WVDGVLVHEG DLFDTTCRWS
LPESIGWERP LTLLLELRSP SHDDGALINS ELVLEPRAGA VDLDQVLLPE ALALYLEVAE
VLPEAWLELD PHSQEATVAV DQQLRNAARP PGLIHWIGHA HLDLAWLWSV ADTWQAAERT
FRSVLHLMQR FPDFHFSHSS PALYQWVEHH RPTLFAALQD ASRAGRWEPI NGPWVETDCV
LVSTASLWRQ FTLGQQYSRS AFPEWSHHLA WLPDSFGFAA GLPAVATQTG IRWFCTHKLA
WNASNPFPHR LFRWCSRGQA EVLALMLPPI GTDGNPMAML REQRSWQAST AVEEALWVPG
VGDHGGGPTA EMLEQLQLWE DHPQALPQKP GTVRSYLETL EAHIETLPVW RDELYLELHR
GCATTRPDQK RQNRHLERLL READLAIALL ACHLGDRADI PGTSASSLPD WRPLLFQQFH
DILPGTSIPE VFEQAKPVWR SSCRESRRSR DQHLQQLFRS GVSCSDRFDP ESSAGCWVWM
GLQPLQRWSP VLRLPRDQWG SAGRKLPCQE AKGGGTWVQL PVQQGVTAVP LQRSVSKALD
VVDGLPVRGA VQIQPLDQGG WRIGNGLLEA DFGAEGLQQL WDRNGTPQLA GPLIWGRFRD
RGEFWDAWDL ATDYRQHPLD LNWNGSIELV ERGPLVARLV LRGWAGSSAL RLDVQLRADC
PWLELRLGVD WRQSHELLRL EVPLACSAVR WAADTSGGVI ERPAEAMTAR EKARWEVPLI
SWLASESAAP GGGLAVLLDG PQGVQVCPDR LGVSLLRGAT WPDPSADRGW HRQQLALMPV
PDGWSREAVP QAALAFREPG WLGPSDLKVS WSGLPSLPSK LVPLAVTYAE VRQLKLSVLN
AGATRQSWAV GKTWRVGSAH ESCLGEAVEL KPGELAELLL ERVD
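
The protein sequence can be reproduced from the gene sequence with the codon assignments of translation table 11. As a rough sketch: table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted, so a standard codon table is enough here because this gene starts with ATG. The `translate` function below is an illustrative helper written with the standard library only, not part of any particular toolkit.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace from the grouped listing is ignored. Alternative start
    codons are not treated specially, and codons with characters other
    than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listing.
print(translate("ATGCAGGAGC AGCTTCAAGC CCGACGATTG"))  # MQEQLQARRL
print(translate("GAGCGTGTCG ATTGA"))                  # ERVD
```

The two outputs match the first ten and last four residues of the protein sequence above.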