Gene P9303_19251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19251 
Symbol 
ID4776944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1693029 
End bp1695542 
Gene Length2514 bp 
Protein Length837 aa 
Translation table11 
GC content57% 
IMG OID640087435 
Productbile acid beta-glucosidase 
Protein accessionYP_001017932 
Protein GI124023625 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4354] Predicted bile acid beta-glucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.636863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTTT TTGGCGTTTC AGCTTTCAGA TCCTTATTGA AGCGTGGATC GAGCTTGAAG 
CCTTGGAGTG TGCCTGCCGC TAGCTGGAGC CGTGCTTTTG GATTGGCTTG GCAACAGCCA
TACACGGTGC GTTATGCCAG CAATCTCGAT GATGGGCCCT GGCATGGCAT GCCCCTGGGA
GGTTTCGGTG CCGGTTGTAT CGGGCGTAGC CCGCGCGGTG ATTTCAATCT CTGGAATCTT
GATGCTGGTG AACACTGGCA GGGCAGCATT CCTGATTGCC AGTTCGCTTT GTTTGAGCGG
CAGGGGGATC AGGTGCGTGC TCATGCGCTA GCTACAGCGC CACAAACCGA TGACTCTCAA
CCTGAACTAG ACAAGCCTCT CTCCTCTTGG AGCTGGTATC CAGCCAGTAC AGAGCAACGC
ACGACTGGTA GCTATTCAGC TCGCTATCCG CTCAGTTGGA CCCATTACAA AGGAGTCTTC
GCGGCTGAGC TGGTTTGCGA GGCGTTTAGT CCAATCCTTC CTGGTGATTA CCGACGCACC
AGCTACCCCC TGGCCGTGTT TCGTTGGCAG CTGCAGAACC CCACATCGCA ATCTCTGGAA
TTGTCGTTGC TGCTGAGTTG GCGCAACACA TGTGGCTGGT TTACGAACAC CGATCCCTCC
GCCAGCGTTC ATTTCCGCGA TGACGGCAGC CCTGAACACA GTTATGTGCC TGCTATCGGC
CGTGGCGAGG GCCAGTGCAA TCGTTGGATT GATCAACCAG GCCTGATTGG TGTCTTGATG
GAGGGTGAGC GAGCTGACCC CTTGGCTGAG GGGCAAGGGC AGTGGTGTTT GGCTGTCCCG
GATCATCTGC CTGGTGTGGA GGTGATGCGT TGTAGTCGTT GGGATCCCAG CGGAGATGGC
AGCGAACTGT GGAGTTCCTT CGCAAGTGAA GGAACCATTC CAAACAGCAA TGACACGCAC
AAGAGCTTGG CGGGCGAGCA GACCAGCGCT GCTTTAGCGG TGAAGGTCAC CCTGGCGCCT
GGTGAAAGTC TTGAGATCCC TGTGGTGATC AGTTGGGATC TACCGGTGAC GGCTTTCGCT
ACAGGCGTTC GCGATTTGCG GCGTTATACC GACTTTCACG GCAGCGATGG TCAGGCCGCC
GCCGCTCTGG CGGCCGAGGC CTTGCGTGAT TGGCCTGATT GGCGTGAGCA GATTGATGCC
TGGCAGGCGC CGGTGTTGGC GCGTGAGGAT TTGCCGGAAC GCTTGCGAAT GGCTTTGTTC
AATGAGCTCT ACGACTTGGC TAGCGGTGGC AGCCTTTGGA CGGCAGCCCG TCCAGGAGAT
CCGGTGGGGC GCTTTGGGGT TCTTGAGTGC TTCGACTACG CCTGGTACGA AAGTCTTGAT
GTGCGCCTCT ATGGCTCCTT GGCTCTGCTG CAGTTGTGGC CTGAGCTCGA TAAGGCTGTT
TTGCGTAGTT TTGCTCGAGC CATTCCAGCT GCAGATGCCA CGCCACGACC AATCGGCTGG
TACTTCACCC AGGGGCGTGG CCGGGTCGAG GCCCCCCGCA AGAGGGCGGC GGCTACCCCC
CATGATCTGG GAGCACCGAA TGAGTCGCCG TTTGATGCCA CCAACTACAC CGCTTATCAG
GATTGCAATC TCTGGAAAGA TCTGGCCAGT GATTATGTGT TGCAGGTATG GCGAACGTTT
CTTCTCTCAC CCAATGGCGA AGATCTGAGC TTCTTGGCTG AATGCTGGCC GGCGGCAGTG
CAAGCCCTCA GTTATCTCAA GCGCTTTGAT GTCAATCACG ATGGTCTACC CGATAACGGT
GGTGCGCCAG ATCAGACCTT TGACGATTGG CCTTTACAAG GCGTGAGCGC CTATTGCGGT
GCTCTTTGGA TTGCAGCTCT TGAGGCTGCT CTAGCCATGG CACAGCGCTT GCAATTGGAC
TTAGGACTCA ATACGGCAGA GGAGCAACAT CAGTTCAGCG GTTGGCTTGA GCAATCACGG
GCCAACTTCG ATCGGCTGCT TTGGAATGGG GAGTACTACA AGATTGATGC TGAGAGTGGT
ACGCCCGTGG TGATGGCTGA TCAACTCTGC GGCGATTTTT ATGCTCGCTT GCTGGGGCTG
CCCTCTGTTG TTGCTGATGA ACGCAGTCGC AGCAGTCTGA ATGCAGTCAA AGAGGCCTGC
TTTGAAGGCT TTGAAGGCGG CCGTTTGGGA GTTGCCAATG GTTTGTGCCG TGATGGCATG
CCACTTGATC CCAAGGGAAC CCATCCCTTG GAGGTATGGA CTGGCATCAA CTTCGGTCTA
GCGGCCTACT ACCGACTGAT GGGCGATGCA ACGACGGCCA CAGCCATCTG TTCAGCGGTG
GTCAATCAGG TGTATGGCGG TGGTTTGCAA TTCCGCACCC CAGAAGCGAT TACTGCAGTG
AAGACCTACA GGGCTTGCCA CTATTTGCGT GCAATGGCGA TCTGGGCCCT GTGGGCAACA
CATACGGATT GGCAGTTGAT TCCAGGTGCC GAACGGGCAG GGTCTGAGGG GTGA
 
Protein sequence
MALFGVSAFR SLLKRGSSLK PWSVPAASWS RAFGLAWQQP YTVRYASNLD DGPWHGMPLG 
GFGAGCIGRS PRGDFNLWNL DAGEHWQGSI PDCQFALFER QGDQVRAHAL ATAPQTDDSQ
PELDKPLSSW SWYPASTEQR TTGSYSARYP LSWTHYKGVF AAELVCEAFS PILPGDYRRT
SYPLAVFRWQ LQNPTSQSLE LSLLLSWRNT CGWFTNTDPS ASVHFRDDGS PEHSYVPAIG
RGEGQCNRWI DQPGLIGVLM EGERADPLAE GQGQWCLAVP DHLPGVEVMR CSRWDPSGDG
SELWSSFASE GTIPNSNDTH KSLAGEQTSA ALAVKVTLAP GESLEIPVVI SWDLPVTAFA
TGVRDLRRYT DFHGSDGQAA AALAAEALRD WPDWREQIDA WQAPVLARED LPERLRMALF
NELYDLASGG SLWTAARPGD PVGRFGVLEC FDYAWYESLD VRLYGSLALL QLWPELDKAV
LRSFARAIPA ADATPRPIGW YFTQGRGRVE APRKRAAATP HDLGAPNESP FDATNYTAYQ
DCNLWKDLAS DYVLQVWRTF LLSPNGEDLS FLAECWPAAV QALSYLKRFD VNHDGLPDNG
GAPDQTFDDW PLQGVSAYCG ALWIAALEAA LAMAQRLQLD LGLNTAEEQH QFSGWLEQSR
ANFDRLLWNG EYYKIDAESG TPVVMADQLC GDFYARLLGL PSVVADERSR SSLNAVKEAC
FEGFEGGRLG VANGLCRDGM PLDPKGTHPL EVWTGINFGL AAYYRLMGDA TTATAICSAV
VNQVYGGGLQ FRTPEAITAV KTYRACHYLR AMAIWALWAT HTDWQLIPGA ERAGSEG