Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19251 |
Symbol | |
ID | 4776944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1693029 |
End bp | 1695542 |
Gene Length | 2514 bp |
Protein Length | 837 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640087435 |
Product | bile acid beta-glucosidase |
Protein accession | YP_001017932 |
Protein GI | 124023625 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.636863 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTTT TTGGCGTTTC AGCTTTCAGA TCCTTATTGA AGCGTGGATC GAGCTTGAAG CCTTGGAGTG TGCCTGCCGC TAGCTGGAGC CGTGCTTTTG GATTGGCTTG GCAACAGCCA TACACGGTGC GTTATGCCAG CAATCTCGAT GATGGGCCCT GGCATGGCAT GCCCCTGGGA GGTTTCGGTG CCGGTTGTAT CGGGCGTAGC CCGCGCGGTG ATTTCAATCT CTGGAATCTT GATGCTGGTG AACACTGGCA GGGCAGCATT CCTGATTGCC AGTTCGCTTT GTTTGAGCGG CAGGGGGATC AGGTGCGTGC TCATGCGCTA GCTACAGCGC CACAAACCGA TGACTCTCAA CCTGAACTAG ACAAGCCTCT CTCCTCTTGG AGCTGGTATC CAGCCAGTAC AGAGCAACGC ACGACTGGTA GCTATTCAGC TCGCTATCCG CTCAGTTGGA CCCATTACAA AGGAGTCTTC GCGGCTGAGC TGGTTTGCGA GGCGTTTAGT CCAATCCTTC CTGGTGATTA CCGACGCACC AGCTACCCCC TGGCCGTGTT TCGTTGGCAG CTGCAGAACC CCACATCGCA ATCTCTGGAA TTGTCGTTGC TGCTGAGTTG GCGCAACACA TGTGGCTGGT TTACGAACAC CGATCCCTCC GCCAGCGTTC ATTTCCGCGA TGACGGCAGC CCTGAACACA GTTATGTGCC TGCTATCGGC CGTGGCGAGG GCCAGTGCAA TCGTTGGATT GATCAACCAG GCCTGATTGG TGTCTTGATG GAGGGTGAGC GAGCTGACCC CTTGGCTGAG GGGCAAGGGC AGTGGTGTTT GGCTGTCCCG GATCATCTGC CTGGTGTGGA GGTGATGCGT TGTAGTCGTT GGGATCCCAG CGGAGATGGC AGCGAACTGT GGAGTTCCTT CGCAAGTGAA GGAACCATTC CAAACAGCAA TGACACGCAC AAGAGCTTGG CGGGCGAGCA GACCAGCGCT GCTTTAGCGG TGAAGGTCAC CCTGGCGCCT GGTGAAAGTC TTGAGATCCC TGTGGTGATC AGTTGGGATC TACCGGTGAC GGCTTTCGCT ACAGGCGTTC GCGATTTGCG GCGTTATACC GACTTTCACG GCAGCGATGG TCAGGCCGCC GCCGCTCTGG CGGCCGAGGC CTTGCGTGAT TGGCCTGATT GGCGTGAGCA GATTGATGCC TGGCAGGCGC CGGTGTTGGC GCGTGAGGAT TTGCCGGAAC GCTTGCGAAT GGCTTTGTTC AATGAGCTCT ACGACTTGGC TAGCGGTGGC AGCCTTTGGA CGGCAGCCCG TCCAGGAGAT CCGGTGGGGC GCTTTGGGGT TCTTGAGTGC TTCGACTACG CCTGGTACGA AAGTCTTGAT GTGCGCCTCT ATGGCTCCTT GGCTCTGCTG CAGTTGTGGC CTGAGCTCGA TAAGGCTGTT TTGCGTAGTT TTGCTCGAGC CATTCCAGCT GCAGATGCCA CGCCACGACC AATCGGCTGG TACTTCACCC AGGGGCGTGG CCGGGTCGAG GCCCCCCGCA AGAGGGCGGC GGCTACCCCC CATGATCTGG GAGCACCGAA TGAGTCGCCG TTTGATGCCA CCAACTACAC CGCTTATCAG GATTGCAATC TCTGGAAAGA TCTGGCCAGT GATTATGTGT TGCAGGTATG GCGAACGTTT CTTCTCTCAC CCAATGGCGA AGATCTGAGC TTCTTGGCTG AATGCTGGCC GGCGGCAGTG CAAGCCCTCA GTTATCTCAA GCGCTTTGAT GTCAATCACG ATGGTCTACC CGATAACGGT GGTGCGCCAG ATCAGACCTT TGACGATTGG CCTTTACAAG GCGTGAGCGC CTATTGCGGT GCTCTTTGGA TTGCAGCTCT TGAGGCTGCT CTAGCCATGG CACAGCGCTT GCAATTGGAC TTAGGACTCA ATACGGCAGA GGAGCAACAT CAGTTCAGCG GTTGGCTTGA GCAATCACGG GCCAACTTCG ATCGGCTGCT TTGGAATGGG GAGTACTACA AGATTGATGC TGAGAGTGGT ACGCCCGTGG TGATGGCTGA TCAACTCTGC GGCGATTTTT ATGCTCGCTT GCTGGGGCTG CCCTCTGTTG TTGCTGATGA ACGCAGTCGC AGCAGTCTGA ATGCAGTCAA AGAGGCCTGC TTTGAAGGCT TTGAAGGCGG CCGTTTGGGA GTTGCCAATG GTTTGTGCCG TGATGGCATG CCACTTGATC CCAAGGGAAC CCATCCCTTG GAGGTATGGA CTGGCATCAA CTTCGGTCTA GCGGCCTACT ACCGACTGAT GGGCGATGCA ACGACGGCCA CAGCCATCTG TTCAGCGGTG GTCAATCAGG TGTATGGCGG TGGTTTGCAA TTCCGCACCC CAGAAGCGAT TACTGCAGTG AAGACCTACA GGGCTTGCCA CTATTTGCGT GCAATGGCGA TCTGGGCCCT GTGGGCAACA CATACGGATT GGCAGTTGAT TCCAGGTGCC GAACGGGCAG GGTCTGAGGG GTGA
|
Protein sequence | MALFGVSAFR SLLKRGSSLK PWSVPAASWS RAFGLAWQQP YTVRYASNLD DGPWHGMPLG GFGAGCIGRS PRGDFNLWNL DAGEHWQGSI PDCQFALFER QGDQVRAHAL ATAPQTDDSQ PELDKPLSSW SWYPASTEQR TTGSYSARYP LSWTHYKGVF AAELVCEAFS PILPGDYRRT SYPLAVFRWQ LQNPTSQSLE LSLLLSWRNT CGWFTNTDPS ASVHFRDDGS PEHSYVPAIG RGEGQCNRWI DQPGLIGVLM EGERADPLAE GQGQWCLAVP DHLPGVEVMR CSRWDPSGDG SELWSSFASE GTIPNSNDTH KSLAGEQTSA ALAVKVTLAP GESLEIPVVI SWDLPVTAFA TGVRDLRRYT DFHGSDGQAA AALAAEALRD WPDWREQIDA WQAPVLARED LPERLRMALF NELYDLASGG SLWTAARPGD PVGRFGVLEC FDYAWYESLD VRLYGSLALL QLWPELDKAV LRSFARAIPA ADATPRPIGW YFTQGRGRVE APRKRAAATP HDLGAPNESP FDATNYTAYQ DCNLWKDLAS DYVLQVWRTF LLSPNGEDLS FLAECWPAAV QALSYLKRFD VNHDGLPDNG GAPDQTFDDW PLQGVSAYCG ALWIAALEAA LAMAQRLQLD LGLNTAEEQH QFSGWLEQSR ANFDRLLWNG EYYKIDAESG TPVVMADQLC GDFYARLLGL PSVVADERSR SSLNAVKEAC FEGFEGGRLG VANGLCRDGM PLDPKGTHPL EVWTGINFGL AAYYRLMGDA TTATAICSAV VNQVYGGGLQ FRTPEAITAV KTYRACHYLR AMAIWALWAT HTDWQLIPGA ERAGSEG
|
| |