Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1239 |
Symbol | lacZ |
ID | 5711797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1285260 |
End bp | 1287167 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641267151 |
Product | beta-galactosidase |
Protein accession | YP_001532582 |
Protein GI | 159043788 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.527812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0426112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCGCA AGCTGGGCGT CTGCTACTAC CCCGAGCATT GGCCCGAAGA CCAATGGCCG CGGGACGCCG CACGCATGAA GGACGCGGGC CTCACCCTCG TCCGGATCGG GGAATTCGCC TGGTCCCGGC TCGAACCCTC CCCCGGCGAT CTGCGGTTCG ACTGGCTCGA CCGTGCCATT TCCGTTCTGG CCGAGGCCGG GCTGGAGGTC GTTCTGGGCA CCCCCACCGC CACACCGCCG CGCTGGATGC TCGACCGCCA CCCCGACATG CTGGCCGTGG ACGCCCGCGG CCAGCCGCGC AAGTTCGGCT CCCGGCGGCA CTACTGCTTC AGCCATCCCG GCTACCGGGC CGAGGCCGCG CGCATCGCCC GCCTGCTGGG GGAGCGGTAC GGCCGCGACC CCCGCATCGT GGCCTGGCAG ATCGACAACG AATACGGCTG CCACGACACA ACGCTCAGCT ATTCCGACGC CGCCCGGCAT GGGTTTCGCG ACTGGCTCGC CCAACGTTAT CAGTCCACCG ACGCGCTCAA CCGCGCCTGG GGAAACGTGT TCTGGTCTAT GGCGTACGAC CGCTTCGACC AGATCGACCT GCCGAACCTG ACCGTGACCG AGCCGAACCC GGCCCATGCG CTGGCCTTCC GGCGCTACGC ATCGGACATG GTGGTGGCCT TCAACCGCGC GCAGGTCGCG GCCCTGCGCC CCCTGACGGA CGCACCGCTG ATCCACAACT ACATGGGCCG GGTGACCGAG TTTGATCACC ACGCGGTGGG CGCGGACCTC GATATCGCCA GTTGGGACAG CTACCCGATG GGCTTTTTGC TCGACCGGGT CGAAGCACCC GCCGATCACA AGGCGGCCTA TCTCCGCCAG GGCGACCCGG ATTTCCAGGC CTTCCACCAC GACCTCTATC GCGGGGTCGG ACGGGACGGC CGCTGGTGGA TCATGGAACA GCAACCCGGC CCGGTGAACT GGGCGCCCTG GAACCCCGCC CCCCTGCCCG GCATGGTGCG GCTGTGGTCC CACGAAGCCT TCGCCCACGG CGCCGAGGCG GTCTGCTTTT TCCGCTGGCG CCAGGCCCCC TTCGCGCAGG AGCAGATGCA TGCGGGCCTT CTGCGCCCCG ACGACAGCCC GGCCCCAGGG CTGGAGGAGG CCGCGGCCCT GGCCGCAGAC CTCCCCCGGC TTCCCGACGT GTCACCGAGC CGCGCCCCCG TGGCGCTCGT GTTCGACTAC CCCTCCCAAT GGGCGTGGGA GGTGCAGCCC CAGGGCGCGG ATTTCGACTA TTTCGCCCTG TGCTTTGCGA TGTATCGCGG GCTGCGCAAG CTCGGCCTCT CGGTCGACAT CCTGCCCGCG GACCCGGCAC GGCTGGCGGG CCATGACCTG ATCCTTGTTC CCGGGCTTTT GCACCTGTCA GCCGACATGA CCGCATATCT CGCAACGACC CAGGCGCAGG TGCTGGTCGG CCCGCGCGCG GGTTCCAAGA CGCCGGAGAT GTCCATCGCC CTGCCGCTCG GCCCCAATCT GCAAGGGCTC GACGCCACCG TGACCCATGT CGAAACCCTG CCCCCCGGCG CCGAACGCGC GCTGCAACGC GGCGGGGCCG CCGAACGCTG GATCGAGGCC GTCGAGACCC GCGCCGAGAT CCTGGAGGAA ACCACCGAAG GCGCCCCCGT CCTGATCCGC ACCGGACGGC AACACTACCT CGCGGCCTGG CCGGACCCGG AGGCCATGGG CCGCATCCTG CGCGATCTCT GCTCGCGCGC AGGCATCCAG ACCACCGACA TGCCCGAAGG CGTCCGCCAA CGCGTCCACG GCCACCACAA GCTGGTGGTC AATTATTCCT CCGAAATGCG GGTTTTCGAA AACGATGCCT TGCCCCCTGC GGGTCTGGTG TGGAAATTGA TCCCATGA
|
Protein sequence | MTRKLGVCYY PEHWPEDQWP RDAARMKDAG LTLVRIGEFA WSRLEPSPGD LRFDWLDRAI SVLAEAGLEV VLGTPTATPP RWMLDRHPDM LAVDARGQPR KFGSRRHYCF SHPGYRAEAA RIARLLGERY GRDPRIVAWQ IDNEYGCHDT TLSYSDAARH GFRDWLAQRY QSTDALNRAW GNVFWSMAYD RFDQIDLPNL TVTEPNPAHA LAFRRYASDM VVAFNRAQVA ALRPLTDAPL IHNYMGRVTE FDHHAVGADL DIASWDSYPM GFLLDRVEAP ADHKAAYLRQ GDPDFQAFHH DLYRGVGRDG RWWIMEQQPG PVNWAPWNPA PLPGMVRLWS HEAFAHGAEA VCFFRWRQAP FAQEQMHAGL LRPDDSPAPG LEEAAALAAD LPRLPDVSPS RAPVALVFDY PSQWAWEVQP QGADFDYFAL CFAMYRGLRK LGLSVDILPA DPARLAGHDL ILVPGLLHLS ADMTAYLATT QAQVLVGPRA GSKTPEMSIA LPLGPNLQGL DATVTHVETL PPGAERALQR GGAAERWIEA VETRAEILEE TTEGAPVLIR TGRQHYLAAW PDPEAMGRIL RDLCSRAGIQ TTDMPEGVRQ RVHGHHKLVV NYSSEMRVFE NDALPPAGLV WKLIP
|
| |