Gene Information |
Locus tag | Bcav_1457 |
Symbol | |
ID | 7860926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 1660184 |
End bp | 1663006 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643865544 |
Product | glycoside hydrolase family 2 immunoglobulin domain protein beta-sandwich |
Protein accession | YP_002881477 |
Protein GI | 229819951 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
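The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Bcav_1457).
length_bp = gene_length(1660184, 1663006)
print(length_bp)                  # 2823
print(protein_length(length_bp))  # 940
```

Both results match the Gene Length (2823 bp) and Protein Length (940 aa) fields of the record.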
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGAGA TCATCGACCT CGACGCGCCC GCACACATCC GCAACCATGC CGAGCTCGCC GCTGAGGTGG AGCGACTCCT CGCAAGCCTC GGCGACATCG GCGACGTCGG GCCATCGCCC GCCGGGCGTG CCGTGCTGAC GGGCTGGACG GCGCGGGACG TCGAGCACCG CCCTGCGGCC CCTGCGAGCG TGCCGGACGA CGAGCTGTTC CCGGTCTCCC TGCCGCACAC GGTCGAGGCG GCGTTCGTGC ACCTGAGGTG CCGTGTGCCG GCCGACCTGG TAGCCGGGGA GCGGCCGGTG CTCTGCGTCG GCGCCGCGGA CTACGAGGCG TGGGTGTACG TCGATGGCGT GCTGCGCGCC GAGCACAGCG GATACTTCGC TCCGTTCGAC GTCGCGCTGA CAGCGGCCGA CAGGTCCGGG TTCCAGCTCG ACATCGTCAC CCGGCGCGCC CGCGAGTCGT TCCGGTGGAC CGGCGAGAGC GAGGCACCGA ACCACGGCGA CGCCCATGGC AAGGGCCTCG GCTCGGTGTG CGGCGACGCC GTCGCTGCCG GGGCAGGACT GCTGCAGCCG GTGTGGATCG AGCACCGCCC CGCGACGCGT ATCGCTCAGC ACCGCGTCGT GGCTGGGGCC GACGGCGTCG TCCGCGTGGA GGTCGACCTG GAGTCGGCAG ACTCGGAGGA CCCGGCCCCG GGCGCACTGC ACCTCCTCGC CGAGATCCTC GAGGGCGACG ACGTCGTCGG GTCGTCCGCG GTAACGGCTG CCCGCCAGGG GCGCACGGTG CTCGACGTGT CGGTGGTCGA CCCGCATCCC TGGTCGCCGG CCGATCCCCA CCTGTACCGG CTCCGGCTCG CTGTCGTCGT CGACGGCGAG CTCCGGGACG AGGTCAGCGG TGCCGTCGGC CTACGGACGA TCGAACGCCG CGACGACGGG TCGCTGCTGC TCAACGGCGC ACCTCTGTAC CTGCGTGGCA CGACGACGAT CGGATGCTTC TGGGACGCCG CGTGGTCCGG CGACGCTGAC GCCGTGCTCC GCCAGCTGCT GGTCGTCAAG GCACTCTTCG GGAACACGAT CCGCGTTCAC GTCAGCGTGC TCCCGGACCT CTTCTACGCC TGCGCGGACC GCGTGGGGAT CCTGGTGTAT CAGGACGCTC CGCTGCAGTG GCACTGCTTC GAGCCGCGTA AGGACGACCT CGACGCTGAG CTGGACCAGA TCCGCGAGCT CGCCGTCGGG ATCCGCAACC ACCCGAGCGT GGCGCTCGTC TCGGTCCCGA ACGAGATGCA CATGGACATC CCGTTCCACG ACTCCGACCT CGAGTTCGTT GCTCGCGCAC ATGCCGTGCT CGAGTCCGAT GGTCCGCAGG CGGTTCTCCT GCAGGACTGG GGGGGAGAGG GTCGGGCACG GGTCGCGCAC CAGGCGCTGC ACGATTACCC GGGCTACTTC CACCACACCC CGGAAGCCGG GAGCGGCGTC ACCGACTGGG GAGGCGCCCG GCTCGACCCG GGTGTGCGCG CGATCGTCTC CGAGTACGGC GGTGGGGCAG TCCCGTCGTG GGAGGCGATG CTCCAGTCTC GCGCGGCGGC CGAACGCCGC GGTGAGCCGT TCACGCTCCC GGACACCCCG GACGGCCCGT GGACAGCGGA GGAGGGTGTC TTCGCCCAGA CCGGCGAGCT GCTCGACAAC CACCAGCGGA TGGTCGGCTG GCAGCCGTCG TTCGCCGCCT ATCACGCGGC GTCGCAGGCG GTGCAGGCGC GGGTGCTGAC GGCGCAGACC GGCCGGATCC GGCGCGATCG TGCGCGCACT 
TCCGGTGTGA TCCACCACTA CCTGCAGAAT CCGGCGCCCC ACTTCTACAA CCCGTGGGTC GACCTTCACG TCATCGACAG CGCCGGACAT CTCACAGGCG GCTTCGATGC GCTCCGGGAG GCGATGCGCC CGGTAGCACT CGACGTCCTG GGGTTGCCGT ACCGCGCCTA CGCCGACTCT GACCTGAGAG CGGAGCTGTG GGTCTACTCC GACCTCGCCG TCGCCACGGT CGGCCGGCTC GTCTGGTCCT GGAGCGCCGA CGGTCACGTT GTCGACGCTG GAACCACCGA CCTCACGATC GGGGCGGACT CCTCGACGCT CGTCGTCGAG CTCGCGCTGC GGGCGCCGTC GGCCGGCGGC GGCGCTCGGC TGAGCGTCGC CCTCGAGATT GAGGACGTGC GCTGGACCGA GAGCGAGGTC ACCGTCGGCG TGCACAACCA GCCTGAGACG CTGCCCGAGC CGGTGGCGGT GCTCGGCGAC GTCGCGGGTT TCGCGGAGCG CACAGCGTCG TGGCTCCCGA ACGTGGAGAC CTTCTCCACG GCGGGCGAGG CCACCAGCGT CGTCGTGACG CCGGACGTCG CGCTCGAGGA CGAGGTTGTC GCGCAGCTGC GCGCGTTCGC CCGCACCGGC GGGCGGGTCG TGCTGCTGGA GCGTTCGGCG GCCGACGACC TGCGCTGGGT GAGCAGGCGT ATCCCGGTCG GCGTCGCCGT CAACCAGATC GACGGTCTGG CCTCGGTCGA CATGTCGATG GTCTCCACCG GCCTGACGAG CGAGGACCTG TCGCGCTGGG CGACGCCGGA CGGCCGCGTC ATCGACGCTC CGCTGGTGTC GACCGACCGT CGCGGCGCCC CGATCTGGGC GCGCAGCGGC CATCGCCTCC AACTGGCGGC GCTCCAGGAG TTCGACCAGG AACCGGGCAC CGTCTTCCTG TGCCAGCTGC TCGTGTGGTC CACCCTCGGT CGCGAACCGA GTGCGGCGCG GGTATTGCGA GCGCTCCTCA CCACGCCGTA CCGCCCGCTG TGA
|
Protein sequence | MHEIIDLDAP AHIRNHAELA AEVERLLASL GDIGDVGPSP AGRAVLTGWT ARDVEHRPAA PASVPDDELF PVSLPHTVEA AFVHLRCRVP ADLVAGERPV LCVGAADYEA WVYVDGVLRA EHSGYFAPFD VALTAADRSG FQLDIVTRRA RESFRWTGES EAPNHGDAHG KGLGSVCGDA VAAGAGLLQP VWIEHRPATR IAQHRVVAGA DGVVRVEVDL ESADSEDPAP GALHLLAEIL EGDDVVGSSA VTAARQGRTV LDVSVVDPHP WSPADPHLYR LRLAVVVDGE LRDEVSGAVG LRTIERRDDG SLLLNGAPLY LRGTTTIGCF WDAAWSGDAD AVLRQLLVVK ALFGNTIRVH VSVLPDLFYA CADRVGILVY QDAPLQWHCF EPRKDDLDAE LDQIRELAVG IRNHPSVALV SVPNEMHMDI PFHDSDLEFV ARAHAVLESD GPQAVLLQDW GGEGRARVAH QALHDYPGYF HHTPEAGSGV TDWGGARLDP GVRAIVSEYG GGAVPSWEAM LQSRAAAERR GEPFTLPDTP DGPWTAEEGV FAQTGELLDN HQRMVGWQPS FAAYHAASQA VQARVLTAQT GRIRRDRART SGVIHHYLQN PAPHFYNPWV DLHVIDSAGH LTGGFDALRE AMRPVALDVL GLPYRAYADS DLRAELWVYS DLAVATVGRL VWSWSADGHV VDAGTTDLTI GADSSTLVVE LALRAPSAGG GARLSVALEI EDVRWTESEV TVGVHNQPET LPEPVAVLGD VAGFAERTAS WLPNVETFST AGEATSVVVT PDVALEDEVV AQLRAFARTG GRVVLLERSA ADDLRWVSRR IPVGVAVNQI DGLASVDMSM VSTGLTSEDL SRWATPDGRV IDAPLVSTDR RGAPIWARSG HRLQLAALQE FDQEPGTVFL CQLLVWSTLG REPSAARVLR ALLTTPYRPL
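As a quick sanity check, the start of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch rather than the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). It does not handle alternative start codons such as GTG or TTG, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCACGAGA TCATCGACCT CGACGCGCCC"
print(translate(prefix))  # MHEIIDLDAP
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence, with its space-separated blocks left in place, should yield the complete 940-residue protein.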