Gene Information |
Locus tag | Bcav_3080 |
Symbol | |
ID | 7858540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 3434465 |
End bp | 3436798 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643867177 |
Product | Arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_002883086 |
Protein GI | 229821560 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | |
| |
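The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length of an unspliced CDS includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3434465, 3436798)
print(length, protein_length_aa(length))  # prints "2334 777"
```

Under these assumptions the result agrees with the Gene Length (2334 bp) and Protein Length (777 aa) fields.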
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.211646 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCTCC CGGCCCAGGC GGCGTCGGCG CACCGCGACC CGCGGCCCGT CGAGGCGGGG ATCGTCGTCG AGCGGGTCGA GGGCATGCCG GAGGACTTCG TCAACGGCGT CGACGTCTCC TCCGTCCTGT CGCTCGAGGA GAGCGGTGTC ACCTTCCGCG ACCGCCGCGG CCGACCCGCC GACCTCTTCG AGGTGCTGGC GGACGCCGAC GTCACGCACG TGCGGATCCG GGTCTGGAAC GACCCGTTCG ACGCGGAGGG GCACGGGTAC GGCGGCGGGA ACGTCGACGT CGCCCGGGCC GTGGAGATCG GCCGGCGGGC GACGGCGGAG GGCCTGAGCG TGCTCGTCGA CTTCCACTAC TCCGACTTCT GGGCCGATCC CGGGAAGCAG CAGTCGCCCA AGGCCTGGGA AGGGCTGGCC ATCGAGGCCC GGGCTGCGGC CGCCGGCGAG TTCACCACGA GCGCCCTGAC AGCCTTCCGC GACGGCGGCG TCGACGTCGG GATGGTCCAG GTCGGCAACG AGACGAACAA CGGCGTCGCC GGCGTGACCG CCTTCGCGGA CATGGCGCAG ATCTTCAGCG CGGGCAGCGC GGCGGTCCGG ACGGTGTTTC CGGACGCGCT CGTGGCCGTG CACTTCACGA ACCCCGAGAC CGCGGGTCGC TACGCCGGCT ACGCCGCCGA GCTCGAGCGG TACGGCGTGG ACTACGACGT GTTCGCGAGC TCGTACTACC CGTTCTGGCA CGGGACGCTC GAGAACCTCA CGTCGGTGCT GTCGCAGGTC GCGACCACCT ACGACAAGAA GGTCATGGTG GCCGAGACGT CCTGGAACTC GACGTTCGAC GACGGCGACG GTCACCCGAA CACGATCACG TCGTCGTCGG GCTTCACGCA GTACCCGGCG AGCGTGCAGG GCCAGGCCAT GGCGGTGCGG GACGTGATCG AGGCGGTCGT GAACGTCGGC GACGCCGGGA TCGGTGTCTT CTACTGGGAA CCGGCGTGGC TGCCGGTCGG CCCGCCCGAG GAGGTCGAGC AGAACCGCCT GCTGTGGGAG CGGGACGGCT CCGGCTGGGC GACGAGCTAC GCCGGTGAGT ATGACCCGGA GGACGCCGGC GTCTGGTTCG GTGGCTCGTC CTGGGACAAC CAGGCGCTGT TCGACTTCTC GGGTCGGCCG CTGGAGTCGC TCATGGTCTT CGCTTACGCC CGCACCGGGG CGACGGCGCC GCGGGCGGTG ATCGCCGTCG AGCAGGTGGA GGTGGTGGTC ACCGAACGTG AGCCGATCGT GCTGCCTGAC GCCGTGGAGG TCACCTACAA CGACCGGTCC GTCGAGACCC AGGCCGTCAC ATGGAGCGAC GCGGTGGACT GGATCCGCGG GCCGGGTGAA TACCGCATCA CCGGGCTCAC CGGTGCCGGG GTGCTCGCCG AGGCGAGCGT GACCGTCCAG GCGGTGAACC ACGTGCGCAA CCCCGGCTTC GAGGATCCTG ACCTCAGCAT GTGGGTGATC ACCGGCGAGG GCGCCGAGGT GGCCGACGAT CCCGATGCGT TCGAGGGGTC GCGGGCGCTC AAGTTCTGGG CCGCCACCGA CTACTCGTTC GCGATCAGCC AACGTCTCGA GGGAGTTCCG GCCGGCACCT ACGCGCTCTC GGCCACGACG CAGGGCGACG ACGCCGGGGA GACCGACTCC ATGCTCCTGA GCGCGAGCAC GGCCGCGGGG GAGGTCAGCG CGCCGCTCGA GCTCGCCGGC TGGCGGAACT GGAGCACCGC GACGATCGAT GAGGTCGTCG TCGGCGACGA CGGCGTGGTG 
ACGGTGAGCG CGGCCTTCAC GGTGCAGGGT GGTGCGTGGG GCAACGTCGA CGCCGTCACG CTGTCGCAGG TCGACCGGAC CGAGATCGAC ACCACGGCGC TCGAGGCCGC GCTCGCCGAG GCATGGGCCG TCGACCGCAG CCTGTACACC GCGGAGTCGC TCCGGGTCCT CGACGACGCC GTCGAGAAGG CGGAGGTCGT CCTCGCCGGG AGCCGGGCCG AGCAGGCGGA CGCCGATGCG GCGACGTCGC TCCTCGTCGA GGCGTTGGCC GGCCTCGAGC CCGCCGACGA CGGGCCCGGC CGGCCGTGCC GCCCGGGTGG CGGTGGGCCG GGGCACCCCG GGCATCCCGG CGGACCCGGG CACCCCGGGC ACCCCGGGCA CCCCGGACAC CCCGGACACC CCGGACACCC CGGACACCCC GGCGGACCCG GACACGCTGG TCATCCGGGC GGAAGTGACG TCGAGCACCG AGGCGGCCCC GGCCATCCCG GAGGCCCCGG AGGCCCCGGC GGCCCTGGCC GCCCCTGCCG GTAG
|
Protein sequence | MILPAQAASA HRDPRPVEAG IVVERVEGMP EDFVNGVDVS SVLSLEESGV TFRDRRGRPA DLFEVLADAD VTHVRIRVWN DPFDAEGHGY GGGNVDVARA VEIGRRATAE GLSVLVDFHY SDFWADPGKQ QSPKAWEGLA IEARAAAAGE FTTSALTAFR DGGVDVGMVQ VGNETNNGVA GVTAFADMAQ IFSAGSAAVR TVFPDALVAV HFTNPETAGR YAGYAAELER YGVDYDVFAS SYYPFWHGTL ENLTSVLSQV ATTYDKKVMV AETSWNSTFD DGDGHPNTIT SSSGFTQYPA SVQGQAMAVR DVIEAVVNVG DAGIGVFYWE PAWLPVGPPE EVEQNRLLWE RDGSGWATSY AGEYDPEDAG VWFGGSSWDN QALFDFSGRP LESLMVFAYA RTGATAPRAV IAVEQVEVVV TEREPIVLPD AVEVTYNDRS VETQAVTWSD AVDWIRGPGE YRITGLTGAG VLAEASVTVQ AVNHVRNPGF EDPDLSMWVI TGEGAEVADD PDAFEGSRAL KFWAATDYSF AISQRLEGVP AGTYALSATT QGDDAGETDS MLLSASTAAG EVSAPLELAG WRNWSTATID EVVVGDDGVV TVSAAFTVQG GAWGNVDAVT LSQVDRTEID TTALEAALAE AWAVDRSLYT AESLRVLDDA VEKAEVVLAG SRAEQADADA ATSLLVEALA GLEPADDGPG RPCRPGGGGP GHPGHPGGPG HPGHPGHPGH PGHPGHPGHP GGPGHAGHPG GSDVEHRGGP GHPGGPGGPG GPGRPCR
|
| |
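The gene sequence above is printed in space-separated blocks of ten bases and is broken across lines, so any check of the GC content field has to strip whitespace first. A minimal sketch, assuming GC content is defined as the percentage of G and C bases in the sequence; `gc_percent` is a hypothetical helper, not a function from the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Only the first three blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to compare against the reported GC content.
first_blocks = "ATGATCCTCC CGGCCCAGGC GGCGTCGGCG"
print(round(gc_percent(first_blocks), 1))
```

Because the helper removes all whitespace before counting, the sequence can be pasted exactly as it appears in the record.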