Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcav_3576 |
Symbol | |
ID | 7860840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | + |
Start bp | 3973718 |
End bp | 3977305 |
Gene Length | 3588 bp |
Protein Length | 1195 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 643867679 |
Product | hypothetical protein |
Protein accession | YP_002883580 |
Protein GI | 229822054 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.61557 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGAACC CCCCACCCCC GTGGACGCTC GACGGGGGCC GGCTGCGCGC CGTCGTCGCC GACGGCGGGG CCATCGCCCG GCTCGCCACC GACGACCTGG AGCTCGTGCA GCACGCGGGC TCCGCGCTCG AGCCGGGTCT GCTGGGGCTG TGGGTCCGGG ACGCGCGGCA CGGGCTGCAC CCGCTGCTCG GGCCCGGCAG CGGGTCACGC GTGGCTGTGA GCGGGGCCGG GTCTCAGGCG CCGGCCGCGG AGGCGCAGGA ATCCTCTCGC GGTGGCGCCG AGAACCTCTC GCGGATGGAG GCGGCCGGCG AGTGGGACGA CGGCGGGCTC GCGTGGCGCG TGCGGTTCTC GCTGGTCCCC GGGGAGCTCG CGTGGGCGTG GGACGTCGAG CTCTGGCCCA CGGACGGTGA GCCGCACGAC GTCGACGTGG TGCACGCGCA CGACGTGGCC CTCGCCCCGG CCGGGATGCT GCGCGCCAAC GAGCTGTACG TCAGCCAGTA CCTCGACGTC ACGCCGCTCG AGCACCCGAC GCACGGCTGG GCGCTCGCCG TCCGGCAGAA CCTGCCCGTG CCGGGCGGGC ACCCGTGGCT CGTGCTCGCG TGCACGGACC GCGCGGCCGC CTACGCCACC GACGCGATCG ACGTGCACGG CCTGGCCGCG CGGGCAGCCG AGCCGCCGCC GCTGGCGGGT GGGCTCCCGT CGCGTCGGCG GCAGCACGAG CACACGCTCG CTGCCCTGCA GACGCAGCAG CACCGGCTCG ACCCCGACGG CGCCTGGCGC GTCTCGTTCG CCGCGCTGCT GCGCACGGAC CACCCGGCCG CGACCGGCGA GGCCGACCTG GCCGCCGTCG AGGAGGCGAT CGCCCTGGCG CGCGGCGTGC GCGGAGCGGC AGGCGAGGCG CCGACCGCGC CTCCGCGCCC CGCCGTCGCG AGCGTCTACG CGCCCGCCCG CGTCGTGCCC GTGCGTGACC TGCGGACGGC GGAGGTCGCC GAGCGGTGGG GCGTGCCGGA GCGCGGCGTC GAGACCGCCG ACGACGGCGC GCTGCTGGCG TTCGCGCCCG ACGCCGAGCG GCACGTGGTG CTCGGCGCGA AGGAGCGCGC GGTCCTGCGG CCGCACGGCA CGATCCTGCG CGGCGGCCAC GGGCTCGCCC CCGACCCCGG CACCGTGGCC GTGACCGCGT GGATGACCGG CTCCCCCCTG TCGTACCTCA CGCGCGGGCA CGCGAGCTCC GCGCGCGCTC TCACGACCGT GCGCGGCTAC CTCGGGCTCC ATCGGGCGTA CGGCGTGCGG GTGCTCGTCG AGGACGTCAC CGACGGCGGC TGGCACCTGC TGGACGTCGC GTCGGCCTTC GAGATGACGC CCGACGGCGC ACGCTGGGTC TACGTCACCG ACGGCGTGAC GATCGAGCTG GCGACGTCCC TCACGTCCGA CGGCGCCGTC CTCGACGTCA CGAGCTCGGT GCCGCGGCGC TTCCTCCTCG CGTGCCACCT CGCGACGTCG GGTACGGACA ACGCCCCGGC GCCGGGCGAG CTGGAGACTC GCGTCACGCC CGACGGCGCG AGCGTGGTGT TCGGCCCGGC GACCCCGCTC GGGTCCGTGG CCCCCGGAGC GAGACTCACG CTGGTCGCCA CGGAGAACCC TCAGGCTCCG AGCTCCTCTC GGGAAGCCTC GGACAACGTC TCGGGAACGG TAGGGGCGGG CGACGACGGC ACGCTGTTCG ACGACGGCGC CTCCCGCGGT CTCCCGGTCG TCACGTTCGC GAGCGGCCCG ACGACGTCCC TGCGGCTCGA CGCCCGCCTC CACACCTCGC GAGAGGTTCG TGACCCCTCC GCGAGAGGTT CGTGGCCCGA CGAGGAGACG CTCGTGGCGG CGGAGCGAGG CGCAGAGGCA GGTTCCTCGA GCGCGGACGC CCCACCGGCC CCGGAACCCG AGCCCGCCGC CGTCGTCCTC CCCTCTCTCG CGCTGCCCTC CACGGCGGAC GCCGCCGCCG CGGCGGACGT CGACGCCGTC GCGCTGGCCG TCCCGTGGCT CGTGCGCGAC GCACTCGTGC ACTACCTGGC CCCCCGCGGC CTCGAGCAGT ACACCGGCGG CGCGTGGGGC ACCCGTGACG TGTCCCAGGG ACCGGTCGAG CTGCTGCTCG CGCTGGACCG CCCTGACGAC GTCCGCGCGC TGCTGCGCCG TGTGTTCGCC GCGCAGAACG CCGACGGCTC GTGGCCTCAG GCGTTCGGCT TCCTCCCCGG CGACGAGGAC TTCCGGCACG AGCCGCCCCA CGGGGACGTC GTGTTCTGGC CGGTCCTCGC GCTCGGCCGC CACCTGGTGA CGACCGGCGA CGCCGGCGTG CTCGACGACG TCGTCGGCTA CCACGCCGGC GCGCGGGCGG AGGAGTCGGT GCTCGAGCAC GCGCTGCGCG CGCTCGACGC CGCACGGGCC GCGACGCTCC CCGGCACCCA CCTCGCGGCG TACGGCCACG GCGACTGGAA CGACTCGCTC CAGCCCGCCG AGCCCGGGAT GACCTCGACG CTGACGAGCT CGTGGACGGT GACGCTGCAC CACCACGCGC TGCAGATGCT CACCGAGGGG CTCGCGGGCG CGGGCGTCCA CGAGGAGCTG GCCGACTCGC TCCGTGCGGA GGCGGCCGCC GTCGCCGCCG ACGCCCGGCG CCACCTCCTC GTGGACGGAG AGCTGGCGGG GTACGCGCAG CTCGCCCCGC CGGAGCCCGC CGGTGACGAC GGTGCGGCGG CCACGCCGGC GCACGTCACC CGCCTGCTCG TGCACCCGCG CGACGACGAG ACCGGTCTGA CGCACAGCGC CCTGCCGATG ATCCACGCGA TCGCCGAGGA CTTCTTCACA CCGGACGAGG CCTGCCACCA CGTGCAGATC CTCCGCGAGC ACCTGCTGGG CACCGACGGC ATGCGCCTGT TCGACCGCCC GGCGCAGTAC TCCGGCGGCC CGATGGTGCA CTTCCAGCGG GCGGAGAGCG CGACGTTCGT CGGGCGCGAG ATCGGCCTCA TGTACGTGCA CGCGCACCTG CGGTGGTGCG AGGCGCTCGC CCGGCTGGGC GACGCCGACG GCCTCTGGCT CGCGCTGCGG CAGGTGCTCG GCGGTGCGGT CGCCGGCGCC GTCCCGGGCG CCCGGCCCCG CCAGACGAAC ACCTACTTCT CCAGCTCGGA CGCCGCCGTC GCCGACCGCC CCGAGTTCGC CGCGCGGTAC GGCCAGGTCC GCTCCGGCGA GGTGGCCGTC GAGGGCGGGT GGCGGATCTA CTCCTCCGGC CCGGGGATCA CGGTGCGCAT CCTGGCCGAG GTGCTGCTCG GGATCCGTCG GCGCGGGGCG TGGATCGAGG TCGACCCGGT CCTTCCGCCC GAGCTCGACG GCCTGGAGGC GCGCGTCCCG TTGCTGGGCG GCGAGCTGGC CGTGACCTAC CGCGTCGGCA CCCGCGGCGC CGGCCCTTCG GAGATCCGCC TCGGCGGCCG GACGATCGAG TTCGAGCGGC TCGCGAACCC GTACCGTGAG GGCGGCGCCC GGGTGGACCT CGAGCCGCTC CGGGACGCCG TGACCACCCA CGAGACCCTG GAGATCGTGC TGCCGTGA
|
Protein sequence | MTNPPPPWTL DGGRLRAVVA DGGAIARLAT DDLELVQHAG SALEPGLLGL WVRDARHGLH PLLGPGSGSR VAVSGAGSQA PAAEAQESSR GGAENLSRME AAGEWDDGGL AWRVRFSLVP GELAWAWDVE LWPTDGEPHD VDVVHAHDVA LAPAGMLRAN ELYVSQYLDV TPLEHPTHGW ALAVRQNLPV PGGHPWLVLA CTDRAAAYAT DAIDVHGLAA RAAEPPPLAG GLPSRRRQHE HTLAALQTQQ HRLDPDGAWR VSFAALLRTD HPAATGEADL AAVEEAIALA RGVRGAAGEA PTAPPRPAVA SVYAPARVVP VRDLRTAEVA ERWGVPERGV ETADDGALLA FAPDAERHVV LGAKERAVLR PHGTILRGGH GLAPDPGTVA VTAWMTGSPL SYLTRGHASS ARALTTVRGY LGLHRAYGVR VLVEDVTDGG WHLLDVASAF EMTPDGARWV YVTDGVTIEL ATSLTSDGAV LDVTSSVPRR FLLACHLATS GTDNAPAPGE LETRVTPDGA SVVFGPATPL GSVAPGARLT LVATENPQAP SSSREASDNV SGTVGAGDDG TLFDDGASRG LPVVTFASGP TTSLRLDARL HTSREVRDPS ARGSWPDEET LVAAERGAEA GSSSADAPPA PEPEPAAVVL PSLALPSTAD AAAAADVDAV ALAVPWLVRD ALVHYLAPRG LEQYTGGAWG TRDVSQGPVE LLLALDRPDD VRALLRRVFA AQNADGSWPQ AFGFLPGDED FRHEPPHGDV VFWPVLALGR HLVTTGDAGV LDDVVGYHAG ARAEESVLEH ALRALDAARA ATLPGTHLAA YGHGDWNDSL QPAEPGMTST LTSSWTVTLH HHALQMLTEG LAGAGVHEEL ADSLRAEAAA VAADARRHLL VDGELAGYAQ LAPPEPAGDD GAAATPAHVT RLLVHPRDDE TGLTHSALPM IHAIAEDFFT PDEACHHVQI LREHLLGTDG MRLFDRPAQY SGGPMVHFQR AESATFVGRE IGLMYVHAHL RWCEALARLG DADGLWLALR QVLGGAVAGA VPGARPRQTN TYFSSSDAAV ADRPEFAARY GQVRSGEVAV EGGWRIYSSG PGITVRILAE VLLGIRRRGA WIEVDPVLPP ELDGLEARVP LLGGELAVTY RVGTRGAGPS EIRLGGRTIE FERLANPYRE GGARVDLEPL RDAVTTHETL EIVLP
|
| |