Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcav_3237 |
Symbol | |
ID | 7861709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beutenbergia cavernae DSM 12333 |
Kingdom | Bacteria |
Replicon accession | NC_012669 |
Strand | - |
Start bp | 3595706 |
End bp | 3597052 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643867338 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002883243 |
Protein GI | 229821717 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.422703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0333054 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCG GAACGCTCGC CAACGTCGGC CTCGTCCTGC TCTTCATCCT CATCGGTGGC GTCTTCGCCG GGACGGAGAT CGCGCTGGTC TCGCTCCGCG AGAGCCAGAT CGACCAGCTG GAGAAGAAGA GCGCCCGCGG TGCGCGCGTC GCCGACGTGG CGCGCGACCC GAACCGGTTC CTCGCCGCCG TCCAGATCGG CGTCACGGTC GCAGGGTTCT TCTCGGCGGC CTACGGGGCC TCGACGATCG CGCCCGACGT CGCCCCCGTC CTCGCGGGCT GGGGCGTGCC GGACGCGCTT GCCGACACCC TCGCCCTCGT GGGCCTCACG CTCGTCATCG CGTACCTCTC CCTGGTGCTC GGCGAGCTCG TGCCGAAGCG GATAGCGCTG CAGCGCTCGT CCTCCGTGGC GGTCACCGTC GCCCCCACCC TGGACCGGTT CGCGACGCTC ATGCGGCCGG TGATCTGGCT GCTGTCGGTC TCGACGAACG CCGTGGTCCG CCTGTTCGGC GGCGACCCGT CGGCGACCGG CGAGGAGATG TCCGACGAGG AGCTGCGTGA CCTCGTCATC GCGCACGAAG GGCTTCCCGA GGACGAGCGT CGGATCCTGC GGGACGTCTT CGACGCCGCT GAGCGGTCGA TCAGCGAGGT CATGCGGCCG CGGCACGAGG TCGTCTTCCT CGACGCGGAC CTGGCGATCG ACGAGGCCAC GCGGTCCGTC GTCGGCCAGC CGTACTCGCG CTATCCCGTC ATCGGCGAGG ACTTCGACGA CGTCCTCGGC TTCGTGCACG TGCGCGACCT CTTCCTCGCC GAGGTCGGCA CCCACGCCGG CGACGACGAC GCCGGTGCCC TGCCGCTGAC GCCCCCCGGT GCCGGGCCGG CCCGCACCGT CCGCGACCTC GTGCGCGAGA TCGTCGTGCT GCCCTCGACG AACGCGCTGC TGCCCTCGAT GTCGATGATG CGTCGCGCCC GCATCCACAT CGCCGTCGTC ATCGACGAGT ACGGCGGCAC CGACGGCATC GTCACGCTCG AGGACCTCGT GGAGGAGCTC GTCGGCGAGA TCCACGACGA GCACGACGCC GAGGCGGTGG TCGCGGACGA GGCCGGCGAC GGCGACATCG TCGTCGACGC CGGCCTCAAC CTCGAGGACT TCGCCGACGA GGTCGGTTTC GAGCTCGCGG ACGAGGGCGA CTACGAGACC GTCGGCGGGT ACGTGCTCGA CCGCCTGGGC CGCGTCGCGG AGCCGGGCGA CGTCGTCCCG GCGGGCGACC ACGTGCTCGA GGTCGTCGAG ACGGACGGCC GGCGCATCGT CCGCGTCCGG GTGCGGCGTA CCGACCGCCT ATCGTGA
|
Protein sequence | MDGGTLANVG LVLLFILIGG VFAGTEIALV SLRESQIDQL EKKSARGARV ADVARDPNRF LAAVQIGVTV AGFFSAAYGA STIAPDVAPV LAGWGVPDAL ADTLALVGLT LVIAYLSLVL GELVPKRIAL QRSSSVAVTV APTLDRFATL MRPVIWLLSV STNAVVRLFG GDPSATGEEM SDEELRDLVI AHEGLPEDER RILRDVFDAA ERSISEVMRP RHEVVFLDAD LAIDEATRSV VGQPYSRYPV IGEDFDDVLG FVHVRDLFLA EVGTHAGDDD AGALPLTPPG AGPARTVRDL VREIVVLPST NALLPSMSMM RRARIHIAVV IDEYGGTDGI VTLEDLVEEL VGEIHDEHDA EAVVADEAGD GDIVVDAGLN LEDFADEVGF ELADEGDYET VGGYVLDRLG RVAEPGDVVP AGDHVLEVVE TDGRRIVRVR VRRTDRLS
|
| |