Gene Information |
Locus tag | BTH_II0791 |
Symbol | |
ID | 3845813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 921076 |
End bp | 923397 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637838094 |
Product | cellulose synthase regulator protein |
Protein accession | YP_438988 |
Protein GI | 161723113 |
COG category | |
COG ID | |
TIGRFAM ID | |
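The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 921076
END_BP = 923397


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2322
print(protein_length(length_bp))  # 773
```

Both results match the Gene Length (2322 bp) and Protein Length (773 aa) entries in the table.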
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0115252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCCGA TTGCAACGTT CGCTTGCGCA TTCGCGATCG CCTCCGCGCT CGCTCATCCG ACGGCATCGC GCGCCGCGCC CCAGCCGCAG CGCGCGCAGC CGCAGCGGGA GCAGCCGCTG AGCCCGCCGC CGGCATCGCT CGCCGCCTCC GCGCCGCTCG TCATGCCGCT CGCCGCGCCG AAGCCGCCCG GCATCGCCGG CGCCGCGATC CGCGTGCCGT TCGCGACGCT CGGCGCATAC GAGCCGCTGC GCCTGCGCGG CAGCGACACC GCGCGAACCG TCAACGTCGG CGTGCGGCTC GACCGCGTGG TCACGGCCGC GCGGCTGCGG CTCACGTATA CGTATTCGCC GTCGCTCGTG TTTCCGGTGT CGCATCTGAA GGTGTCGATC AACGGCGAGG CGGTCGCGAC GCTGCCGTTC GACGGCGGGC ACGCCGGCCG CGCGATCACG CAGGAGATTG CGCTCGATCC GCGCTACTTC TCCGATTTCA ACCAGATCGA GCTGCGCCTC ATTGCGCACT ACACGCTCGA CCATTGCGAG GACCCGGAAA ATTCGGCGCT TTGGGCCGAC GTGAGCCCGA CGAGCGAATT GATCTTCGAT GAGGCTTCGG TGCGGCTGCC GAACGATCTC GCGCTGTTGC CCGCGCCGTT CTTCGATCGT CGCGACAACA GCCGGCTGAG GCTGCCGTTC GTGCTGCCCG CGACGCCGGA CGACGCGACG CTGCGCAGCG CGGGCGTGCT CGCGTCCTGG TTCGGTGCGC TCGCCGATTA CCGGCAGGCG CGCTTTCCGG TGTCGTCGGC GCTGCCCGCG AACGATCACG CGGTGGTTGT CGGCACGGCG GCGCAATTGC CGGCGTCGCT CGCGCTGCCG CCGATCGACG GGCCGATGCT CGTCGTCGCA GACAATCCGG CCGCGCCCGA CAAGAAGCTG CTCGTCGTCA CGGGCCGCAG CGCGGCCGAC GTCGATGCGG CGACGAACGC GCTCGTGCTC GGCAACGCCG CGCTGTCCGG CCCGTGGGCG CGCGTGTCGC GCATCGACAT CGGCGCGCCG CGCGAGCCCT ACGACGCGCC GCGCTGGGTG CCGGTGAACC GGCCGGTGAC GTTGCGCGAG CTCGTAGACG ACCCGGCCGA CCTGCAAGTG CGCGGCAGCG CGCCCGATCC GATCCGCCTG AACCTGCGCG TGCCCGCCGA TCTGCATTCG TGGGGCGGCG CGGGCGTGCC GCTCGCGCTG CACTATCGCT ACACCGCGCC GACCGTGCGC AGCGATTCGA TGCTCGCCGT CGAGATCAAC GACCAGCTCG TGCAGTCGTA CCGGCTCTCG CCGCGCAGCC AGGACGCGCT CGGCCGCGTG CAGTTGCCGC TCTTGTCCGG CTCGGACAGC CGCGCGACGA ACGACGTCGA CATCCCGGCG TTTCGCGTCG GCAGCGCGAA CCAGCTGCAG TTGCGCTTCA CGCTCGATTC GGAGAAGACG GGGCTTTGCA CGGGCGTCGC GAGCGAACCG CAGCGCGCGG CGATCGATCC CGATTCGACG ATCGACTTCT CGCGCTTCAT CCACTACGCG ATGCTGCCGA ATCTCGCGTA TTTCGCGAAC AGCGGCTTTC CGTTCACGCG CTACGCGGAT CTTTCGCAGA CCGCGATCGT GCTGCCGCAG CGGCCGTCGC CGGCCGAGCA GGAAGCGTAT CTGACGATGC TCGGCCACAT GGGGCAATGG ACCGGCTTCC CGGCGCTGCG CGTGCAGGTC GCGCGGGCCG CCGACGCGCC GCGCATCGCG CACAAGGATC TGCTCGTGAT CGACGGCGCG 
CCGCCGTATG CGCAACTGGC GCACTGGCGC GACGCGCTGC CCGTCGCGAT CGGCGAGGGC GCGGGCGAGA GCGGCGGCGG TTTCTCGCGC GCGGCGTTCT CGGTGAAGGA GCGCTGGCCC GGCGACGCGC GCTCGCCGGC GGGCGGGGCG CGCTTCGAGC AGAGCGGCGC CCTCGCCGCG CTGTTCGGAT TCGAGCGGCC GGGCAGCGAC GGGCGCAGCG TCGTCGCGCT GACGGCGACC GACGCGCGGC ATCTCGGCGA TCTGCTCGAC GTATTCGAAA AGCCCGGCCT CGTCGCGCAA CTGCAGGGCG ACGTCGCGCT CGTGCGCTCG GGCGCGGTCG AGAGCCTGCG CGTCGGCGAG CCGTATCTCG TCGGCTACGT GCCGTGGTAT GCGCGCGTAT GGACGGCGGT CGCGAAGCAT CCGGCGCTGC TCGGCCTGCT CGGCGCGGCG GCCGGGCTGC TGCTCGCGCT CGGCGCGTTC GGCGCGTTGC AGCGGATCGC CGCGCGGCGG CGGGGGATCT GA
Protein sequence | MKPIATFACA FAIASALAHP TASRAAPQPQ RAQPQREQPL SPPPASLAAS APLVMPLAAP KPPGIAGAAI RVPFATLGAY EPLRLRGSDT ARTVNVGVRL DRVVTAARLR LTYTYSPSLV FPVSHLKVSI NGEAVATLPF DGGHAGRAIT QEIALDPRYF SDFNQIELRL IAHYTLDHCE DPENSALWAD VSPTSELIFD EASVRLPNDL ALLPAPFFDR RDNSRLRLPF VLPATPDDAT LRSAGVLASW FGALADYRQA RFPVSSALPA NDHAVVVGTA AQLPASLALP PIDGPMLVVA DNPAAPDKKL LVVTGRSAAD VDAATNALVL GNAALSGPWA RVSRIDIGAP REPYDAPRWV PVNRPVTLRE LVDDPADLQV RGSAPDPIRL NLRVPADLHS WGGAGVPLAL HYRYTAPTVR SDSMLAVEIN DQLVQSYRLS PRSQDALGRV QLPLLSGSDS RATNDVDIPA FRVGSANQLQ LRFTLDSEKT GLCTGVASEP QRAAIDPDST IDFSRFIHYA MLPNLAYFAN SGFPFTRYAD LSQTAIVLPQ RPSPAEQEAY LTMLGHMGQW TGFPALRVQV ARAADAPRIA HKDLLVIDGA PPYAQLAHWR DALPVAIGEG AGESGGGFSR AAFSVKERWP GDARSPAGGA RFEQSGALAA LFGFERPGSD GRSVVALTAT DARHLGDLLD VFEKPGLVAQ LQGDVALVRS GAVESLRVGE PYLVGYVPWY ARVWTAVAKH PALLGLLGAA AGLLLALGAF GALQRIAARR RGI
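The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiator codons and is read as methionine when it starts a CDS. A minimal sketch of that relationship follows. It assumes the standard codon assignments in NCBI base order (T, C, A, G) and treats only ATG, GTG and TTG as start codons, which is a subset of the initiators NCBI lists for table 11. The `translate_cds` helper is hypothetical, not a function from any library.

```python
from itertools import product

# Standard amino acid assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a common subset of the table 11 initiator codons.
START_CODONS = ("ATG", "GTG", "TTG")


def translate_cds(dna):
    """Translate a CDS, reading an alternative start codon as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"  # initiator codons are decoded as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAAGCCGA TTGCAACGTT CGCTTGCGCA"))  # MKPIATFACA
```

The output matches the first ten residues of the listed protein sequence.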