Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A4528 |
Symbol | |
ID | 3749729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 1497041 |
End bp | 1499011 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637762819 |
Product | hypothetical protein |
Protein accession | YP_368768 |
Protein GI | 78065999 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0373387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.198235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATACGG AACTCGATGC CACCCGTCCC GCGCTCGGCA CCGCCGGCCT GCTGGGTCGC CTGCGTACGC TGCTGGGCGC GCCGGCCGGC GGCCGTGTCG GCACGGTGAG CCGGCTCGCG ATCGACGGCT TGCCCGATGC GTGGACGCAA CTGGAAGCCG GCGGCCTGTA TGCGATCTAC GCGGCCGGCG GCACGTCGGC TTGCGATGCG CTCGTGTGGG AGAGTGCGCG ACAGGCACGC ACGCGCGACG TGACGGTCGT GCTTGCGCGC GAGGCGGCGC AGGTCGCCGA ACAGATGCAG GCGCGCGGTT TCGCGGGCGG CGCACTGCCG GCCAGCTGGC CGCGCAACCT GAACGTGCTG GCGATGCCGC CTGCGGCCGA GCCGGCCAAC GGTGACGGCA CCAGCACCGG CGCCGGCGCA ACGGCGGACG CACCGCGCCG CGCGGCGCCG GGCCCGTTCG CACGGCTGAC CGGCGGCCTG CGCGCGATGA AGCGCTACGG CTTTCGCGCG CGCTCGCTGT ATTTCGTCGA AGGCGCGCAG CGCTGGTTCA GCTGGGACGA CCCGATCGCG CTCGCCGACG AGGCGCATGC TCTCGCCGAC TGGTGCCGCA CGCGCCGGAT CGCGCTCGTG CTGCTGCTCG ATCCCGAAGC GACCCGTGCG GGCGACGAAG CGCGCACCGA CGATGCGCCG CTCGTACGCG ATGCGGATCG TACGCCGCGC AGCGGTTTCC ACGGTATCTG CTCGGGCGTC GCGCAGTTGC AGCGCACGCA CGGCGAGCTG CTGTGGGTCG TCGATTTCTG GCGTGCGGGC GACACGCTTG CGGCCGGCGA AGTCCGGCCG CTGCGCTTTG CGCCGGGCGG CCGGCTGTCG GCGAGCGTCG ACGGCGGCGC GACCGAGCCG GCGCACCGGA TGAAGCTCGC GAGCGACGAA GATCGTGTCG TCGTCAGCCG CGCAGTGCTC GAAGGCACGA ACCGCGCGCC GGACGGCTGG GAGATCGTCG AAGACAACGC GGCCGTCGTC GCGGCGTGCG CGCATGCGCA AGCGGCGACC GCGGTGCTCG TGTTCCGCTC GCATGCGCAA CTCGAAGCGC TCTGCGCCGA CGTGCACGCG CTGCGCCGGC AGTGCGGCGG TGCGCTGAAG ATCGCGGTCG TCGAGCGCGG CGAAGTGCTG CGCCAGCAGT TCGAGATGCT CGTGCTGAGC GTCGGCGCGA GCCGCGTCGT CGCGCGCGAC CTGCCGGTAT CGCGGATGCA GGCCGCCGTG CACGCGCTGC GCGGCCAGCT CTATGCGCGG CCCGTCGCGG CCGACTACCG TGCGGCGCTC GCGGCCGCGC TCGGCGATTC GGTGCTCGGC TACCTGCCGG TCGGCGCGTT CTGCCTGCGG ATTCGCGCGG TGCTCGATCG CGGCGCCGTG CTGGCGCTGC CGCACACGCT CGCGAAGATC TCGCTGCTGC CGGGCGTGTC GCACGTCGAC GCGCTGCGGT TTTGCCGGCC GCGCCGCGCG GGCGACGTCG TGACGGCCGA TGCCGCGCAC CTGTACGTGT TCCTGTTCGC GTGCGAGCCG GCCGATGCGG AGGACGCGCT CGCGCGCATC TTCGACGTGC CGGTCGACGC GCTGTCCGAC CGTGTCGTGT GCCTCGGACA CGGCAGCATC GACACCGAGC TCAATGCGCT GAAGGCCGAG AACCGCCGCG CGCCGATCGC CGACTACAGT GATCTGTTCG CGGCCACGCA GCCGGCGGGT GCGACCCGCA AATCGTCCGT GCGCGCCGAT CCCGGCGCCG CGTTGCCCGA CGCGAGCGAA GTGAGCGGCG CGATCGACGC AATCGATGCG ATCGTCGCGC TCGAGGCGAT CGGTGCGACG CCGACAGCCG CGGCTATCGA TGCATCGGCG CCGGAACCCG TCATTGCCGC CGCGCGCAAG CGCAGCGCCG TCCGCAGCGC GATGCCGCTG CGCAAGGAGG GGGTCGCATG A
|
Protein sequence | MNTELDATRP ALGTAGLLGR LRTLLGAPAG GRVGTVSRLA IDGLPDAWTQ LEAGGLYAIY AAGGTSACDA LVWESARQAR TRDVTVVLAR EAAQVAEQMQ ARGFAGGALP ASWPRNLNVL AMPPAAEPAN GDGTSTGAGA TADAPRRAAP GPFARLTGGL RAMKRYGFRA RSLYFVEGAQ RWFSWDDPIA LADEAHALAD WCRTRRIALV LLLDPEATRA GDEARTDDAP LVRDADRTPR SGFHGICSGV AQLQRTHGEL LWVVDFWRAG DTLAAGEVRP LRFAPGGRLS ASVDGGATEP AHRMKLASDE DRVVVSRAVL EGTNRAPDGW EIVEDNAAVV AACAHAQAAT AVLVFRSHAQ LEALCADVHA LRRQCGGALK IAVVERGEVL RQQFEMLVLS VGASRVVARD LPVSRMQAAV HALRGQLYAR PVAADYRAAL AAALGDSVLG YLPVGAFCLR IRAVLDRGAV LALPHTLAKI SLLPGVSHVD ALRFCRPRRA GDVVTADAAH LYVFLFACEP ADAEDALARI FDVPVDALSD RVVCLGHGSI DTELNALKAE NRRAPIADYS DLFAATQPAG ATRKSSVRAD PGAALPDASE VSGAIDAIDA IVALEAIGAT PTAAAIDASA PEPVIAAARK RSAVRSAMPL RKEGVA
|
| |