Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1985 |
Symbol | |
ID | 3844491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 2412646 |
End bp | 2413863 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637839286 |
Product | chain length determinant domain-containing protein |
Protein accession | YP_440179 |
Protein GI | 83718074 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3524] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAAC TCGAATCCGG TGGCGCGGGG CTGGATGACG TCGAACGGCG CGCGCTCCGC GGCCCCGCCC TGGGCCGCGC CGACGTGCTG ATCGCGCTCG GTCACGGCAA GGGGCTGATC GCGCGCATCG TCGCCGCGGC GGTGCTGCTC GGCATCGCGC TCGCGCTCGT GCTGCCGCCC ATCTACGAGG CGAGAACCGT GCTGCTGCCG CCGGACGAGT CGCGCGGATT GTTCGGTCAT TCGCTGGGCA GTCTCGACGT GATCGCGGGC GCGGCGATGG GCATCGATTT CAAGACGCCG GGCGAGCTGT ACGTCGCGCT GTTGAAGAGT ACTTCGATCG AGGACAGCCT GATCCGGCAG TTCGGCCTGC GCAAGCGATA TCGCGTCGAC ACGATGCATG CCGCGCGCAA GGCGTTGCAG TCGCGCGTGA GCGTCACGCT CGACAAGAAA TCCGGCCTGC TGACCATCGC GGCCGACGAC ACCGACCCGG CCGTGGCGGC GGATCTGGCG AACGCGCACG TCGCGGCGCT CGCGAAGCTG CTCGAGCGCA TTGCGGTGAC GCAGGCGCAG CAGCGGCGCG CGTTTCTCGA AAAGGAGGTG GACAAATCGC GTGTCGCGCT CGCCAATGCA CAGGATGCGT ATGTCAAGTT GCAGGCGAAA TCCGGCGTCG TCAGCGTCGA CGCCGACACG CAACTCGCGA TCCGGCACAG CGCGGAGATC CGATCGCTGC TGGCCGCCAA GCAGATCGAG CTGAGCTCGA TCGGCACTTA TGCGACGGCC GAGAATCCGC AGGTCAAACG GATCGAAGCC GAGGTGTCGA CGCTCAAGGC GCAGCTCGAG AAGATCGAGA ACGGCGACGC CGCGTCGCTC AGGGGATCGG ATGCGGGCAT GGCCACGCTG CGCAGCTACC GCGAAATGAA GTATCAGGAG AACGTCGTCG ACGTTTTGTC GAGGCAGCTC GAGCTGGCGC GCGTCGACGA GGCGAAGAGC GGGCCGCTCG TGCAGCAGGT CGACGTGGCC GCGCCGCCCG AGCGCAAGGC CAAGCCGTCG CGGCTGCTGA TCCTGCTCGC GAGTTTCGCG GGGGGCTTCG TGCTGGCGGT GACGGTCGTC ATCGGCAGGG AGTTCGGCAG GCAGGCGGTC GACCACGCGC GCAGAAGCGG GGATCTCGCG CGCCTCAAGC ACGCGTGGGC GATAACTTTC AAGAGGACGC GATCGTGA
|
Protein sequence | MAELESGGAG LDDVERRALR GPALGRADVL IALGHGKGLI ARIVAAAVLL GIALALVLPP IYEARTVLLP PDESRGLFGH SLGSLDVIAG AAMGIDFKTP GELYVALLKS TSIEDSLIRQ FGLRKRYRVD TMHAARKALQ SRVSVTLDKK SGLLTIAADD TDPAVAADLA NAHVAALAKL LERIAVTQAQ QRRAFLEKEV DKSRVALANA QDAYVKLQAK SGVVSVDADT QLAIRHSAEI RSLLAAKQIE LSSIGTYATA ENPQVKRIEA EVSTLKAQLE KIENGDAASL RGSDAGMATL RSYREMKYQE NVVDVLSRQL ELARVDEAKS GPLVQQVDVA APPERKAKPS RLLILLASFA GGFVLAVTVV IGREFGRQAV DHARRSGDLA RLKHAWAITF KRTRS
|
| |