Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1954 |
Symbol | |
ID | 3847399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2207737 |
End bp | 2209038 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637841623 |
Product | hypothetical protein |
Protein accession | YP_442483 |
Protein GI | 83718863 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACA GCGCGCAGGA CAGCATCGAC GAAGCAACGC GGCCCCCGCG CGTCTCGTGG CTCGCGACGC TGCGCGGTCC GTTTGCGTAC CGCACGTTCG CTTCGATCTG GATCGCGAGC CTCGTCGGCA ACATCGGCGG ATCGATTCAG ACGGTTGCGG CGTCGTGGCT GATGACGTCG ATGGCGCCGT CGCCGGCGAT GGTCTCGCTC GTGCAGACGG CGTTCACGCT GCCGATCGCG CTGTTCGCGC TGATGTCGGG CGTCGCCGCC GATGCGTGGG ATCGCCGCAC GGTGATGCTG CTGTCGCAGG CGCTGATGTT CTCGGTCGCG CTGTGCCTGG TCGCGCTCGC GGTCGCGGGC GCGATGACGC CGATGCGCCT GCTCGTGTGC ATGTTCGTCG GCGGCTGCGC GGGCGCGATG TTCCAGCCCG CGTGGCAGTC CGCCGTGACC GAGCAGGTGC CGGCGCACGA GCTGTCCGCG GCGATCGCGC TCGACAGCTT CTCGATGAAC TTCGCGCGCA CCGCCGGGCC GGCCCTGGGC GGCTTCGTCG TCGCATCGGT GTCGCCGAAT GCGGCGTTCG TTCTCAGCGC ACTGTCGTAC GCGGGTCTCA TCTACGTGCT GTCGCGGTCG ATTCGCGGAG CCGCCGCGAG AACGCCCGCG CGGGCGCGTC TCGCGACGAT GCTGATGCAG GGCGTTCGCT ATTGCTGCCG CACGCCCGGC ATTCGCGGCA CGTTGATTCG CAGCAGCCTG TTCGGGTTGC TCGGCAGCCC CGTCTGGGCG CTGCTGCCGC TCTTCGCGAA GACGCAGTTC GGCGGCGAGG CGCGCACCTA CGGAATCCTG CTCGCATCGT TCGGCGCGGG CGCGGCGTCC GGCGCGCTGG GCGGCGCGGC ATGGCGCGCG CGACTCGGCC GCGAGGCGCT GATCCGGCTG TGCACGCTCA CGTTCGCCGC CGGCATGCTG GCGACGGCGT GGAGCCCGTG CCAGGCGGTC GCGATGCTCG GCCTCGCCGT CGCGGGCGGG AGCTGGGTCG TGGTCGTGTC GACCTACAAC CTCACGATTC AGATGGCGTC GCCTGCGTGG GTGGCGGGGC GATCGCTGTC GCTGTTTCAC TCGTTCATCG TCGGCGGGCT GTCGATCGGC AGTTACCTGT GGGGCGTCGC CGCAACGGGC AGTTCGATCA ACTCGGCATT CGCGGTATCG GCGCTGATGA TGGCCGCGTC GGCGTGTCTC GCGGCATGGC TGCCGTTGCC GACGCGCGAG GCGATCGACG AGCGCGCGCA CGGCGAGCCG CAACGGACAT GA
|
Protein sequence | MTDSAQDSID EATRPPRVSW LATLRGPFAY RTFASIWIAS LVGNIGGSIQ TVAASWLMTS MAPSPAMVSL VQTAFTLPIA LFALMSGVAA DAWDRRTVML LSQALMFSVA LCLVALAVAG AMTPMRLLVC MFVGGCAGAM FQPAWQSAVT EQVPAHELSA AIALDSFSMN FARTAGPALG GFVVASVSPN AAFVLSALSY AGLIYVLSRS IRGAAARTPA RARLATMLMQ GVRYCCRTPG IRGTLIRSSL FGLLGSPVWA LLPLFAKTQF GGEARTYGIL LASFGAGAAS GALGGAAWRA RLGREALIRL CTLTFAAGML ATAWSPCQAV AMLGLAVAGG SWVVVVSTYN LTIQMASPAW VAGRSLSLFH SFIVGGLSIG SYLWGVAATG SSINSAFAVS ALMMAASACL AAWLPLPTRE AIDERAHGEP QRT
|
| |