Gene BTH_I1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I1954 
Symbol 
ID3847399 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp2207737 
End bp2209038 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID637841623 
Producthypothetical protein 
Protein accessionYP_442483 
Protein GI83718863 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGACA GCGCGCAGGA CAGCATCGAC GAAGCAACGC GGCCCCCGCG CGTCTCGTGG 
CTCGCGACGC TGCGCGGTCC GTTTGCGTAC CGCACGTTCG CTTCGATCTG GATCGCGAGC
CTCGTCGGCA ACATCGGCGG ATCGATTCAG ACGGTTGCGG CGTCGTGGCT GATGACGTCG
ATGGCGCCGT CGCCGGCGAT GGTCTCGCTC GTGCAGACGG CGTTCACGCT GCCGATCGCG
CTGTTCGCGC TGATGTCGGG CGTCGCCGCC GATGCGTGGG ATCGCCGCAC GGTGATGCTG
CTGTCGCAGG CGCTGATGTT CTCGGTCGCG CTGTGCCTGG TCGCGCTCGC GGTCGCGGGC
GCGATGACGC CGATGCGCCT GCTCGTGTGC ATGTTCGTCG GCGGCTGCGC GGGCGCGATG
TTCCAGCCCG CGTGGCAGTC CGCCGTGACC GAGCAGGTGC CGGCGCACGA GCTGTCCGCG
GCGATCGCGC TCGACAGCTT CTCGATGAAC TTCGCGCGCA CCGCCGGGCC GGCCCTGGGC
GGCTTCGTCG TCGCATCGGT GTCGCCGAAT GCGGCGTTCG TTCTCAGCGC ACTGTCGTAC
GCGGGTCTCA TCTACGTGCT GTCGCGGTCG ATTCGCGGAG CCGCCGCGAG AACGCCCGCG
CGGGCGCGTC TCGCGACGAT GCTGATGCAG GGCGTTCGCT ATTGCTGCCG CACGCCCGGC
ATTCGCGGCA CGTTGATTCG CAGCAGCCTG TTCGGGTTGC TCGGCAGCCC CGTCTGGGCG
CTGCTGCCGC TCTTCGCGAA GACGCAGTTC GGCGGCGAGG CGCGCACCTA CGGAATCCTG
CTCGCATCGT TCGGCGCGGG CGCGGCGTCC GGCGCGCTGG GCGGCGCGGC ATGGCGCGCG
CGACTCGGCC GCGAGGCGCT GATCCGGCTG TGCACGCTCA CGTTCGCCGC CGGCATGCTG
GCGACGGCGT GGAGCCCGTG CCAGGCGGTC GCGATGCTCG GCCTCGCCGT CGCGGGCGGG
AGCTGGGTCG TGGTCGTGTC GACCTACAAC CTCACGATTC AGATGGCGTC GCCTGCGTGG
GTGGCGGGGC GATCGCTGTC GCTGTTTCAC TCGTTCATCG TCGGCGGGCT GTCGATCGGC
AGTTACCTGT GGGGCGTCGC CGCAACGGGC AGTTCGATCA ACTCGGCATT CGCGGTATCG
GCGCTGATGA TGGCCGCGTC GGCGTGTCTC GCGGCATGGC TGCCGTTGCC GACGCGCGAG
GCGATCGACG AGCGCGCGCA CGGCGAGCCG CAACGGACAT GA
 
Protein sequence
MTDSAQDSID EATRPPRVSW LATLRGPFAY RTFASIWIAS LVGNIGGSIQ TVAASWLMTS 
MAPSPAMVSL VQTAFTLPIA LFALMSGVAA DAWDRRTVML LSQALMFSVA LCLVALAVAG
AMTPMRLLVC MFVGGCAGAM FQPAWQSAVT EQVPAHELSA AIALDSFSMN FARTAGPALG
GFVVASVSPN AAFVLSALSY AGLIYVLSRS IRGAAARTPA RARLATMLMQ GVRYCCRTPG
IRGTLIRSSL FGLLGSPVWA LLPLFAKTQF GGEARTYGIL LASFGAGAAS GALGGAAWRA
RLGREALIRL CTLTFAAGML ATAWSPCQAV AMLGLAVAGG SWVVVVSTYN LTIQMASPAW
VAGRSLSLFH SFIVGGLSIG SYLWGVAATG SSINSAFAVS ALMMAASACL AAWLPLPTRE
AIDERAHGEP QRT