Gene BTH_II1196 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1196 
Symbol 
ID3845465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1400643 
End bp1401644 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID637838498 
ProductL-arabinose ABC transporter, periplasmic L-arabinose-binding protein 
Protein accessionYP_439392 
Protein GI83716589 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.155666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGC GTTGGCTTCA AGCCGCCCTC GTCTGCACGA GTCTCGCCGC CGGTTTGTCG 
GCGGCGGCAC CCGCGCGTGC GCAAGGCGCG GCCCCGGTGA AGATCGGCTT TGTCGTCAAG
CAGCCCGACG ACCCGTGGTT TCAGGACGAA TGGCGCTTCG CCGAGCAGGC GGCGAAGGAC
AAGCACTTCA CGCTCGTGAA GATCGCCGCG CCGAGCGGCG AGAAAGTGTC GACCGCGCTC
GACAGCCTTG CCGCGCAAAA GGCGCAAGGC GTGATCATCT GCGCGCCCGA CGTGAAGCTC
GGTCCCGGCA TCGCCGCGAA GGCGAAGCGC TACGGAATGA AGCTGATGTC GGTCGACGAT
CAACTCGTCG ACGGGCGCGG CGCGCCGCTT GCCGACGTGC CGCACATGGG CATCTCCGCC
TACCGGATCG GCCGGCAGGT CGGCGACGCG ATCGCCGCCG AGGCGAAGCG GCGCGGCTGG
AATCCGGCCG AGGTCGGCGT GCTGCGGCTC GCGTACGACC AGTTGCCGAC CGCGCGCGAG
CGCACGACGG GCGCGGTCGA CGCGCTGAAG GCGGCCGGCT TTGCGGCCGC GAACGTCGTC
GACGCGCCGG AGATGACGGC CGACACCGAA GGCGCGTTCA ACGCCGCGAA CATCGCGTTC
ACCAAGCACC GGAACTTCAG GCACTGGGTG GCGTTCGGAT CGAATGACGA CACGACGGTC
GGCGCGGTGC GCGCGGGCGA AGGCCGCGGC ATCGGCACGG ACGACATGAT CGCGGTCGGC
ATCAACGGCA GCCAGGTCGC GCTGAACGAA TTCGCGAAAC CGAAGCCGAC GGGCTTTTTC
GGCTCGATCC TGCTGAATCC GCGGCTGCAC GGCTACGACA CGTCGGTCAA CATGTACGAC
TGGATCACGC AGAACCGGAC GCCGCCGCCG CTCGTGCTGA CCTCCGGCAC GCTGATCACG
CGCGCGAACG AGAAGACGGC GCGCGCGCAG CTCGGGCTGT GA
 
Protein sequence
MGLRWLQAAL VCTSLAAGLS AAAPARAQGA APVKIGFVVK QPDDPWFQDE WRFAEQAAKD 
KHFTLVKIAA PSGEKVSTAL DSLAAQKAQG VIICAPDVKL GPGIAAKAKR YGMKLMSVDD
QLVDGRGAPL ADVPHMGISA YRIGRQVGDA IAAEAKRRGW NPAEVGVLRL AYDQLPTARE
RTTGAVDALK AAGFAAANVV DAPEMTADTE GAFNAANIAF TKHRNFRHWV AFGSNDDTTV
GAVRAGEGRG IGTDDMIAVG INGSQVALNE FAKPKPTGFF GSILLNPRLH GYDTSVNMYD
WITQNRTPPP LVLTSGTLIT RANEKTARAQ LGL