Gene BTH_II1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1221 
Symbol 
ID3844660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1443995 
End bp1445302 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content69% 
IMG OID637838523 
Productshikimate transporter 
Protein accessionYP_439417 
Protein GI83717811 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCCAG CATTCGACGC GGCCGGCACG ATCGGCACCG CGCCCGCTCA CCGCAACCAG 
GCCCGCAAGG CCGCGATCGG CAGCTTCGTC GGCGCGGTGG TCGACTGGTA CGACTTCCTG
CTGTACGGCA TCGTCGCCGC GCTCGTCTTC AACCATGCGT TCTTTCCGAA CGTCAGCCCG
ACGATGGGCA CGCTCGCCGC GTTCGCGACG TTCGGCGTCG GCTTCCTGTT CCGCCCGCTC
GGCGGCGTCG TGTTCGGCCA TTACGGCGAC CGGCTCGGCC GCAAGCGGAT GCTCGTGCTG
ACGGTGATGC TGATGGGCGT GTCGACGGTC GCGATCGGCC TGCTGCCGAC CTTCGGCACG
ATCGGCTGGT GGGCGCCCGC GCTGCTCGTC GCGCTGCGCG CGGTGCAGGG CTTCGCGGTC
GGCGGCGAAT GGGGCGGCGC GGCGCTGATG GCGGTCGAGA GCGCGCCCGA GAAAAAGAAG
GCGTTCTACA GCAGCGGCGT GCAGGTCGGC TACGGCGTCG GGCTCGTGCT CGCGACGGGC
ATCGTATCGA TCCTCGGCCA CACGCTCGGC GACGCCGCGT TTCGGGCGTG GGGCTGGCGG
CTGCCGTTCG TGTTCAGCAT CGTGCTCGTG CTGATCGGCC TGTGGGTGCG TTCGAGCATG
GACGAATCGC AGGAATTCGT CGAGAAAGTC GAGCACGGCC ACCGCAAGCT GAAGCTGCCG
GTGCTCGAGG CGCTGACCCG TCATCCGAGG GCGTTCGTCT ACATCGTCGC GCTGCGCCTC
GCCGAGTTGT TCACGATGTA CATCGTGACC GCGTTCGCGC TCAGCTATTC GACGTCGAAC
CTCGGGATGT CGCGCGACCT GTTCCTGAAC ATCGGCCTGC TCGTCGGCGC GCTGAGCTGC
GTGACGATCC CGTGCTTCGC GTGGCTCGCC GACCGCTTCG GGCTGCGCCG CGTCTACATC
GCCGGCGCGC TCGTCGGGCT CGCGTGCGCG GTGCCGTTCT TCGTCGCGCT CGAGGCGCGC
GCGACCGTCT GGATCGTGAT CTGCTCGGTG ATGCTCGCGA ACGTCGCGCA CGACATGGTC
GTGAGCGTCC AGCAGCCGCT CTTCACCGAG CTGTTCGGCG CCGAGTACCG CTACAGCGGC
GCGGGCGTCG GCTATCAGTT CGCGAGCGTG GTGGGCGGCG GCTTCACGCC GTTCATCGCG
GTCGCGCTCG TGAGCTTCGG CGGCGGCTCG TGGCATCTCG TCGCCGCGTA TCTCGCGACG
GGCTGCCTCG TCTCGACGCT CGTGGCCGCG CGGATGCGCG CCGCCTGA
 
Protein sequence
MTPAFDAAGT IGTAPAHRNQ ARKAAIGSFV GAVVDWYDFL LYGIVAALVF NHAFFPNVSP 
TMGTLAAFAT FGVGFLFRPL GGVVFGHYGD RLGRKRMLVL TVMLMGVSTV AIGLLPTFGT
IGWWAPALLV ALRAVQGFAV GGEWGGAALM AVESAPEKKK AFYSSGVQVG YGVGLVLATG
IVSILGHTLG DAAFRAWGWR LPFVFSIVLV LIGLWVRSSM DESQEFVEKV EHGHRKLKLP
VLEALTRHPR AFVYIVALRL AELFTMYIVT AFALSYSTSN LGMSRDLFLN IGLLVGALSC
VTIPCFAWLA DRFGLRRVYI AGALVGLACA VPFFVALEAR ATVWIVICSV MLANVAHDMV
VSVQQPLFTE LFGAEYRYSG AGVGYQFASV VGGGFTPFIA VALVSFGGGS WHLVAAYLAT
GCLVSTLVAA RMRAA