Gene BURPS1106A_A1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1627 
SymbolaraF 
ID4905545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1596916 
End bp1597917 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content70% 
IMG OID640144733 
ProductL-arabinose ABC transporter, periplasmic L-arabinose-binding protein 
Protein accessionYP_001075661 
Protein GI126457278 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.569362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGC GCTGGCCCCA AGCCGCCCTC GTCTGCGCGA GCCTCGCCGC CGGTTTGTCG 
GCGGCGGCGC CCGCGCATGC GCAAGGCGCG GCCCCGGTGA AGATCGGCTT CGTCGTCAAG
CAGCCCGACG ACCCGTGGTT TCAGGACGAA TGGCGCTTCG CCGAGCAGGC GGCGAAGGAC
AAGCACTTCA CGCTCGTGAA GATCGCCGCG CCGAGCGGCG AGAAGGTGTC GACCGCGCTC
GACAGCCTCG CCGCGCAAAA GGCGCAGGGT GTGATCATCT GCGCGCCCGA CGTGAAGCTC
GGCCCCGGCA TCGCCGCGAA GGCGAGGCGC TACGGGATGA AGCTGATGTC GGTCGACGAT
CAGCTCGTCG ACGGGCGCGG CGCGCCGCTC GCCGACGTGC CGCACATGGG CATTTCCGCA
TACCGGATCG GCCGGCAGGT CGGCGACGCG ATCGCCGCCG AGGCGAAGCG GCGCGGCTGG
AATCCGGCCG AGGTCGGCGT GCTGCGCCTC GCGTACGACC AGTTGCCGAC CGCGCGCGAG
CGCACGACGG GCGCGGTCGA TGCGCTGAAG GCCGCGGGCT TCGCGGCGGC GAACGTCGTC
GACGCGCCGG AGATGACGGC CGATACCGAA GGCGCGTTCA ACGCCGCGAA CATCGCGTTC
ACCAAGCATC GGAACTTCAA GCACTGGGTG GCGTTCGGAT CGAATGACGA CACGACGGTC
GGCGCGGTGC GCGCGGGCGA GGGGCGCGGC ATCGGGGCGG ACGACATGAT CGCGGTCGGC
ATCAACGGCA GCCAGGTCGC GCTGAACGAA TTCGCGAAAC CGAAGCCGAC GGGCTTTTTC
GGCTCGATCC TGCTGAATCC GCGGCTGCAC GGCTACGACA CGTCGGTCAA CATGTACGAC
TGGATCACGC AGAACCGCGC GCCGCCGCCG GTCGTGCTGA CGTCCGGCAC GCTGATCACG
CGCGCGAACG AAAAGACGGC GCGCGCGCAG CTCGGGCTGT GA
 
Protein sequence
MGLRWPQAAL VCASLAAGLS AAAPAHAQGA APVKIGFVVK QPDDPWFQDE WRFAEQAAKD 
KHFTLVKIAA PSGEKVSTAL DSLAAQKAQG VIICAPDVKL GPGIAAKARR YGMKLMSVDD
QLVDGRGAPL ADVPHMGISA YRIGRQVGDA IAAEAKRRGW NPAEVGVLRL AYDQLPTARE
RTTGAVDALK AAGFAAANVV DAPEMTADTE GAFNAANIAF TKHRNFKHWV AFGSNDDTTV
GAVRAGEGRG IGADDMIAVG INGSQVALNE FAKPKPTGFF GSILLNPRLH GYDTSVNMYD
WITQNRAPPP VVLTSGTLIT RANEKTARAQ LGL