Gene BURPS1106A_A2337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2337 
Symbol 
ID4903350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2314466 
End bp2315485 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content73% 
IMG OID640145442 
Productputative lyase 
Protein accessionYP_001076370 
Protein GI126457117 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2301] Citrate lyase beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCGC TCACACCCGC AGAAGTGCTG TACGAAGGCG TTGCGCCGCC CGCGATCCTG 
CCGTGCTGCG ATCATTACGC GGGCAGCGAG AAGCTGATGC TCAAATCGCT CGCGCTGCAG
GCCGAGCTCG GTCCCGTCTT CGACATCACG CTCGACTGCG AGGACGGCGC GGCCGTCGGC
CGCGAGGCCG AGCACGCGGC GCGCGTCGCC GCGCTCGTCG GCGGCGAGGC GAACCGCTTC
GGGCGCGTCG GCGTGCGTAT CCACGACATT TCCCACCCTC ACTGGCGCGA CGACGTGCGC
GTCGTCCTGC GCGCGGCGCG CCCGCCCGCG TACCTGACGC TGCCGAAGGT CGGCGGCGCG
GCCGACGCGG CCGAAATGTG CGCGTTCATC GAGGCGTCCC GCGTCGAGCT CGGCATCGCG
CAGCCGATCC CCGTCGACGT GCTGATCGAG ACGCACGGCG CGCTCGCCGA CGCCGCGCGG
ATCGCCGCGC TGCCGATCGT CGCGACCCTG AGCTTCGGCC TGATGGATTT CGTATCCGCG
CATCACGGCG CGATTCCGGA CGACGCGATG CGCGCGCCCG GCCAGTTCGA CCACCCGCTC
GTGCGCCGCG CGAAGCTCGA GATCGCCGCC GCGTGCCACG CGCACGGCAA GACGCCGTCG
CACAACGTGA CGACCGAGGT ACGCGACATG CGCGTCGTCG CGAACGACGC GCGCCGCGCC
CGCGAGGAAT TCGGCTACAC GCGGATGTGG AGCATCCACC CGGCGCAGAT CCGCGAGATC
GTCGCCGCGT TCGCGCCGCG CGCGGACGAC ATCGCGCGCG CGAGCCGCAT CCTGCTCGCC
GCGCAGGCGG CCGACTGGGG CCCGACGCGG CATGACGACG CGCTGCACGA CCGCGCGAGC
TACCGCTACT ACTGGGCGGT GCTGCGCCGC GCGCGCGCGA CCGGCCAGCC GCTGCCCGCC
GAGTCGGCGC CGCTCTTCGG CGACGCCGGC GAACGGGCCG CGCGGGGACG CGAAAAATGA
 
Protein sequence
MSALTPAEVL YEGVAPPAIL PCCDHYAGSE KLMLKSLALQ AELGPVFDIT LDCEDGAAVG 
REAEHAARVA ALVGGEANRF GRVGVRIHDI SHPHWRDDVR VVLRAARPPA YLTLPKVGGA
ADAAEMCAFI EASRVELGIA QPIPVDVLIE THGALADAAR IAALPIVATL SFGLMDFVSA
HHGAIPDDAM RAPGQFDHPL VRRAKLEIAA ACHAHGKTPS HNVTTEVRDM RVVANDARRA
REEFGYTRMW SIHPAQIREI VAAFAPRADD IARASRILLA AQAADWGPTR HDDALHDRAS
YRYYWAVLRR ARATGQPLPA ESAPLFGDAG ERAARGREK