Gene BURPS1106A_A2398 details


Gene Information

Locus tag: BURPS1106A_A2398
Symbol:
ID: 4903812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2375842
End bp: 2376861
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 68%
IMG OID: 640145503
Product: carboxymethylenebutenolidase
Protein accession: YP_001076430
Protein GI: 126458022
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0412] Dienelactone hydrolase and related enzymes
TIGRFAM ID:
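
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is assumed to be a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(2375842, 2376861)
print(length)                     # 1020
print(protein_length_aa(length))  # 339
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of the record.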


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGCATCG TGTGCGGGTT CCGCCTCGGC GGGGCGGGCG GCGGCGCGCG GTACGACTGC 
ATCGCGCCGG CCGCCGTGAT TCGAATTCAT GTCGGCGACG CGTGTGCCGC GTGTGCGCGA
CAGCGAGAAG AGAGGAGCGC ATCCATGTTG AAACCCGAAG TCGACAGTCT AGTTCCGCAC
GTTCCGTTCA GCCGCCGCAA GTTCGTCCAG GCGGCGCTCG GCGGCACGTT CGCCGCGGCG
GTGCTGCCCG TGTCCGCGCA GACGATCACC ACCGATGCCG CCGGCCTAGA CGTCGACACC
GTCCAGATCC GCTCGGGCGA CGCGAGCGTG CCCGCCTACC GCGCGCAGCC GGACGGCAAG
AGCAATCTGC CGGTGATCGT CGTGATCCAC GAGGTGTTCG GCGTGCATGC GCACATCGCC
GACATCTGCC GGCGCTTCGC GAAGCTCGGC TATCTGGCGA TCGCGCCGGA TCTGTATGCG
CGGCAGGGCG ATCCGTCGAA GCATGCGTCG ATCCAGGAGC TGATCGCTCA GGTGGTCAGC
AAGGTGCCCG ACCGTCAGGT GATCGAGGAT CTCGACGCGA CGGTGCGATG GGCGGGCAAG
AACGGCGGGG ACCTGTCGCG GCTCGGCGTG ACGGGTTTTT GTTGGGGCGG CCGGCAGACG
TGGCTTTTCG CCGAGCACAA TCCGCACGTG CGCGCCGCTG TCGCGTGGTA CGGCAAGGTG
GCCGGCGAGA CGAACGAGAT GACGCCGTTC AATCCTGACG ATCATGCGGC GCAACTGAAG
GCGCCGACGC TCGGCCTCTA CGGCGGCAAG GACGACAGCA TTTCGCAGCG CTCGCTCGCG
CGGATGCGCG AGCGTCTCGC GGCGGCCGGC ACGCAGGCCG CGCGCGAATC GGAAATCGTC
GTGTATCCGG ATGCGGGCCA CGCGTTCTTC GCCGATTACC GCCCGAGTTA CGTGAAGGCC
GATGCCGACG ACGGCTGGAA GCGCGCTATC GCATGGTTTC GGCATCACGG CGTGATGTGA
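
The record reports a GC content of 68% for this gene. As a minimal sketch of how such a figure can be computed from the sequence text, the hypothetical helper below ignores the spaces and line breaks used in the listing and counts G and C among the A/C/G/T characters. The database's exact rounding convention is not stated in the record, so the helper returns an unrounded percentage.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, used as a small worked example.
first_block = "TTGGGCATCG"
print(gc_percent(first_block))  # 60.0
```

Passing the full gene sequence, pasted as a single string, to gc_percent gives the value for the whole gene.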
 
Protein sequence
MGIVCGFRLG GAGGGARYDC IAPAAVIRIH VGDACAACAR QREERSASML KPEVDSLVPH 
VPFSRRKFVQ AALGGTFAAA VLPVSAQTIT TDAAGLDVDT VQIRSGDASV PAYRAQPDGK
SNLPVIVVIH EVFGVHAHIA DICRRFAKLG YLAIAPDLYA RQGDPSKHAS IQELIAQVVS
KVPDRQVIED LDATVRWAGK NGGDLSRLGV TGFCWGGRQT WLFAEHNPHV RAAVAWYGKV
AGETNEMTPF NPDDHAAQLK APTLGLYGGK DDSISQRSLA RMRERLAAAG TQAARESEIV
VYPDAGHAFF ADYRPSYVKA DADDGWKRAI AWFRHHGVM
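
The protein sequence begins with M although the gene sequence begins with TTG, a codon that normally encodes leucine. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of the permitted start codons and an initiator codon is read as methionine. The sketch below illustrates this on the first 30 bases of the gene. The codon table is built from the standard NCBI ordering, and the translate function is a simplified illustration, not the pipeline the database uses.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("TTGGGCATCG TGTGCGGGTT CCGCCTCGGC"))  # MGIVCGFRLG
```

The output matches the first ten residues of the protein sequence listed above.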