Gene BTH_II2133 details


Gene Information

Locus tag: BTH_II2133
Symbol:
ID: 3844545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007650
Strand:
Start bp: 2620286
End bp: 2621482
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 69%
IMG OID: 637839434
Product: succinylglutamate desuccinylase/aspartoacylase family protein
Protein accession: YP_440321
Protein GI: 83717173
COG category: [R] General function prediction only
COG ID: [COG3608] Predicted deacylase
TIGRFAM ID:
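
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2620286, 2621482)
print(length, protein_length_aa(length))  # prints: 1197 398
```

Both results match the Gene Length and Protein Length fields of the record.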


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGGCCGC GACGATCCAC GTACAATCGT TCATTCACCG ACATCGAACG ACGCGCGGCG 
GCTGCGCCGC TGTCTCCCAT CATGCAAACG CACACCCATC CGCTGATCTC GCCGGCCGTC
GGCACGGCGC GCCACATCAC GAGTTTTCAT TACGGCCCGC GCGGCGGGAA GAAGGTGTAC
ATCCAGGCGT CGCTGCACGC GGACGAACTG CCCGGCATGC TCGTCGCGAC GCTGCTGCGC
CGCAAGCTCG CGGCGCTCGA GGCGGCGGGC AAGCTGCGCG ACGAGATCGT CGTCGTGCCG
GTCGCGAATC CGATCGGCCT CGCGCAGCAC GTATTCGGCG ATCATCTCGG CCGCTTCGAG
CTCGGCTCGA TGCAGAACTT CAACCGCAAT TTCCACGATC TCGCCGCGCT CGTGATCCCG
CGCATCGAAG GGCATCTCAC GCACGACGCG AACGCGAACC TCGCCGCCGT GCGACGCGCG
ATGGGCGAGG CGCTCGACGA GCAGAAGCCG CGCACCGAGC TCGAATCGCA GCGGCTCGCG
CTGCAGCGGC TGTCGTACGA CGCCGACATC GTGCTCGATC TGCACTGCGA CTGCGACGCG
GTGATGCACA TCTACACGAA TCCGGACCTG TGGGAAGACG TCGAGCCGCT GTCGCGCTAT
CTGGGCGCGA AGGCGTCGCT GCTCGCGCTG AATTCGGTCG GCAATCCGTT CGACGAGATC
CACAGCTTCT GCTGGTCCGA GCTGCGCCAG CGCTTCGGCG AGCGCCATCC GATTCCGAAC
GGCACGATCT CGGTGACGGT CGAGCTGCGC AGCGAGCGCG ACGTGTCGTA CGAGCTCGCC
GAGCACGACG CGCAGGCGCT CATCGAATAT CTGACGCTGC GCGGCGCGAT CGACGGCACG
CCCGCGCCGC AGCCGCCGCT CGAATTCGCG GCGACGCCGC TCGCGGGCAC CGATCCGCTC
GTCGCGCCGG TGTCGGGGGT GATCGTGTTC CGCACGCCGG TCGGCGTGTG GATCGACGCG
GGCCAGGACG TGGCCGACAT CGTCGATCCG CTGACCGATC GCGTCGTCAC GCTCAAGAGC
AGCGTGTCCG GCGTGCTGTA CGCGCGGCAG ATCGCGCGTT TCGCGACGGC CGGGATGGAA
GTCGCGCGGA TCGCCGGCGC GACGCCGATC CGCGCCGGGT CGCTGCTGTC GGCCTGA
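
The gene sequence is laid out in space-separated blocks of ten bases, so the spacing has to be removed before any computation such as the GC content reported in the record. A minimal sketch of that clean-up, using only the first row of the sequence as sample input; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, pasted with its original spacing.
first_row = "GTGCGGCCGC GACGATCCAC GTACAATCGT TCATTCACCG ACATCGAACG ACGCGCGGCG"
seq = clean_sequence(first_row)
print(len(seq), f"{100 * gc_fraction(seq):.0f}% GC")
```

Passing the full set of sequence rows to `clean_sequence` in the same way gives a string that can be compared with the Gene Length and GC content fields.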
 
Protein sequence
MRPRRSTYNR SFTDIERRAA AAPLSPIMQT HTHPLISPAV GTARHITSFH YGPRGGKKVY 
IQASLHADEL PGMLVATLLR RKLAALEAAG KLRDEIVVVP VANPIGLAQH VFGDHLGRFE
LGSMQNFNRN FHDLAALVIP RIEGHLTHDA NANLAAVRRA MGEALDEQKP RTELESQRLA
LQRLSYDADI VLDLHCDCDA VMHIYTNPDL WEDVEPLSRY LGAKASLLAL NSVGNPFDEI
HSFCWSELRQ RFGERHPIPN GTISVTVELR SERDVSYELA EHDAQALIEY LTLRGAIDGT
PAPQPPLEFA ATPLAGTDPL VAPVSGVIVF RTPVGVWIDA GQDVADIVDP LTDRVVTLKS
SVSGVLYARQ IARFATAGME VARIAGATPI RAGSLLSA
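
The gene sequence begins with GTG, yet the protein begins with M. This is consistent with translation table 11, which assigns amino acids to codons exactly as the standard code does but accepts alternative start codons such as GTG, with the initiating codon read as methionine. The sketch below illustrates this on the first 30 bases of the gene. The codon table is built by hand here rather than taken from a library, and the function name is illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # the initiating codon always encodes methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate_cds("GTGCGGCCGC GACGATCCAC GTACAATCGT"))  # prints: MRPRRSTYNR
```

The output matches the first block of the protein sequence above.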