Gene Bcen_4401 details

Gene Information

Locus tag: Bcen_4401
Symbol:
ID: 4094714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia AU 1054
Kingdom: Bacteria
Replicon accession: NC_008061
Strand:
Start bp: 1632231
End bp: 1633421
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 66%
IMG OID: 638017688
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_624256
Protein GI: 107026745
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
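
The coordinate and length fields in this record can be checked against each other with a little arithmetic. The sketch below is illustrative only. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record. The variable names are hypothetical.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 1632231
end_bp = 1633421

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1191
print(protein_length_aa)  # 396
```

Under those assumptions the results agree with the Gene Length and Protein Length fields.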


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACGACT CCCTCAACTT CGACACGCTT GCCGTGCGCG CGGGCACGCT GCGCAGCGAC 
TTCAACGAGC ATTCGGAAGC GCTGTTCCTC ACGTCGAGCT TCTGCTTCTC GAGCGCGGCC
GACGCGGCCG AGCGCTTCGC GAATTCGGAA GACTATTTCA CCTATTCGCG CTTCACGAAT
CCGACCGTCA CCATGTTCCA GGAGCGTCTC GCGGCGCTCG AGGGCGGCGA GGCGTGCATC
GCGACGGCGT CGGGCATGGC CGCGATCATG TCGGTCGTGA TGTCCGCGCT GCAGGCAGGA
GACCACCTCG TCAGCTCGCG CAGCCTGTTC GGCTCGACGC TCGGGATGTT CTCGCAGATC
TTCAGCAAGT TCGGGATCAC GACGACCTTC GTCGACCCGA CCGACCTGAA CGCGTGGCAG
GAAGCGGTGC GGCCGGAAAC GAAGATGTTC TTCCTCGAGA CGCCGTCGAA CCCGCTGACC
GAGCTCGCCG ACATCGAGGC CATCGGCAAG ATCGCGAAGG CGGCGAACGC GCTGTTCGTC
GTCGACAACT GTTTCTGCAG CCCGGTACTG CAGCAGCCAC TGAAGCTCGG CGCGGATGTC
GTGATGCACT CCGCAACGAA ATTCCTCGAC GGGCAGGGCC GCGTGCTCGG CGGCGCACTG
GTCGGCTCGA AGGAATTCAT CATGGGCAAG GTGTTCCCGT TCGTGCGCAG CGCGGGCCCG
ACGCTGTCGG CGTTCAACGC GTGGGTGCTG CTGAAGGGGG TGGAGACGCT GTCGCTGCGC
GTCGAGAAGC AGTCGGCGAA CGCGCTGGAG ATCGCGCGCT GGCTCGACTC GCATCCGGCG
GTGGCGCGCG TGTTCTATCC GGGGCTCGAA TCGCATCCGC AGCATGAACT CGCGAAGCGT
CAGCAGAAGG CGGGCGGTGC GATCGTGTCG TTCGAGCTGA AGGGCGACAC GCCCGAGCAG
CAGCGCGCGA ACGCATGGCG CGTGATCGAC GGCACGAAGC TGGTGTCGAT CACCGGCAAC
CTCGGCGACA CGCGTACGAC GATCACGCAT CCGGCCACCA CGACGCACGG CCGGATTACG
CCGGAAGCGC GTGCGGCGGC GGGGATCACC GAAGGGTTGA TCCGCCTCGC GGTCGGCCTG
GAGCATGCGG GCGACATTCG CAACGATCTG GCGCGTGGTC TGGACGGCTG A
 
Protein sequence
MDDSLNFDTL AVRAGTLRSD FNEHSEALFL TSSFCFSSAA DAAERFANSE DYFTYSRFTN 
PTVTMFQERL AALEGGEACI ATASGMAAIM SVVMSALQAG DHLVSSRSLF GSTLGMFSQI
FSKFGITTTF VDPTDLNAWQ EAVRPETKMF FLETPSNPLT ELADIEAIGK IAKAANALFV
VDNCFCSPVL QQPLKLGADV VMHSATKFLD GQGRVLGGAL VGSKEFIMGK VFPFVRSAGP
TLSAFNAWVL LKGVETLSLR VEKQSANALE IARWLDSHPA VARVFYPGLE SHPQHELAKR
QQKAGGAIVS FELKGDTPEQ QRANAWRVID GTKLVSITGN LGDTRTTITH PATTTHGRIT
PEARAAAGIT EGLIRLAVGL EHAGDIRNDL ARGLDG
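
The protein sequence can be cross-checked against the gene sequence by translating it. What follows is a minimal sketch, assuming the codon assignments of NCBI translation table 11 (which match the standard code for sense and stop codons) and assuming that the spaces and line breaks in the listing are formatting only. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and do not belong to any database tool. Only the first ten codons of the gene are used here.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...);
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace from a blocked sequence listing and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_codons = "ATGGACGACT CCCTCAACTT CGACACGCTT"
print(translate(first_codons))  # MDDSLNFDTL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the listed protein sequence and the GC content field.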