Gene Bcep18194_B3011 details


Gene Information

Locus tag: Bcep18194_B3011
Symbol:
ID: 3754778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 3399934
End bp: 3400953
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 637767858
Product: AraC family transcriptional regulator
Protein accession: YP_373765
Protein GI: 78063857
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
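
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive, and that the coding sequence includes its terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the last codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3399934, 3400953)
print(length)                     # 1020
print(protein_length_aa(length))  # 339
```

Under these assumptions the computed values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields of the record.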


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.476287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.587833
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGATA CCGACCGTTA TACGACGGCG AATCTGCCGG TCCATCTGTT GCGGTGCCTC 
GCGGAGACAA GCAAGGAGCT GGGCATCGAC CCCACGCGGC TGTGCCTCGG GCTCGGCTTC
GACGTCGCGG ACCTGTCGAA TCCGTCGTGC CGGATTTCCC TGCGTCAGGC GAGCACGATG
ATCCGCCGCG CGCTCGACAT GGCGCCGGGG CGGGCGCTCG GCCTCGAACT CGGCACGAGC
GAGACGATCG CGTCGATCGG CCTGGTCGGC TATGCGATGC TGACGAGCCC GACGCTGAAG
GATGCGATCT CCGTCGGGAT GGAACTGCAG CGCCACACGG GGCCGCTGAT GCGCTTCGAG
GTGATCTCGG ATGCGCGCAC GCTGTCGATC CGCGCGACCA ACGTCTTTCT CGAACCCGAC
ATCGAGGCGT TCCTCGTCGA GGAAGCGTTC GGCAGCTTCA TGAAGATCGG GCGCTCGCTC
GTCGGCCCCG CGTTCCAGCC GAAGGTCGTC GATCTCAGCT ACCCGCCGCC GGCCTATGCG
GAGCAATACA CGCGCGTGTT CCCGTGCCCG GTGCGGTTCG AACAGGAGCA GAACCTGTTT
TCATGCGACG CGGCGCTCGG CAACCGCCCG ATCGCGACCC ACGATCCGCT CGCGCATCGC
CAGGCGCTCG AATTCCTGCA GGACGCGCTG CCGCCCGAAC CCGAAGGCAC CGAGTTTCTC
GAATCGATCG AACGGATCAT GCGGCGCGAC CTGCGGCATG CGCCGTCGCT CGCCGAAATC
GCCGCGCAGC TGTGCATGAG CGAGCGCACA CTCCGCCGGC GGCTTGCCGA CCAGGGCGTG
TCGTATCAGA CGGTGATCGA CACGATCCGC AGGAAGCGCG CGTTCACGCT GCTGAGCAAC
CCGCGGCTGT CGATCGAGGA CGTCGCGCAC GAAGTCGGGT TCAGCGACGC GCATAACTTC
CGGCGCGCGT TCAAGCGGTG GACGGGGCAC GGGCCGCGCG AAGGACAGCG GCCGGCGTAG
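
The GC content field of the record can in principle be recomputed from the gene sequence above. The sketch below is one way to do that in Python. It assumes the sequence is pasted as a string in the same layout (blocks of 10 bases separated by spaces and line breaks), and `gc_percent` is a hypothetical helper name. How the database rounds the reported value is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored so the blocked
    layout used in the record can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on the first line of the gene sequence only (illustration):
first_line = "ATGGACGATA CCGACCGTTA TACGACGGCG AATCTGCCGG TCCATCTGTT GCGGTGCCTC"
print(round(gc_percent(first_line), 1))
```

Running the function on the full 1020 bp sequence, rather than on a single line, would give the value to compare with the GC content field.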
 
Protein sequence
MDDTDRYTTA NLPVHLLRCL AETSKELGID PTRLCLGLGF DVADLSNPSC RISLRQASTM 
IRRALDMAPG RALGLELGTS ETIASIGLVG YAMLTSPTLK DAISVGMELQ RHTGPLMRFE
VISDARTLSI RATNVFLEPD IEAFLVEEAF GSFMKIGRSL VGPAFQPKVV DLSYPPPAYA
EQYTRVFPCP VRFEQEQNLF SCDAALGNRP IATHDPLAHR QALEFLQDAL PPEPEGTEFL
ESIERIMRRD LRHAPSLAEI AAQLCMSERT LRRRLADQGV SYQTVIDTIR RKRAFTLLSN
PRLSIEDVAH EVGFSDAHNF RRAFKRWTGH GPREGQRPA
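
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The following is a minimal Python sketch, not the database's own pipeline. It uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. It does not model table 11's alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino-acid string of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an unspliced CDS; a terminal stop codon is dropped."""
    dna = "".join(dna.split()).upper()  # remove block spacing and newlines
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    if residues and residues[-1] == "*":
        residues.pop()
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues:
print(translate("ATGGACGATA CCGACCGTTA TACGACGGCG"))  # MDDTDRYTTA
```

Applied to the full gene sequence, the result can be compared with the protein sequence after removing the spaces and line breaks from both.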