Gene Bcep18194_C7696 details


Gene Information

Locus tag: Bcep18194_C7696
Symbol:
ID: 3734583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007509
Strand:
Start bp: 1342940
End bp: 1343965
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 66%
IMG OID: 637761397
Product: LysR family transcriptional regulator
Protein accession: YP_367384
Protein GI: 78060809
COG category: [K] Transcription
COG ID: [COG0583] Transcriptional regulator
TIGRFAM ID: [TIGR03418] putative choline sulfate-utilization transcription factor
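
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record; function names are hypothetical helpers.
START_BP = 1342940
END_BP = 1343965


def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1026 341"
```

Under these assumptions the computed values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields.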


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.349394
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGCGT CGACGAACCT GCTCGACTCA GGCGGCAATG CACGTTGCGC CGGTCCGGCG 
GCCCGCCCGC TTCGACGCGC CGCAATACGA CCGACTTCCG TCCCGAGAGA AGCCCCCGAC
ATGGTTAGCC GCTTGAAGCA CCTCCCGCCG CTGTCGTATC TCACCGCGTT CGAGGCAGCG
GCCCGGCATG AGAGTTTCAC GAGTGCCGCC GAGGAACTGT GCGTGACCCA GAGCGCGATC
AGCCGGCAGA TCCGGCTGCT CGAGGAAACG CTGGGCTGCG CGTTGTTCGT GCGGTCTCAC
AAGGCGGTGT CGCTCACCGA CGGCGGGCGG AAGTTCCAGC GGACCGTCAA TGCGGCGCTG
GATCTGCTTG CCGCCGCCGC GTACGAATTG CGCGTGCAGG CGTCGACGTC GACGGTCACG
GTGTCGGCGG ATCTCGCGAT CGCGTCGCAC TGGCTGATTC CGCGCCTGCC GAAGTTTCGG
GCCGAGCATC CGGAGATCAT CATTCACGTC GATGCGTCGG ACGAGGACAC GCGCAACATC
CGGGAAGGCG CGGACCTGGC GATCCAGTTC GGCGACGGCT ACTGGCCGGC CTGCAACGCG
CGGTTCCTGC TCGAGGAAGA GATCTTCCCG GTGTGCACGC CGGCGTATCT GGCCCGACTC
GCGCCGATGG CGCACACGCG GGATCTGCTG CGCGCGACGT TGATCCATCT CGAGACGCGT
CATTGGGACT GGATGGACTG GGCAACCTGG TTCGCGCATC ACGACATCGC GCTGACCGAG
CCGCGGCAGG ACCTTTACAT CAACAACTAT CCGGCGGTCC TGCAGGCCGC GATGGGCGGG
CAGGGCATCG CGATGGGATG GCGCTATCTG GCCGACGACA TGCTGGCGAC CGGGGTGCTG
GTGCGGCCGA TCGAGACCTC GGTGCGGACC GGCCGCGGGT TTTATCTGCT GCATCCGAGC
GATACGTTGT TGAGCCCGGA GGCGCGGATC TTCTGCGACT GGATCGTCGG GCAATGCGCG
GAATAG
 
Protein sequence
MFASTNLLDS GGNARCAGPA ARPLRRAAIR PTSVPREAPD MVSRLKHLPP LSYLTAFEAA 
ARHESFTSAA EELCVTQSAI SRQIRLLEET LGCALFVRSH KAVSLTDGGR KFQRTVNAAL
DLLAAAAYEL RVQASTSTVT VSADLAIASH WLIPRLPKFR AEHPEIIIHV DASDEDTRNI
REGADLAIQF GDGYWPACNA RFLLEEEIFP VCTPAYLARL APMAHTRDLL RATLIHLETR
HWDWMDWATW FAHHDIALTE PRQDLYINNY PAVLQAAMGG QGIAMGWRYL ADDMLATGVL
VRPIETSVRT GRGFYLLHPS DTLLSPEARI FCDWIVGQCA E