Gene BURPS668_A1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1008 
Symbol 
ID4888024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp976862 
End bp977839 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content70% 
IMG OID640130948 
ProductAraC family transcriptional regulator 
Protein accessionYP_001062007 
Protein GI126445286 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.351707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAGC GCCACACCCG ACTCGAATCC GCCGCGCACG CGCCGCCCCG GCCCGATGCG 
CAAACGCTTG CGCCGCGCGA GGCCGCGCGC CGCGAGCTCG CCGCGCTGAT CGAGCGCTTC
GCGCCCGCCG ACGGCGCGCA CCCGAGCGCG ATTCCCGCGC TGTCGTTCTT TCGCTGCTCG
TCGCCCGTCG ATCTCGGCTG CAGCGTCACG CGCGCCGCGT TCGTGTTCGC CGCGCAGGGC
GCGAAGCGGG TAACGGTCGC GGGGCAGGCG TACGAATACG ATCATCAGCA GTGCCTCGTC
ACGTCGGTCG ATCTGCCGAT GCTGTCGCAG GTCACGCGCG CGTCGGCCGG CGCGCCGTAT
CTGTGCGTGA AGGTCGCGCT CGACGTGCAG CGCATCGCCG AGCTCTCGGC CGAGATGCGG
ATGCCGCCGC CGGAGGCGGT GCCCACGGGC GAGGGAATCG TCGTCGGCGC GCTGTCCGAG
CCGCTTTTCG ACGCGGCGCT GCGGCTCGTG CGATTGCTCG ATACTCCAGC CGACATCCCG
ATCCTCGCGC CGCTGATCGA AAAGGAGCTG CTGTACCGGC TGCTGACGAG CGGGCTGGGC
GCGCGGCTGC GGCACATCGC GGTCGCGGGC AGCCAGACGT ACCGGATCGC GCGTGCGATC
GAATGGCTTC GTCATCACTA CACGGAGCCG CTCAGGGTCG AGACGCTCGC GCAGCAGGTC
AATATGAGCG TGTCGTCGCT GCATCATCAC TTCAAGCACG TGACGACGCT CAGCCCGCTC
CAGTATCAGA AGCAACTGCG GCTGCACGAG GCGCGCCGGC TGCTGCTCGG CCAGCACGGC
GACGTCGGTT CGGTCGCGCT CAGGGTCGGA TACGACAGCC CGTCGCAGTT CAGCCGCGAA
TACAGCCGGC TGTTCGGCGC GCCGCCGTTG CGCGACGTCG TGCAACGGCG GCGCAACGGG
ACGGGCGTTC AGGAGTGA
 
Protein sequence
MDQRHTRLES AAHAPPRPDA QTLAPREAAR RELAALIERF APADGAHPSA IPALSFFRCS 
SPVDLGCSVT RAAFVFAAQG AKRVTVAGQA YEYDHQQCLV TSVDLPMLSQ VTRASAGAPY
LCVKVALDVQ RIAELSAEMR MPPPEAVPTG EGIVVGALSE PLFDAALRLV RLLDTPADIP
ILAPLIEKEL LYRLLTSGLG ARLRHIAVAG SQTYRIARAI EWLRHHYTEP LRVETLAQQV
NMSVSSLHHH FKHVTTLSPL QYQKQLRLHE ARRLLLGQHG DVGSVALRVG YDSPSQFSRE
YSRLFGAPPL RDVVQRRRNG TGVQE