Gene BURPS1106A_A2058 details

Gene Information

Locus tag: BURPS1106A_A2058
Symbol:
ID: 4905136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2024297
End bp: 2025289
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 71%
IMG OID: 640145163
Product: AraC family transcriptional regulator
Protein accession: YP_001076091
Protein GI: 126455829
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGAAGA TGCACGCGCC CCGCGGCGGC ACGCTGCTGC AATTCTTTTC GACCGACGAC 
ATGCCGCTCG CGCGCGCGGC GGCGTTCTGG AGCGCGCACG TGTTCCAGTG CGAGGACGTG
CGCGCGTCGT CCGCGCGCGC GTTTCACGGG CACGGCTTTC TCTGCCGCTG CGAGCGCGGC
CGGTTCGTTC GCTTTCGCGG CGCGTCGCTC GACACGCGCA TCGGCGCGGC GTGGCTGAGC
GCCGCGCCAG CCGATGCGTA CGTGACGATC TGCGCGCTGC ATGCGGGCGA GTGCACGGTC
GAAGCGCCCG GCTTGCCGGA TGTGCGCTTT CGTGCGAACG AGCTGTTCAT GCTGGACGGC
GGGCAGCCGA TGCGCGTGCG CTGGAGCGAG CCGTGTTTCA GCGCGCTCAG GCTGCCGCGC
GCATCGGTGG GGCGTACGCT CGGCCAGGCG GCGATGGACG CGTCGCCGAG CGCGGCTTCG
TTGCAGGAGG CGCGGCTCGC GCCGTTTCTC GCGGCGGAGC TCGCGCTGAT CGGCGGTCGC
GGCCCGACGC TGTCGTCGGA CGAGCTCGAT TACATGCTCG CGCGCGCAGC CGAGCTCGGC
CGCACGCTGC TTCAGGCGGC GCTGTCGTCG CGCGCGCGGC GCGGCGCGCC CGCGCGCGCC
GACCGGCTGC AGGCCGCGTA TCGCTACATC GAGCAGCATC TTCATCTGCC GACGCTCACA
CCCGAGCGGA TCGCCGACGC GATCCATTGC TCGCGCACGC AGCTCTATCG GCTGTTCCGC
CATGAATCGC AGACGGTGAA GGCGGCGCTG CGCGAGGCGC GGCTGAACCG CAGCCTCGGC
TATCTCGAGC AGCCCGAGGT TACGCTCAGC ATCGGCGAGA TCGCGCATGC TTGCGGTTTT
CCCGATCAGT CGACGTTCGG CAAGCTGTTT CGCCGGCGCT TCGGCAGAAC GCCCGGCGAG
GTGCGCCGCG CCGCGCGGGG ATGTTCGATA TGA
 
Protein sequence
MAKMHAPRGG TLLQFFSTDD MPLARAAAFW SAHVFQCEDV RASSARAFHG HGFLCRCERG 
RFVRFRGASL DTRIGAAWLS AAPADAYVTI CALHAGECTV EAPGLPDVRF RANELFMLDG
GQPMRVRWSE PCFSALRLPR ASVGRTLGQA AMDASPSAAS LQEARLAPFL AAELALIGGR
GPTLSSDELD YMLARAAELG RTLLQAALSS RARRGAPARA DRLQAAYRYI EQHLHLPTLT
PERIADAIHC SRTQLYRLFR HESQTVKAAL REARLNRSLG YLEQPEVTLS IGEIAHACGF
PDQSTFGKLF RRRFGRTPGE VRRAARGCSI
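
Consistency check (illustrative)

The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the sequence. The sketch below is a minimal example, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon, both of which agree with the values listed above. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Gene Information table above.
length = gene_length(2024297, 2025289)
print(length)                  # 993
print(protein_length(length))  # 330
```

Passing the full gene sequence from the Sequence section to gc_percent gives a value that can be compared with the GC content field; the blank-separated 10-base blocks can be pasted as they are, because non-nucleotide characters are skipped.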