Gene BURPS668_A1878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1878 
Symbol 
ID4886757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1830283 
End bp1831581 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID640131816 
ProductAraC family transcriptional regulator 
Protein accessionYP_001062873 
Protein GI126444140 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0305888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCCC GATATCCCGG CGATTCGAAT TATCGATTCG CACAGATTTT GTGCTTTAAC 
GCGGAAACGT ATTTTCGTGG TGCATCGCCC TGCACAAGAA AATATCGCGC ACGGCGATCC
CGATTCGCGG TGCGCGCGCC TCAGGCTCGC GCGAGCGAAC CGCGCGCGGC GGTGCCGGAC
GCCCGTTTCG CATCGCCGCG GCGATTCCCA CTAGAATGGT GTCCGTCGGC GCACGGCGCG
CGAGCGGCGG CGACCGGCGC GGCCGCGCGG CACGGTATGA AACTTGCGTC TCATGGATGG
CGCGCGCCCC GCGCACACAT GCCGACGACA TCGAAACACC CGCCGGCGTG CGCGGCGCCC
AGCACGCGCA CGGAGCGGCC TCGCCCGCCG CCGGCTGTTT CGTTTCATCG GAATCGGGGT
TCGACCGTGG CCAAGCTAGA CCATCGCAAC CAGTCGCGTT ACTGGCACTC TCCCGGCATT
TCAGGGGTCG ATCTGTTGCT CGCCGACTTC ACGACGCACG ACTACGCGCC GCACGTGCAC
GATTCGCTTG TCGTCGCCGT CACGGAAGTC GGCGGTTCGG TGTTCAAGAG CCGCGGGCAG
ACGCGCCTCG CCGAGCCGAA CGCCGTGCTC GTGTTCAATC CGTGCGAGCC GCATTCGGGG
CGCATGGGCG GCAGCAGCCG CTGGCGCTAC CGGTCGTTCT ACCTCGCGGA AGCGGGCCTT
TCCCGCGTGC TGACGTTGCT CGGCATGGCG CAGCCGCGCT TTTTCACGTC GAACGTGCTC
GACGATCCTC AGCTCGTCGA ACAGTTTCTC ACCCTGCACC GCGCGATGGA CGAGCAGGAC
GATCTGCTGC GGCAGCAGGA ACTGCTCGTC AGCAGCTTCG GCACGCTGTT TTCGCGGCAC
GGGCTCCAGG CCGGGCTCGG CGCCGGCCCC GGCTTCGGCA CGAAGGCGGG CCTGCCGGCG
CTCAAGCCCG CGCTCGATCT GATGAACGAT TGCTTCGACC ACGCGCTCAC CCTCGAGCAG
ATCGCGGCGG CGGCGGGCCT CACGTCGTTC CAGCTGATCA CCGCGTTCAA CCGCGTGATC
GGCCTCACAC CGCACGCGTA CCTGAACCAG TTGAGGTTGC GCGCGGCGCT GCGCGAGCTG
CAGGCCGGCC GCTCGCTCGC CGACGCCGCG CTGACATCGG GCTTCTACGA TCAAAGCGCG
CTTTGCAACC ACTTCAAGCG CACGTTCGGG ATGACGCCGA TGCAGTACAC GCGCGCGCTC
GCGCCCGGCA AGCGCGCGCT CGCGCCGATC GGAATCTGA
 
Protein sequence
MDARYPGDSN YRFAQILCFN AETYFRGASP CTRKYRARRS RFAVRAPQAR ASEPRAAVPD 
ARFASPRRFP LEWCPSAHGA RAAATGAAAR HGMKLASHGW RAPRAHMPTT SKHPPACAAP
STRTERPRPP PAVSFHRNRG STVAKLDHRN QSRYWHSPGI SGVDLLLADF TTHDYAPHVH
DSLVVAVTEV GGSVFKSRGQ TRLAEPNAVL VFNPCEPHSG RMGGSSRWRY RSFYLAEAGL
SRVLTLLGMA QPRFFTSNVL DDPQLVEQFL TLHRAMDEQD DLLRQQELLV SSFGTLFSRH
GLQAGLGAGP GFGTKAGLPA LKPALDLMND CFDHALTLEQ IAAAAGLTSF QLITAFNRVI
GLTPHAYLNQ LRLRAALREL QAGRSLADAA LTSGFYDQSA LCNHFKRTFG MTPMQYTRAL
APGKRALAPI GI