Gene BURPS668_0132 details


Gene Information

Locus tag: BURPS668_0132
Symbol:
ID: 4883771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 128639
End bp: 129652
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 64%
IMG OID: 640126060
Product: AraC-type DNA-binding domain-containing proteins
Protein accession: YP_001057187
Protein GI: 126440673
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
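
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; both are assumptions that happen to reproduce the values in this record, not a documented convention of the database. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(128639, 129652)
print(length)                     # 1014
print(protein_length_aa(length))  # 337
```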


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.968788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTCAA GCGACAAACC CCGCTCGCTC GCCGCACGAC GTCCCGCGTC GCTGCATGCC 
GTCGCGGTCG CGGTCGACAT GTTGCAGCGG CGAGGCCTGA GCACGGAACT GATCCTCAGC
GGCTCGGGCA TCGCGCCCGC CGAGTTGCGC CAGCCGAACA AGATCATCTC GCATGCGCAG
GAGATGGTGA TCTATCACAA CGCGTGGCGG ATGACGGGCG ATTCGGCGAT CGGCCTCGCG
ATGGCCGACG CCGTGCCGCT CACCGCGTAC ATGCCGCTCG GGCTCGCGAT GATGGTCAGC
CCGACGCTCG GCGCCGCGAT CGAGCTCGCG AACAGTTGCC CGCTGCTCGC ATTGTGCTAT
TTCACCACGC GCCTCGAAAC AAAAGGCTCG CGAGCCGTCA TCACGTTCTC CGATTATTCG
TATCGGCCCG ATCTCTACGT GCTCAACACC GACATGTGTC TCGCGGGCCT GCGCAGGCAG
ATGTTCGATC TGCTCGGCGG CCCGCCGACA TTCCGTCAGG TGACGCTCGC TTTCGACGCG
CCGAAACATG CGTACGCATA CGAATCGCTG TTTCAATGTC CGATCAGATT CTCCGCGCCC
GCGCATTCGT TCACGCTCGA CGCGAACTGC ATGAACACAC CGTTGCCGAT GGCGCATCAA
CTCGAGCATA TGATCGCGAA GGACGCGTGC GTGCGGCGCG AGCAGGAACT CGAGCAATGG
GTTGCGGCGG ACGTCGTCGG CAAGGCGCTC CATTATCTGT ACGACCATCC GTTCACGGGC
ACCGTGCCCG CGCTCGCGGG TGCGCTCGGC ATGTCGACCC GCACGCTGCA GCGCAAGCTC
AAGCAGTCGG GCACGTCGCT GCAGCGTCTG CTCGAACAGG TGAGGCGCGA TCTGCTGATT
CAGGATCTGG CGCTGGGCTC GCGCTCGCGA AAGGACATCG CACGGCACAT CGGCTACAAG
GATCCGACCT CTGTGAGCCG CGCGCGACGC AGATGGGCGA AAGAAGATTC GTGA
 
Protein sequence
MASSDKPRSL AARRPASLHA VAVAVDMLQR RGLSTELILS GSGIAPAELR QPNKIISHAQ 
EMVIYHNAWR MTGDSAIGLA MADAVPLTAY MPLGLAMMVS PTLGAAIELA NSCPLLALCY
FTTRLETKGS RAVITFSDYS YRPDLYVLNT DMCLAGLRRQ MFDLLGGPPT FRQVTLAFDA
PKHAYAYESL FQCPIRFSAP AHSFTLDANC MNTPLPMAHQ LEHMIAKDAC VRREQELEQW
VAADVVGKAL HYLYDHPFTG TVPALAGALG MSTRTLQRKL KQSGTSLQRL LEQVRRDLLI
QDLALGSRSR KDIARHIGYK DPTSVSRARR RWAKEDS
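
As a rough illustration of how the gene and protein sequences relate, the following sketch strips the block spacing from a sequence, computes its GC fraction, and translates it codon by codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI ordering; translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G;
# third codon position varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGCGTCAA GCGACAAACC CCGCTCGCTC GCCGCACGAC GTCCCGCGTC GCTGCATGCC"
print(translate(first_line))  # MASSDKPRSLAARRPASLHA
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to the same two functions should give values comparable to the GC content and protein sequence listed in this record.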