Gene BURPS668_A1987 details


Gene Information

Locus tag: BURPS668_A1987
Symbol:
ID: 4886596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1924372
End bp: 1925823
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 61%
IMG OID: 640131925
Product: hypothetical protein
Protein accession: YP_001062982
Protein GI: 126443667
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
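
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1924372
end_bp = 1925823
gene_length_bp = 1452
protein_length_aa = 483


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)            # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, 1925823 - 1924372 + 1 gives 1452 bp, and 1452 / 3 - 1 gives 483 aa, matching the reported values.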


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCTCAA TGCTCTATTT TCCGATGGTA TCGGCTCTGA GCTTGCTCGG TGCCGACGCG 
CCGACGCACT TGCGTTCGCA CTTGAAGTTG ATTCTGGGCG GCGAGTTCAA TGCGGCGCTC
GAAAGATCGA GCGAATGGGC CGAAACGACG GTGGCATCGG AGCGGACATC ATGGGATCTG
CAATTGCACG CGGATCTGCA GCTGGTGCTT GGCTTCGAAG TCGAAGCCGA AGAAAACTAT
CGGCGCGCCC AGCGAAAAAT TCGCGGCTCA AACAGTAAGA TTCGCATCGC GACCTGCCGG
AACGCCGCGT GGCAAGCCCT GTTCCGCTAC CGGGTCACGA CCGCGCTCGC GTGTTTTTCC
CGAATCTGCG ACGAGCCCGG CATCGAGGCC GGCGGATTGG TGGAGGCGCG CTTTGGGATC
GCCTGCGCGC TCTATGAAAT GGGGCGGATA GACGATGCGT TTGATGCGAT CGATTCGATG
GAGAAGATCG CCGAACAGCA ATCGGACGAG ATGCGCGCGC ACTGGAAAGA CTTGATCGCC
GTGTTGCGTT TCGATCTCGT CGTGCAAAGC GAATTGCGCC GGGCTGCGGC GTTCGTCGAT
CATGTGTATT GGCAATCTGC GCAGTCGATG AGCCGGGTGG ACCGCGCGCA CGGTGTGTCG
GAGGCCGCCG TATCCGTCGA GACGCCGCTG CTGCGCGGCC GGGTGGCCTA TCTGCTGCAG
TTGCGATGCG CGGCCGCGGG CAATCGGGAC GCCGTCGCCG AGTTGGCGCG TTGCCTCGAT
GCGGCGGGCG AGCAGGGATT CGTCGACTTT CGATACACGC TGCGCCTCGA GATTGCGCTC
GCCCTGCTCG CGGGCGACGC GCCCAATTTG GCGCAATTCG TGTTGGAGCC GATTTCCGAT
ACATTGCATG GCGCAGAGTC GAGCCGCCGC TATCGGGAAT ATTTCTATTG CGCCGCGAAG
GTGCATCTGG CGCAGGACCA CACGCAGGAA TCGCTGGCCT TATACCGACG CTACGCGCTG
ATCGCGATGA GATGTCTGCG CGAGGACGCG CTGATCGGCA GGCAGTTCCT GGTCGGGCAG
GAACTGAAGC AGCTTCCTCA GTCCGACGAT GTGACCGTGC GCTTGCCGTT GAAATATCGG
CGCGCCTATC ACTATATTCT CCAGAATCTC AACCGTAGCG ACCTTTCGGT TCGGGAGATC
GCGGCGGAGA TCGGCGTCAC GGAGCGCGCG CTGCAGAACG CATTCAAGAT CTACCTCGGG
CTTTCCCCGC GTGAGCTGAT CCGCTCGCGG AGAATGGAGC GTATCCGCAC GGAACTCGTC
GATTTCACGT TGACGGGTGA GCGCAACGTC AAGGAGGCGG CCCGAAAATG GGGTGTCCAG
AATGGTTCGA CACTCGTGAT CGCCTATCGG AAGGAGTACG ACGAAACCCC TTCGGAAACG
CTCGCGCGCT GA
 
Protein sequence
MFSMLYFPMV SALSLLGADA PTHLRSHLKL ILGGEFNAAL ERSSEWAETT VASERTSWDL 
QLHADLQLVL GFEVEAEENY RRAQRKIRGS NSKIRIATCR NAAWQALFRY RVTTALACFS
RICDEPGIEA GGLVEARFGI ACALYEMGRI DDAFDAIDSM EKIAEQQSDE MRAHWKDLIA
VLRFDLVVQS ELRRAAAFVD HVYWQSAQSM SRVDRAHGVS EAAVSVETPL LRGRVAYLLQ
LRCAAAGNRD AVAELARCLD AAGEQGFVDF RYTLRLEIAL ALLAGDAPNL AQFVLEPISD
TLHGAESSRR YREYFYCAAK VHLAQDHTQE SLALYRRYAL IAMRCLREDA LIGRQFLVGQ
ELKQLPQSDD VTVRLPLKYR RAYHYILQNL NRSDLSVREI AAEIGVTERA LQNAFKIYLG
LSPRELIRSR RMERIRTELV DFTLTGERNV KEAARKWGVQ NGSTLVIAYR KEYDETPSET
LAR
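
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a minimal example, not the pipeline that produced this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the standard genetic code, whose sense-codon assignments translation table 11 shares (the two tables differ only in which codons may act as start codons). Only the first and last blocks of the gene sequence are used here, copied from the listing above.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; trailing partial codons are ignored."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGTTCTCAA TGCTCTATTT TCCGATGGTA"  # start of the gene sequence
last_codons = "CTCGCGCGCT GA"                      # end of the gene sequence

print(translate(first_codons))  # MFSMLYFPMV
print(translate(last_codons))   # LAR*
```

The first ten residues and the final three residues match the start and end of the protein sequence, with the terminal TGA read as a stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.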