Gene BURPS1106A_A1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1894 
Symbol 
ID4905001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1858480 
End bp1859931 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content61% 
IMG OID640145000 
Producthypothetical protein 
Protein accessionYP_001075928 
Protein GI126455619 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCTCAA TGCTCTATTT TCCGATGGTA TCGGCTCTGA GCTTGCTCGG TGCCGACGCG 
CCGACGCACT TGCATTCGCA CTTGAAGTTG ATTCTGGGCG GCGAGTTCAA TGCGGCGCTC
GAAAGATCGA GCGAATGGGC CGAAACGACG GTGGCATCGG AGCGGACATC ATGGGATCTG
CAATTGCACG CGGATCTGCA GCTGGTGCTT GGCTTCGAAG TCGAAGCCGA AGAAAACTAT
CGGCGCGCCC AGCGAAAAAT TCGCGGCTCA AACAGTAAGA TTCGCATCGC GACCTGCCGG
AACGCCGCGT GGCAAGCCCT GTTCCGCTAC CGGGTCACGA CCGCGCTCGC GTGTTTTTCC
CGAATCTGCG ACGAGCCCGG CATCGAGGCC GGCGGATTGG TGGAGGCGCG CTTTGGGATC
GCCTGCGCGC TCTATGAAAT GGGGCGGATA GACGATGCGT TTGATGCGAT CGATTCGATG
GAGAAGATCG CCGAACAGCA ATCGGACGAG ATGCGCGCGC ACTGGAAAGA CTTGATCGCC
GTGTTGCGTT TCGATCTCGT CGTGCAAAGC GAATTGCGCC GGGCTGCGGC GTTCGTCGAT
CATGTGTATT GGCAATCTGC GCAGTCGATG AGCCGGGTGG ACCGCGCGCA CGGTGTGTCG
GAGGCCGCCG TATCCGTCGA GACGCCGCTG CTGCGCGGCC GGGTGGCCTA TCTGCTGCAG
TTGCGATGCG CGGCCGCGGG CAATCGGGAC GCCGTCGCCG AGTTGGCGCG TTGCCTCGAT
GCGGCGGGCG AGCAGGGATT CGTCGACTTT CGATACACGC TGCGCCTCGA GATTGCGCTC
GCCCTGCTCG CGGGCGACGC GCCCAATTTG GCGCAATTCG TGTTGGAGCC GATTTCCGAC
ACATTGCATG GCGCAGAGTC GAGCCGCCGC TATCGGGAAT ATTTCTATTG CGCCGCGAAG
GTGCATCTGG CGCAGGACCA CACGCAGGAA TCGCTGGCCT TATACCGACG CTACGCGCTG
ATCGCGATGA GATGTCTGCG CGAGGACGCG CTGATCGGCA GGCAGTTCCT GGTCGGGCAG
GAACTGAAGC AGCTTCCCCA GTCCGACGAT GTGACCGTGC GCTTGCCGTT GAAATATCGA
CGCGCCTATC ACTATATTCT CCAGAATCTC AACCGTAGCG ACCTTTCGGT TCGGGAGATC
GCGGCGGAGA TCGGCGTCAC GGAGCGCGCG CTGCAGAACG CATTCAAGAT CTACCTCGGG
CTTTCCCCGC GTGAACTGAT CCGCTCGCGG AGAATGGAGC GTATCCGCAC GGAACTCGTC
GATTTCACGT TGACGGGTGA GCGCAACGTC AAGGAGGCGG CCCGAAAATG GGGTGTCCAG
AATGGTTCGA CACTCGTGAT CGCCTATCGG AAGGAGTACG ACGAAACCCC TTCGGAAACG
CTCGCGCGCT GA
 
Protein sequence
MFSMLYFPMV SALSLLGADA PTHLHSHLKL ILGGEFNAAL ERSSEWAETT VASERTSWDL 
QLHADLQLVL GFEVEAEENY RRAQRKIRGS NSKIRIATCR NAAWQALFRY RVTTALACFS
RICDEPGIEA GGLVEARFGI ACALYEMGRI DDAFDAIDSM EKIAEQQSDE MRAHWKDLIA
VLRFDLVVQS ELRRAAAFVD HVYWQSAQSM SRVDRAHGVS EAAVSVETPL LRGRVAYLLQ
LRCAAAGNRD AVAELARCLD AAGEQGFVDF RYTLRLEIAL ALLAGDAPNL AQFVLEPISD
TLHGAESSRR YREYFYCAAK VHLAQDHTQE SLALYRRYAL IAMRCLREDA LIGRQFLVGQ
ELKQLPQSDD VTVRLPLKYR RAYHYILQNL NRSDLSVREI AAEIGVTERA LQNAFKIYLG
LSPRELIRSR RMERIRTELV DFTLTGERNV KEAARKWGVQ NGSTLVIAYR KEYDETPSET
LAR