Gene BURPS1710b_A2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2001 
Symbol 
ID3694060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2433925 
End bp2435346 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content73% 
IMG OID637732255 
ProductGntR family transcriptional regulator 
Protein accessionYP_337152 
Protein GI76817606 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.577314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCTG CGTCGCGGAT CGATTGCGAT GCCGCATCGT ATGACCGAGA ATCAATACAA 
TCAAATAATT GTCATGGATA CATCGGCATG GCTCACGCCC GCTACAAGCG CCTCGTCGAC
ACGCTCGCCG CCGATATCCG CTCGGGCCGC CTGCCGCCCG GCGCGCGCCT GCCGACGCAT
CGCAAGCTCG CGGCGGCCGA GGGCCTCGCG CTCGTCACGG CGACGCGCGT CTACGCGGAG
CTCGAGGCAA TGGGGCTCGT GAGCGGCGAG ACCGGCCGCG GCACGTTCGT GCGGGAAACC
GCGCTGCCGC GCGGGCTCGG CGTCGACCAG CACGCGAGCG CCGCCGGCGT CGTCGATCTC
GCGTTCAACT ATCCGTCGCT GCCCGAGCAG GCCGAGCTGC TGCGCGGCGC GCTGCGCCAG
CTTGCGTCGT CGGGCGATCT CGACGCGCTG CTGCGCTACC AGCCGCACGG CGGGCGGTGG
CACGAGCGCG CGTCGGTCGC CCGCCATCTC GCGCGCCGCG GGCTGTCGGT CGACGCGCAA
CGCGTCGCGA TCGTCAACGG CGCGCAGCAC GGGCTCGCGG TGACCGCGAT GGCGCTGCTG
CGGCCGGGCG ACGTCGTCGC CGTCGACGCG CTCACCTACC CCGGCTTCAA GGTCGTCGCC
GATGCGCAGC ACCTCGAGCT CGCGCCGCTT CCGGCATCCG GCCAGGGCCC CGACCCCGAC
GCGCTCGAGC GCCTTTGCAG GACGCGGCGC GTGCGCGCGG TGTACACGAT GCCGACGCTG
CACAATCCGC TCGGCTGGGT GACGAGCGCG CACCGCCGGC GCCGGCTCGT CGCGATCGCG
CGCCGCCACG GGCTGCTGAT CATCGAGGAC GGCGCGTATG CGTTCCTCGC CGACGATCCG
CCCGAGCCGA TCGCCGCGCT CGCGCCGGAG GCTACCGTCT ACGTGTCCGG GCTGTCGAAG
AACGTCGCGA CCGGGCTGCG CGTCGGCTTC GTCGCGGCGC CCGAGCCGTG GGCGCCGGCG
ATCGAGCGCG CGATTCGGGG CACGACGTGG AACACGCCCG GCGTGATGAC GGCGATCGCC
TGCGGCTGGC TCGACGACGG AACGGTCGAG CGGCTCGAGG CGGACAAGCG CCGCGACGCC
GCGGCGCGGC AGGCGATCGC GAGCGAAGCG TTCGCGGGGC TGCGCTGCAT CCGCCATCCG
GCGTCATATT TCGTGTGGCT GCCGCTCGCC GACGACGCGC GCGCCGACCG GGTCGCGATG
ACGCTGATGC GCGAGCGGGT GGCGGTGTCG ACCGCCGAGC CGTTCGCGAC GTCCGCGCAC
GCGCCGCACG CGATCCGCGT CGCGCTCGGG TCCGTCGATC CGCCGACGCT GCGCGACGCG
CTCGGCAAGG TGCGACGGGC GATCGACGCG CATTCGTATT AG
 
Protein sequence
MSSASRIDCD AASYDRESIQ SNNCHGYIGM AHARYKRLVD TLAADIRSGR LPPGARLPTH 
RKLAAAEGLA LVTATRVYAE LEAMGLVSGE TGRGTFVRET ALPRGLGVDQ HASAAGVVDL
AFNYPSLPEQ AELLRGALRQ LASSGDLDAL LRYQPHGGRW HERASVARHL ARRGLSVDAQ
RVAIVNGAQH GLAVTAMALL RPGDVVAVDA LTYPGFKVVA DAQHLELAPL PASGQGPDPD
ALERLCRTRR VRAVYTMPTL HNPLGWVTSA HRRRRLVAIA RRHGLLIIED GAYAFLADDP
PEPIAALAPE ATVYVSGLSK NVATGLRVGF VAAPEPWAPA IERAIRGTTW NTPGVMTAIA
CGWLDDGTVE RLEADKRRDA AARQAIASEA FAGLRCIRHP ASYFVWLPLA DDARADRVAM
TLMRERVAVS TAEPFATSAH APHAIRVALG SVDPPTLRDA LGKVRRAIDA HSY