Gene BURPS1710b_A1345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1345 
SymbolrpoN 
ID3694157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1656150 
End bp1657625 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content70% 
IMG OID637731599 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_336502 
Protein GI76819614 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCA CGCTCGCGTT GCAAATGCGT CAACACCTGG CGCTCACGCC GCGCTTGCAG 
CAGTCGCTGC GTTTGCTCCA GCTTTCGTCG CTCGAGTTTC AACAGGAACT GCGTCAGGCG
CTCGATACCA ATCCGTTTCT CGAAGACGTG CAATCGCCCG ACGACGATGC TGCCGAAGCC
GCGCCGAAGC CGGGCGAGAC GCCCGCCGCC GACGCAAACG CGAGGGCGGA CGACGGCTAC
GCCGAGCGCG ACGAGGGCCC GTTCGCGACC GACGCGTCGC CGCCCGCCGG CCAGGACGTG
CCGCTGACGG CGGAGCTCTC GGCGCGCGGC TCGAGCCGGC GCTCCGACGA CGCGTCGGAT
CTCGAGCCCG GCGACTGGAT GACGGCCGAG CCGACGCTGC ACGAGCATCT GCACGACGCG
CTGCGCCTTT GCCAGCTCAC CCGGCGCGAC CGCACGCTCG CGCGCATGAT CATCGACGCG
CTCGACGACG ACGGCTATCT GCGCCAGGCG CTGCCCGAGC TCGCGGCGGC GGCCGATCCG
CTGCTGCATC CGGCCGAGCA GGAACTGCTC GTCGCGCTGC GGCTCGTGCA GTCGCTCGAT
CAGCCCGGCA TCGGCGCGCG CACGCTGTCC GAATGCCTGT TGCTGCAGCT CGACGCGATG
CCGGCGGACA CGCCGGGCGT CGAATGCGCG AAGGAGATCG CCGCGCATCA CCTCGAGCGT
CTCGCACGCC GCGAGACGGC CGAGATGCAG CGCCGCATCG GCTGCGACAC GCACACGCTG
CGCATCGCAT GCACGCTCGT GCGCCGGCTC GATCCGCGCC CCGGCAACCA GTACGGCAGC
ACGGCGGGCA ACTATGTCGT CCCCGATGTG ATCGTGCGGC AGGTGCGCAA CGACTGGCTC
GTCACGATCA ACCCCGCCGT GATGCCGCGC GCGCGCATCC ATCGCCGCTA CGCGGAGCTG
TTCGCGCAAT CGAGCGGCTC GAATCAGTCG CCGCTCGGCC AGCAACTGCA GGAGGCGCGC
TGGCTGATCC GCAACGCGCA GAAGCGTTTC GACACGATCC GCCGCGTCGG CGAGTGCATC
GTCGAGCGGC AGCGCGACTT TTTCCGCTAC GGCGAGATCG CGCTGAAGCC GCTCGTGCTG
CGCGACATCG CCGACGAGCT CGGCCTGCAC GAATCGACGA TCTCGCGCGC GACCGGCAAC
AAGTACATGT CGACGCCGCA CGGCACGTTC GAGTTCAAGC ACTTCTTCCC GCGCAAGCTC
GAGGCGGCGG GCAAGGGCGC GTGCTCGGCG GCCGTCGCGA GGGTGCTGAT CCGCGACATG
ATCGCGGCCG AACAGGCGAT CGATCCGCTG TCGGACGTCG CGCTCGCGCA GCGTCTGGCG
GGGCGCGGGA TCGTGCTCGC GCGCCGCACC GTCACGAAGT ATCGGCAGGC GATGAAGATC
CCGCCCGCGG AATTGCGCCG CCGCGCGCCT CTATGA
 
Protein sequence
MSATLALQMR QHLALTPRLQ QSLRLLQLSS LEFQQELRQA LDTNPFLEDV QSPDDDAAEA 
APKPGETPAA DANARADDGY AERDEGPFAT DASPPAGQDV PLTAELSARG SSRRSDDASD
LEPGDWMTAE PTLHEHLHDA LRLCQLTRRD RTLARMIIDA LDDDGYLRQA LPELAAAADP
LLHPAEQELL VALRLVQSLD QPGIGARTLS ECLLLQLDAM PADTPGVECA KEIAAHHLER
LARRETAEMQ RRIGCDTHTL RIACTLVRRL DPRPGNQYGS TAGNYVVPDV IVRQVRNDWL
VTINPAVMPR ARIHRRYAEL FAQSSGSNQS PLGQQLQEAR WLIRNAQKRF DTIRRVGECI
VERQRDFFRY GEIALKPLVL RDIADELGLH ESTISRATGN KYMSTPHGTF EFKHFFPRKL
EAAGKGACSA AVARVLIRDM IAAEQAIDPL SDVALAQRLA GRGIVLARRT VTKYRQAMKI
PPAELRRRAP L