Gene BURPS1106A_A2995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2995 
SymbolrpoN 
ID4903649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2917776 
End bp2919251 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content70% 
IMG OID640146098 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_001077024 
Protein GI126457119 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.342471 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCA CGCTCGCGTT GCAAATGCGT CAACACCTGG CGCTCACGCC GCGCTTGCAG 
CAGTCGCTGC GTTTGCTCCA GCTTTCGTCG CTCGAGTTTC AACAGGAACT GCGTCAGGCG
CTCGATACCA ATCCGTTTCT CGAAGACGTG CAATCGCCCG ACGACGATGC TGCCGAAGCC
GGGCCGAAGC CGGGCGAGAC GCCTGCCGCC GACGCAAACG CGAGGGCGGA CGACGGCTAC
GCCGAGCGCG ACGAGGGCCC GTTCGCGACC GACGCGTCGC CGCCCGCCGG CCAGGACGTG
CCGCTGACGG CGGAGCTCTC GGCGCGCGGC TCGAGCCGGC GCTCCGACGA CGCGTCGGAT
CTCGAGCCCG GCGACTGGAT GACGGCCGAG CCGACGCTGC ACGAGCATCT GCACGACGCG
CTGCGCCTTT GCCAGCTCAC CCGGCGCGAC CGCACGCTCG CGCGCATGAT CATCGACGCG
CTCGACGACG ACGGCTATCT GCGCCAGGCG CTGCCCGAGC TCGCGGCGGC GGCCGATCCG
CTGCTGCATC CGGCCGAGCA GGAACTGCTC GTCGCGCTGC GGCTCGTGCA GTCGCTCGAT
CAGCCCGGCA TCGGCGCGCG CACGCTGTCC GAATGCCTGT TGCTGCAGCT CGACGCGATG
CCGGCGGACA CGCCGGGCGT CGAATGCGCG AAGGAGATCG CCGCGCATCA CCTCGAGCGT
CTCGCACGCC GCGAGACGGC CGAGATGCAG CGCCGCATCG GCTGCGACAC GCACACGCTG
CGCATCGCAT GCACGCTCGT GCGCCGGCTC GATCCGCGCC CCGGCAACCA GTACGGCAGC
ACGGCGGGCA ACTATGTCGT CCCCGATGTG ATCGTGCGGC AGGTGCGCAA CGACTGGCTC
GTCACGATCA ACCCCGCCGT GATGCCGCGC GCGCGCATCC ATCGCCGCTA CGCGGAGCTG
TTCGCGCAAT CGAGCGGCTC GAATCAGTCG CCGCTCGGCC AGCAACTGCA GGAGGCGCGC
TGGCTGATCC GCAACGCGCA GAAGCGTTTC GACACGATCC GCCGCGTCGG CGAGTGCATC
GTCGAGCGGC AGCGCGACTT TTTCCGCTAC GGCGAGATCG CGCTGAAGCC GCTCGTGCTG
CGCGACATCG CCGACGAGCT CGGCCTGCAC GAATCGACGA TCTCGCGCGC GACCGGCAAC
AAGTACATGT CGACGCCGCA CGGCACGTTC GAGTTCAAGC ACTTCTTCCC GCGCAAGCTC
GAGGCGGCGG GCAAGGGCGC GTGCTCGGCG GCCGTCGCGA GGGTGCTGAT CCGCGACATG
ATCGCGGCCG AACAGGCGAT CGATCCGCTG TCGGACGTCG CGCTCGCGCA GCGTCTGGCG
GGGCGCGGGA TCGTGCTCGC GCGCCGCACC GTCACGAAGT ATCGGCAGGC GATGAAGATC
CCGCCCGCGG AATTGCGCCG CCGCGCGCCT CTATGA
 
Protein sequence
MSATLALQMR QHLALTPRLQ QSLRLLQLSS LEFQQELRQA LDTNPFLEDV QSPDDDAAEA 
GPKPGETPAA DANARADDGY AERDEGPFAT DASPPAGQDV PLTAELSARG SSRRSDDASD
LEPGDWMTAE PTLHEHLHDA LRLCQLTRRD RTLARMIIDA LDDDGYLRQA LPELAAAADP
LLHPAEQELL VALRLVQSLD QPGIGARTLS ECLLLQLDAM PADTPGVECA KEIAAHHLER
LARRETAEMQ RRIGCDTHTL RIACTLVRRL DPRPGNQYGS TAGNYVVPDV IVRQVRNDWL
VTINPAVMPR ARIHRRYAEL FAQSSGSNQS PLGQQLQEAR WLIRNAQKRF DTIRRVGECI
VERQRDFFRY GEIALKPLVL RDIADELGLH ESTISRATGN KYMSTPHGTF EFKHFFPRKL
EAAGKGACSA AVARVLIRDM IAAEQAIDPL SDVALAQRLA GRGIVLARRT VTKYRQAMKI
PPAELRRRAP L