Gene BURPS668_A3119 details


Gene Information

Locus tag: BURPS668_A3119
Symbol: rpoN
ID: 4886549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2959332
End bp: 2960807
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 70%
IMG OID: 640133055
Product: RNA polymerase factor sigma-54
Protein accession: YP_001064110
Protein GI: 126443717
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
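
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not appear in the protein; neither assumption is stated in the record itself. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span
    codons = gene_length // 3             # complete codons in the CDS
    protein_length = codons - 1           # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the record.
print(cds_lengths(2959332, 2960807))  # (1476, 491)
```

Under these assumptions the result agrees with the listed Gene Length (1476 bp) and Protein Length (491 aa).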


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.194904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCA CGCTCGCGTT GCAAATGCGT CAACACCTGG CGCTCACGCC GCGCTTGCAG 
CAGTCGCTGC GTTTGCTCCA GCTTTCGTCG CTCGAGTTTC AACAGGAACT GCGTCAGGCG
CTCGATACCA ATCCGTTTCT CGAAGACGTG CAATCGCCCG ACGACGATGC TGCCGAAGCC
GCGCCGAAGC CGGGCGAGAC GCCCGCCGCC GACGCAAACG CGAGGGCGGA CGACGGCTAC
GCCGAACGCG ACGAGGGCCC GTTCGCGACC GACGCGTCGC CGCCCGCCGG CCAGGACGTG
CCGCTGACGG CGGAGCTCTC GGCGCGCGGC TCGAGCCGGC GCTCCGACGA CGCGTCGGAT
CTCGAGCCCG GCGACTGGAT GACGGCCGAG CCGACGCTGC ACGAGCATCT GCACGACGCG
CTGCGCCTTT GCCAGCTCAC CCGGCGCGAC CGCACGCTCG CGCGCATGAT CATCGACGCG
CTCGACGACG ACGGCTATCT GCGCCAGGCG CTGCCCGAGC TCGCGGCGGC GGCCGATCCG
CTGCTGCATC CGGCCGAGCA GGAACTGCTC GTCGCGCTGC GGCTCGTGCA GTCGCTCGAT
CAGCCCGGCA TCGGCGCGCG CACGCTGTCC GAATGCCTGT TGCTGCAGCT CGACGCGATG
CCCGCGGACA CGCCGGGCGT CGAATGCGCG AAGGAGATCG CCGCGCATCA CCTCGAGCGT
CTCGCACGCC GCGAGACGGC CGAGATGCAG CGCCGCATCG GCTGCGACAC GCACACGCTG
CGCATCGCAT GCGCGCTCGT GCGCCGGCTC GATCCGCGCC CCGGCAACCA GTACGGCAGC
ACGGCGGGCA ACTATGTCGT CCCCGACGTG ATCGTGCGGC AGGTGCGCAA CGACTGGCTC
GTCACGATCA ACCCCGCCGT GATGCCGCGC GCGCGCATCC ATCGCCGCTA CGCGGAGCTG
TTCGCGCAAT CGAGCGGCTC GAATCAGTCG CCGCTCGGCC AGCAACTGCA GGAGGCGCGC
TGGCTGATCC GCAACGCGCA GAAGCGTTTC GACACGATCC GCCGCGTCGG CGAGTGCATC
GTCGAGCGGC AGCGCGACTT TTTCCGCTAC GGCGAGATCG CGCTGAAGCC GCTCGTGCTG
CGCGACATCG CCGACGAGCT CGGCCTGCAC GAATCGACGA TCTCGCGCGC GACCGGCAAC
AAGTACATGT CGACGCCGCA CGGCACGTTC GAGTTCAAGC ACTTCTTCCC GCGCAAGCTC
GAGGCGGCGG GCAAGGGCGC GTGCTCGGCG GCCGTCGCGA GGGTGCTGAT CCGCGACATG
ATCGCGGCCG AACAGGCGAT CGATCCGCTG TCGGACGTCG CGCTCGCGCA GCGTCTGGCG
GGGCGCGGGA TCGTGCTCGC GCGCCGCACC GTCACGAAGT ATCGGCAGGC GATGAAGATC
CCGCCCGCGG AATTGCGCCG CCGCGCGCCT CTATGA
 
Protein sequence
MSATLALQMR QHLALTPRLQ QSLRLLQLSS LEFQQELRQA LDTNPFLEDV QSPDDDAAEA 
APKPGETPAA DANARADDGY AERDEGPFAT DASPPAGQDV PLTAELSARG SSRRSDDASD
LEPGDWMTAE PTLHEHLHDA LRLCQLTRRD RTLARMIIDA LDDDGYLRQA LPELAAAADP
LLHPAEQELL VALRLVQSLD QPGIGARTLS ECLLLQLDAM PADTPGVECA KEIAAHHLER
LARRETAEMQ RRIGCDTHTL RIACALVRRL DPRPGNQYGS TAGNYVVPDV IVRQVRNDWL
VTINPAVMPR ARIHRRYAEL FAQSSGSNQS PLGQQLQEAR WLIRNAQKRF DTIRRVGECI
VERQRDFFRY GEIALKPLVL RDIADELGLH ESTISRATGN KYMSTPHGTF EFKHFFPRKL
EAAGKGACSA AVARVLIRDM IAAEQAIDPL SDVALAQRLA GRGIVLARRT VTKYRQAMKI
PPAELRRRAP L
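
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch, the following shows how such blocks could be cleaned, translated, and checked for GC content. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCCGCCA CGCTCGCGTT GCAAATGCGT CAACACCTGG CGCTCACGCC GCGCTTGCAG"
last_row = "CCGCCCGCGG AATTGCGCCG CCGCGCGCCT CTATGA"

print(translate(clean(first_row)))  # MSATLALQMRQHLALTPRLQ
print(translate(clean(last_row)))   # PPAELRRRAPL
print(round(gc_fraction(clean(first_row)), 2))
```

The two translated fragments match the start and end of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field in the record.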