Gene Cpha266_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1547 
Symbol 
ID4569132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1756464 
End bp1757918 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content44% 
IMG OID639766129 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_911993 
Protein GI119357349 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGACT TAAAACTTCA ACTGAAACAA AAAGCGCTTC TTTCTGCCCA GCAGATTTTA 
GGCAGTCAGC TACTTCACCT TCCACTTGCC AATCTTGAAC AAAGAATAGA TGAAGAGCTT
CAGGAGAATC CTTTGCTTGA AATGCTTGAT GAAGAGCGTG GCAGCGCCGA AGATCTTCCT
GTTGATGTGA ATTCTCAGGT TGGTGATGAA CTCGCTGATC CCGTAGAACG ATTCAGCAGC
ATAACCAGCA AGGAAGAGCG TGATTTTCCG ATCCGGAATG AACGGTATGA ACGAGGTTCG
ACTTCTTTCA ATAAGGCCGG TACCGGGGAT CGTTTTTTTC AGGCTGTTCA ATATGACAGT
TTTCATGAAC AGCTTCTTAA ACAGCTTGTC TTGCAGGAAG ATATCGGTGA AAAAGAGATC
ATGATTGCTG TGGAAGTGCT TGGCAATCTT GATCATGACG ATTATCTCAC TGAAGACATT
TCCGTCATAC GTGACGGTCT TCGGTTAAAT GATATCGATG TTTCCGAGCG TGAAGTAAAG
AAGATTATTG ACAGAATTTC TCGTCTTGAT CCGCAGGGTA TTGCCGTGAA GGATCTTCGG
GAACGTCTTC TTGTTCAGTT GGAAGCCGGA ACATTTGGAG CCCGCCAGAG CGCTTCAGCG
CTTGCAGTGA GAATTCTGAC GGTCTGTTTT GATGATTTTA TTCACAAACG CTATAATCGG
CTGCTTAAAA CAATCGATGC GTCCAAAAAA CAGATTGAAG CCGCACTTGA GATTCTCGGC
GAACTTGACC CTCATCCGGG CGGGGTTTTT CAAAACGAAT TGGGTCATTA CATTGTCCCT
GATTTTATTG TGAGTTATGA ACAGGGTGAA TTGACAGCGA TGTTGAATGA TACCAGCACC
CTTTCCGTCA AAGTTTCTGA CAGGTATGAG AGTATTCTGA AAAACAGGAA AGCCTCAAAG
GCGGAAAAAC AGTTTATTCG CAGTAATCTG CAGCGAGCAA AAGAGTTTAC CAGCGCCCTG
CAACTGCGGC GGCAGACGCT CATGAAGGTT ATTGAGGCGC TTTTGCACTA TCAATACGAT
TTTTTTGTTT CCGGACCATC GTTTCTGGTT CCGCTTGGCA TGAAAACGAT TGCGGAAGCA
GCTTCCCTTG ATATATCGAC GATCAGCCGA GCCGTTAACG GTAAATATGT ACAAACCCGT
TTTGGCGTTT TTGAACTGAA ATATTTTTTC AGCGCAGGAC TTGCGACAGA GGATGGTGAT
GATCTGTCGA GCAAAATTAT CAAGCAGTAC ATTGGAGAAA TGGTAAGCGG CGAGGATCCT
CAACAGCCGT TAAGCGATGA GCTTCTTACC GAATTGTTGA AAAAAAAGGG TATACATATT
GCCAGGCGAA CGGTTGCAAA ATACCGCGAA CAAATGCAAA TTTCAGTTGC AAGATTAAGG
AAAAAAATAT TTTAA
 
Protein sequence
MGDLKLQLKQ KALLSAQQIL GSQLLHLPLA NLEQRIDEEL QENPLLEMLD EERGSAEDLP 
VDVNSQVGDE LADPVERFSS ITSKEERDFP IRNERYERGS TSFNKAGTGD RFFQAVQYDS
FHEQLLKQLV LQEDIGEKEI MIAVEVLGNL DHDDYLTEDI SVIRDGLRLN DIDVSEREVK
KIIDRISRLD PQGIAVKDLR ERLLVQLEAG TFGARQSASA LAVRILTVCF DDFIHKRYNR
LLKTIDASKK QIEAALEILG ELDPHPGGVF QNELGHYIVP DFIVSYEQGE LTAMLNDTST
LSVKVSDRYE SILKNRKASK AEKQFIRSNL QRAKEFTSAL QLRRQTLMKV IEALLHYQYD
FFVSGPSFLV PLGMKTIAEA ASLDISTISR AVNGKYVQTR FGVFELKYFF SAGLATEDGD
DLSSKIIKQY IGEMVSGEDP QQPLSDELLT ELLKKKGIHI ARRTVAKYRE QMQISVARLR
KKIF