Gene Cpha266_2028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2028 
Symbol 
ID4569148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2343961 
End bp2344860 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content49% 
IMG OID639766609 
ProductRNA polymerase, sigma 32 subunit, RpoH 
Protein accessionYP_912464 
Protein GI119357820 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.738085 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCAAC TTAAAATAAG CAAGCAGATT ACCAATCGTG AGAGCCTGTC GCTTGATCGG 
TATCTGCAGG AGATAGGAAA GTATGATTTA CTGACCGCCG AAGATGAGGT GAAACTGACC
AAGGCGATCA AGGAGGGTTA TGATACACCG GTTGATACCG TCGAATACAG AAGGGCCAAG
CGTGCGCTTG ACAAGTTGAT CAAGGGAAAC CTGAGGTTTG TTGTTTCTGT TGCCAAACAG
TACCAGAATC AGGGGCTTAC GCTCGGCGAT CTTATTAATG AAGGGAATCT TGGTTTGATC
AAGGCAGCCA AACGCTTCGA TGAAACGAGG GGATTCAAGT TTATCTCCTA TGCGGTCTGG
TGGATTCGTC AGTCTATTCT TCAGGCGCTT GCCGAACAGT CGAGGATTGT GAGGCTGCCG
CTGAACAGGG TCGGAACCCT GAACAAGATC AGCAAGGCTT ACAGCCAGTT GGAACAGGAG
TTCGAACGCG ATCCGAATAC GCGGGAACTT GCCAATCTTC TCGATATGGA TTCCCAGGAT
GTTGCCGATA CGCTCAAGAT TGCCGGAAGG CATGTTTCTG TTGATGCTCC GTTTGCGCAG
GGTGATGATA ATCGCCTTCT CGATGTTCTT CAGAATGACG GTCATCTTCC TGACCATGGG
CTCAACAAGG ACTCTCTCAC TCTTGAAGTT GAACGATCTC TCTCCGTGCT TGCTCCGAGA
GAAGCGGACG TGATCCGTTC CTATTTCGGC ATAGGGATGG ATAATCCACT GACCCTTGAG
GAGATTGGCG AAAAATTCAA GCTGACCCGC GAGCGTGTTC GCCAGATCAA GGAAAAAGCG
ATACGCAGGT TGCGCCAGTC GGCATACAGC AAGATTCTTA AGGAGTATAT CGGCAGTTAA
 
Protein sequence
MRQLKISKQI TNRESLSLDR YLQEIGKYDL LTAEDEVKLT KAIKEGYDTP VDTVEYRRAK 
RALDKLIKGN LRFVVSVAKQ YQNQGLTLGD LINEGNLGLI KAAKRFDETR GFKFISYAVW
WIRQSILQAL AEQSRIVRLP LNRVGTLNKI SKAYSQLEQE FERDPNTREL ANLLDMDSQD
VADTLKIAGR HVSVDAPFAQ GDDNRLLDVL QNDGHLPDHG LNKDSLTLEV ERSLSVLAPR
EADVIRSYFG IGMDNPLTLE EIGEKFKLTR ERVRQIKEKA IRRLRQSAYS KILKEYIGS