Gene Cagg_0979 details


Gene Information

Locus tag: Cagg_0979
Symbol:
ID: 7268351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1211455
End bp: 1212600
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 53%
IMG OID: 643565828
Product: RNA polymerase, sigma 70 subunit, RpoD subfamily
Protein accession: YP_002462333
Protein GI: 219847900
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
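
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1211455
end_bp = 1212600
gene_length_bp = 1146
protein_length_aa = 381


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1146
print(protein_length(gene_length_bp))  # 381
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.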


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.048773
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.10867
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAGC AACTGCTCGC CGCAGCACGG GTGCGCGGGT ACATTACCCA CGCCGACATT 
CTTGCTACCT TCCCTAATCC TGAGCACGAC ATTGCCGAGA TCGATCAGTT GTACGCCATG
CTGCAAGCTG AGGGCATTAG GGTTGTCGAG TCAAGTGATG AACTAGACGG TCCGACTGAT
TTTGAACCAG AAACCGATCA CGATTTGACT GTTGATTTGC CCGATCTGGG TGAAATTGCG
TTTGACGATC CGGTACGTAT GTATTTGCAA GAGATCGGGC AAGTTCCTCT ATTGACTGCC
GAGCAAGAGG TTGAATTAGC TAAAGCAATG GAGGCCGGCG CTATTGCCCG TCAACGTCTT
GACCGTGAAG AGTACGCATC AGCACGTGAA CGGTTTGAGT TGGAGCGCGC CGTCCAACAA
GGTCAAGATG CCCGCCACCA TCTGATTCAG GCTAATTTGC GATTGGTTGT GAGTATTGCA
AAGAAATATA CCTCTTATGG CCTGACCATG ATGGATTTGG TGCAGGAAGG TAATATTGGC
CTGATGCGAG CAGTTGAAAA GTTTGATTAT AAGAAGGGCC ATAAGTTCAG TACTTACGCA
ACGTGGTGGA TCCGACAGGC CATTACCCGT GCCATTGCCG ACCAGAGCCG CACGATCCGC
TTGCCGGTTC ATATGGGTGA AGCGATCAGC CAGGTGAAAC GTACTTCCCA TCGCCTCCAG
CAGACGATGC AGCGTGAACC AACACCTGAA GAGATCGCCG ATGCAATGGG CATTTCGGCG
GGTAAAGTAC GCCGCACGCT CGAGGCAAGC ATGCACCCGC TCTCACTTGA GATGCCGGTC
GGTCAAGAGG GCGAAGGGCG CATGGGTGAT TTCATCGAGG ATGATCGCAT CTCGACACCG
GCCGAGGCCG CAGCAGCTTC ATTGCTGCGC GAACAACTCG AAGAGGTGTT GATGAAATTG
CCTGAGCGCG AACGGAAGAT TATTCAGTTG CGCTATGGTT TGAAAGATGG CCGCTACCGC
ACCCTCGAAG AGGTGGGGAT TGAGTTTGGG ATTACCCGCG AACGGATTCG CCAAATCGAA
GCAGTCGCGC TGCGGAAGCT GCGTCATCCC CATTTGGGGA AGAAGCTGCG CGGTTATCTC
GATTAA
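
The gene sequence is laid out in blocks of ten bases separated by spaces, so it needs cleaning before any computation. The sketch below is one way to strip the layout and compute a GC percentage such as the one reported in the GC content field. The helper names are hypothetical, and the example input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence above.
print(gc_percent("ATGATCGAGC AACTGCTCGC"))  # 55.0
```

Passing the full gene sequence text to `gc_percent` gives the whole-gene value for comparison with the reported figure.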
 
Protein sequence
MIEQLLAAAR VRGYITHADI LATFPNPEHD IAEIDQLYAM LQAEGIRVVE SSDELDGPTD 
FEPETDHDLT VDLPDLGEIA FDDPVRMYLQ EIGQVPLLTA EQEVELAKAM EAGAIARQRL
DREEYASARE RFELERAVQQ GQDARHHLIQ ANLRLVVSIA KKYTSYGLTM MDLVQEGNIG
LMRAVEKFDY KKGHKFSTYA TWWIRQAITR AIADQSRTIR LPVHMGEAIS QVKRTSHRLQ
QTMQREPTPE EIADAMGISA GKVRRTLEAS MHPLSLEMPV GQEGEGRMGD FIEDDRISTP
AEAAAASLLR EQLEEVLMKL PERERKIIQL RYGLKDGRYR TLEEVGIEFG ITRERIRQIE
AVALRKLRHP HLGKKLRGYL D
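
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGATCGAGC AACTGCTCGC CGCAGCACGG"))  # MIEQLLAAAR
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene, `GAT TAA`, translate to the closing `D` followed by a stop.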