Gene Cagg_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0201 
Symbol 
ID7269115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp251218 
End bp252414 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content59% 
IMG OID643565070 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_002461585 
Protein GI219847152 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.974231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGA GCCAGCCCGA GACCGACCGC ACCAGTTCTG CGATCAGCCC CGCCGATGTG 
CAGGCCCTGA TCGCGCAGGG ACGGCAACAG GGCTTTGTCA CCTTTGAGGA TATTCAGCGG
CTGGTTCCCA ATCTCGAAGA GTCGGTCGAG CAGATCGACA GTATTTATGC CGCCTTAGCC
GAAGCCGGTG TTCCCGTCCA CGACGGTGAT GAACCGGTTA GTGGGGATGA GTCGCTGACA
CCGTCAGTAA CCGATCTTGA CCTCGACGAT GAGCTGTCAG ACGCACTGCT TAGTGATAGT
GTGCGCCTCT ATTTGCGCGA GATTGGGCAA GTGCCGCTGT TGACCGCCGA ACAAGAGAAG
CAGTTGGCAC AGATGATCGA GCGCGGCCAA GCGGCTGAGC GTAAACTGGC TACGCTGCCG
CCCGACAGCC CTGAAGCAGC AAAGTTGCGT CGGATAAAGG AGCAGGGCGA TGAAGCACGC
CAAAAGATGG CAGCAGCGAA CTTGCGTTTG GTCGTGAGCA TCGCCAAACG TTACCGTGAT
CGTGGTCTGC CGCTGCTCGA CCTGATTCAG GAGGGGAGTC TTGGCCTGCT CCGCGCTATC
GAAAAATTCG ATCACACGAA GGGGTATAAG TTCAGTACCT ATGCGACATG GTGGATCAAG
CAGGCCCTCT CCCGTGCGCT GGCCGACCAG TCACGATTAG TGCGCTTGCC CGTTCATCTC
GGTGAGACGC TCAATCGGAT TCAGTCTGCG CGTCGGCAAC TCACCCAGTC GCTGGGCCGT
GAGCCGACCG ATACTGAGCT GGCGAATCAT CTTGGGATGA GCGAAGAAAA GCTGCGTGAA
CTACGGCGGA CTGCGCAAGA TCCGGTTTCG CTTGCGACAC CGGTCGGTGA AGAGGCTGAT
AGTACGCTCG CCGACTTCAT CCCTGATCCG CACGCGCTCG ATGCTGACGA TGCCGCCGCT
AGCGGTATGT TGCGCCAGCA GATTGCCGCT GCCCTCGACC AGCTCAGTGA GCGTGAACGG
CGGGTGCTTG AGCTGCGCTA CGGTCTTGCC GATGGCCAAC CACGCACACT TGAAGAGGTG
GGCAAAGCAT TTGGGGTGAC ACGCGAACGG GTGCGGCAAA TTGAAGTGAA GGCGCTGCGC
AAGCTGCGCC ATCCCCGCTT AGGGAAGCTG CTGAAGGATT ACCTCGATCA GGCGTAG
 
Protein sequence
MTQSQPETDR TSSAISPADV QALIAQGRQQ GFVTFEDIQR LVPNLEESVE QIDSIYAALA 
EAGVPVHDGD EPVSGDESLT PSVTDLDLDD ELSDALLSDS VRLYLREIGQ VPLLTAEQEK
QLAQMIERGQ AAERKLATLP PDSPEAAKLR RIKEQGDEAR QKMAAANLRL VVSIAKRYRD
RGLPLLDLIQ EGSLGLLRAI EKFDHTKGYK FSTYATWWIK QALSRALADQ SRLVRLPVHL
GETLNRIQSA RRQLTQSLGR EPTDTELANH LGMSEEKLRE LRRTAQDPVS LATPVGEEAD
STLADFIPDP HALDADDAAA SGMLRQQIAA ALDQLSERER RVLELRYGLA DGQPRTLEEV
GKAFGVTRER VRQIEVKALR KLRHPRLGKL LKDYLDQA