Gene Cagg_0903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0903 
Symbol 
ID7267976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1131248 
End bp1132378 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content58% 
IMG OID643565751 
Productanti-sigma-factor antagonist 
Protein accessionYP_002462257 
Protein GI219847824 
COG category[T] Signal transduction mechanisms 
COG ID[COG1366] Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00377] anti-anti-sigma factor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000110303 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000527056 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCTTG ATGATTTGCC ACTACCGACG GCCTATGCTC CCGATGATGG TTCGGTGCAA 
TGCAACCGGC GATGGTTGGA ATGGAGTAGT GTCACCGAAC CGCCAGTTGA TCTGGTTGAT
GCATGTGCGC AGGTAGTGGC GTGGGAGGCG GCGGATATGG CAGCAGCCCG TGAATGGATT
GCATTACTTA CGCCTGATGA TCCGCTATTA CAGTTGGGAG CGCCGTTGCG CTCTGATCGA
TCAAAGTATG TTCAGGTATT GATTCGTCGC AGTGGATCGG CATTGGTCGC GCAGCTTGTA
GATGTGACGG CGTTGCTCGA TGGACAGCGG GCGGCAGAGC GGCGACTTGA AACGATACTG
GGGGCGTTGG ATAGTTTAGA GGAAGGCTTT CTGTTACTTG ATGCTGAAGA TCGGATCGTT
ATGTGCAACC GGCGCTACCG GGAGTTATAC GCTATTAGTG CCGATCTGAT TGTGCCGGGG
CGACCTTTTG CGGAATTTAT CAGGTTGGGT GCTGAGCGCG GCCAGTACGC CGAGGCAATT
GGGCGAGTTG ACGAGTGGGT GGCCGAGCGG TTGCGGTTGC ATGCCGAGTT GGCGCCGATT
GAGCAGCACT TTGCCGATGG ACGTTGGATC AGGATCGTCG AGCGGCGGAC TGCGGACGGT
GGGGCTGTCG GTTTGCGGAT CGATATTACC GACATCAAAC AGGCCGAAGA GTTGCGTCGG
CAGTTGACGA TTCGGGAAGA GGTGATTGCG GCACAGGCGG CGCTGTTGGC CGAACTCTCG
ACGCCGTTGC TTGAAGTGGC GGAGCACGTG TTGTTGGCGC CGATGATCGG GGCATTTGAC
AGTACGCGGG TGGCGAGTTT GATTGAGGTG CTGTTGCGAA CGGTTCAACA GCGGCGAGCA
CGGGTGGTTG TGCTCGATGT GACGGGTGTT CCTGTGATCG ATACGCAGGT GGCACACGCG
ATATTGCAGT GTGCGGTTTC GATCCGCTTG TTGGGGGCAC GGTTAGTGCT CACCGGCATT
CGGCCTGATG TCGCGCAAAC GCTCGTTGCG TTGGGGGTTG ATTTGAGTGC GATTGTGACG
CGGGCTGACT TACGGGATGG GATACGCTAT GCGTTGCGGA GTCAGGGGTA G
 
Protein sequence
MLLDDLPLPT AYAPDDGSVQ CNRRWLEWSS VTEPPVDLVD ACAQVVAWEA ADMAAAREWI 
ALLTPDDPLL QLGAPLRSDR SKYVQVLIRR SGSALVAQLV DVTALLDGQR AAERRLETIL
GALDSLEEGF LLLDAEDRIV MCNRRYRELY AISADLIVPG RPFAEFIRLG AERGQYAEAI
GRVDEWVAER LRLHAELAPI EQHFADGRWI RIVERRTADG GAVGLRIDIT DIKQAEELRR
QLTIREEVIA AQAALLAELS TPLLEVAEHV LLAPMIGAFD STRVASLIEV LLRTVQQRRA
RVVVLDVTGV PVIDTQVAHA ILQCAVSIRL LGARLVLTGI RPDVAQTLVA LGVDLSAIVT
RADLRDGIRY ALRSQG