Gene Cagg_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2066 
Symbol 
ID7269225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2528312 
End bp2529835 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content54% 
IMG OID643566901 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_002463390 
Protein GI219848957 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCTCT CTCTACTCAG CCTGCAATAC GATCGCATTC AAGCACTTGC CGAGGCGTGG 
TTAGCACATG GTGCGCAGGC GTTTGGCGTT TATGCGAATG GCAGGGCATT AGCGTATTGG
CCGGCAGGGC AGCGGCTCTT GGCGCCCGAC ATCACGGCCC CCATTTACCA ATACGGCGAG
GTCGCCGGTG AGTTGCGGTT GACCGGTTTG CGCGATGAAG CTGCGCGTCG CCGGTTGCAA
GCTGAAGCGA ATCTGATCGG CTATATCTTG CAGCTTGAGT ACGAATTGCA ATGTATGACA
GCCGATCTCG TCGCCAGTCA AGATCAACAG TTGGCGCTCT ACCGGCTCAC TCAAGCCATG
CGTGATTTGG TGACAATCCG GGAGACGCTC GATACGGTCA TTGTTGAAGC GAAGCGGATG
GTAAAAGCAC AGGCCGGGTT TGCAACGTAT GTTCCGACCA ATGGCGGCGA ACCGTTGCTC
GTACAGTCAT CGGAGCAACG CTTGAGTCCG GAGAGTATTT GGCGGTTATA TTGGCAACTA
CAAACCGAAG ATCGCCCCAT CGTACTGAAT GAAAGTGATG GTGATCTGCG ACGGCCACCC
GGTGTGCGCA ATCTCTTACT CCTTCCGATC CGGGTACGCG GTATGATCAT GGCGAGCATC
GGCCTGATCG ATCGGAGTGG TGACTTTGGA ACACCGGAAT TGAAGTTGGG ACGGGCGATT
GCCGAGCAAG CCAGTGCTCA AATTGAGCGG ATCTTGCTCT ATCAGGAAAT GATCGAGCAA
GCGCGGCTGC GGAGCGAGAT GGATCTGGCC CGTCGGGTGC AGACCGATCT CTTACCACGG
ACGTTGCCCG ACGTACCCGG TCTTGACCTG TACGCCTATT CACGACCGGC GCTTCAGGTC
GGTGGTGATT TCTTCGATTT CATAACCGCT CCCAATCACC CGTTCATTTT CACGATTGGT
GACGTTAGTG GAAAAGGGGT TTCGGCGGCG CTGTTGATGT CAATGACGCG CACTGCGTTA
CACAGTAAAG CGCAGTTTAT GCCTTCGCCG ACACCGGCGT CGGTGATGCG ACAGTCGAAC
AAGGACCTCT ACAACGATTT TACCCGGATT GGTGTTTTTG CTACCGTTTT TGTTGGACAA
TACGAAGCCG AACGCCGAGA GATTGCATAC GCTAACGCTG GCCACGCTCC GGTTATTTAC
CGTCCGCGCA GTGGTAACGC CGAACTATTG TTGGCCGACA ACACTGCGAT AGGCATTTTG
CCGGTAAATC ATTTTCAAAA TCGTTATCTG CCGCTCAGGC CGGGTGATCT GCTTGTAGTT
GCGACCGACG GCTTTAGTGA TGCGCGCAAT GCAGACGATG AAATGTTCGG GATTGAGCGT
TTATTAATTG CAATTGATGA ATTGGCCGAA CGATCGGCGC GTGAGATTGC CGACGGCCTG
TTCAGCGCTA TCGATCGGTT TAGTGCCGGT CATCCGCAAG ATGACGATCA GACCCTTATT
GTTCTCAAAG GAGCGGAGCC GTGA
 
Protein sequence
MLLSLLSLQY DRIQALAEAW LAHGAQAFGV YANGRALAYW PAGQRLLAPD ITAPIYQYGE 
VAGELRLTGL RDEAARRRLQ AEANLIGYIL QLEYELQCMT ADLVASQDQQ LALYRLTQAM
RDLVTIRETL DTVIVEAKRM VKAQAGFATY VPTNGGEPLL VQSSEQRLSP ESIWRLYWQL
QTEDRPIVLN ESDGDLRRPP GVRNLLLLPI RVRGMIMASI GLIDRSGDFG TPELKLGRAI
AEQASAQIER ILLYQEMIEQ ARLRSEMDLA RRVQTDLLPR TLPDVPGLDL YAYSRPALQV
GGDFFDFITA PNHPFIFTIG DVSGKGVSAA LLMSMTRTAL HSKAQFMPSP TPASVMRQSN
KDLYNDFTRI GVFATVFVGQ YEAERREIAY ANAGHAPVIY RPRSGNAELL LADNTAIGIL
PVNHFQNRYL PLRPGDLLVV ATDGFSDARN ADDEMFGIER LLIAIDELAE RSAREIADGL
FSAIDRFSAG HPQDDDQTLI VLKGAEP