Gene Cagg_1099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1099 
Symbol 
ID7268551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1356609 
End bp1357901 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content57% 
IMG OID643565940 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_002462445 
Protein GI219848012 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000024984 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGACCCTGT TGGAATTGAC GCTGACGGTG GCGACGGTTG TGTTAACGGT GAGCCTGATC 
TTCACCGTGC GCCGGGGGCG TCAGCTTGAA CGCTTACGTG TTGCCCGTGA CCAAGAGTCT
ACTGCTGCAC CATCTGTTGC GGAACAGGAT GAGGTAATGA CGCTTACCCT CTCATCATCA
CTTCCACCGG ACGATCCATC GCTCCAGCTT GGCCTCCCCA ACTGTCAGCC CCTCTTTGTC
GCCCTTGTGC AAACCATCGA TACCGGTGTC ATCGTCGTCA ATCGCGCATG TCAAATCGTC
TTTCATAACG ACACAGCAAT TCATTTGTTG GCCGGTCAGC CTGATATGCG TGGGAGTGGG
TTGATCACGT TGGTGCGCGA TCATCAAGCT GACCGATTGG CGCACGATGC GATGACCGAT
GGTGAGCAAC GTGAATTAAC GATCCGTCCA CTGGCAACCG GGCGCACACT CCACTTGCAG
TTTGTCCCGT TATACACCGC TTCGCGGGAT ATTGTCGGTG CGCTGATCAC AATCCGTGAT
TTAACCCAAA TCAGTATGTT AGAGCGCGCG CGGCGTGATC TGGTTGCTAA TGTGTCGCAC
GAATTGCGCA CGCCGTTAGC TTCGCTCAAA TTGTTGGTTG AGACGTTGCA ATCGGCCCCA
CCACCCGATA TCGCTGCTCG GATGCTCGAA CAGATGGCAC AAGAGATCGA TGCCGTCACC
CAACTCGTTG ATGAACTGCA CGAGTTATCA CGTCTCGAGT CGGGTCGGGT CTCGTTGAAG
CTGGAGCCGC TCGCCGTCTG GCCGGTGATA GAGCGGGCGA TCGAGCGTAT CCGTCCCCAA
GCCGAACGCA AACACCACAC TATCTGTACC GAACCGGTGT CCGATCTACC GCCCGCGCTA
ATGGACGGTG ATCGGATTGG GCAGGTGTTG CTCAATTTGT TACACAACGC GGTTAAGTTT
ACCCCCGAAG GTGGCACAAT CACCGTTGCC GCCCAAGTAT TGCATATTGC CGTCGATGAT
CCTCCCTCGC GGACAGATCG ACCGCCGCAT CCGGCAGGTA CGTGGATCTT GATCAGTGTG
CGCGATACCG GGATCGGCAT TTCTAGTCGC GATATTCGGC GTATCTTCGA GCGCTTCTAC
AAAGTTGACC GGGCGCGTAC CCGCCATACC GGCGGCACCG GCTTGGGCTT GGCGATTGCC
AAACATCTGG TCGAAGGGCA CGGTGGCCGG ATTTGGGCCA GTAGCCAAGA AGGGCGTGGT
TCGACGTTCT GGTTTACCTT ACCGGTAGCG TGA
 
Protein sequence
MTLLELTLTV ATVVLTVSLI FTVRRGRQLE RLRVARDQES TAAPSVAEQD EVMTLTLSSS 
LPPDDPSLQL GLPNCQPLFV ALVQTIDTGV IVVNRACQIV FHNDTAIHLL AGQPDMRGSG
LITLVRDHQA DRLAHDAMTD GEQRELTIRP LATGRTLHLQ FVPLYTASRD IVGALITIRD
LTQISMLERA RRDLVANVSH ELRTPLASLK LLVETLQSAP PPDIAARMLE QMAQEIDAVT
QLVDELHELS RLESGRVSLK LEPLAVWPVI ERAIERIRPQ AERKHHTICT EPVSDLPPAL
MDGDRIGQVL LNLLHNAVKF TPEGGTITVA AQVLHIAVDD PPSRTDRPPH PAGTWILISV
RDTGIGISSR DIRRIFERFY KVDRARTRHT GGTGLGLAIA KHLVEGHGGR IWASSQEGRG
STFWFTLPVA