Gene Cagg_0684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0684 
Symbol 
ID7266936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp854286 
End bp856625 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content72% 
IMG OID643565543 
Producttranscriptional regulator, XRE family 
Protein accessionYP_002462052 
Protein GI219847619 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAC TCACGGAATT CTCTTTCGGG TATTGGGTGC GCCGACGGCG GCTGGCGCTC 
GACCTGACCC AAGCCGAGCT GGGCCGCCGC GCGGGCGTCA GCGCGGCCGC CATCCGCAAG
ATCGAGGCGG ACGAGCGGCG GCCGTCGTGC GAGCTGGCCG AGCTGCTGGC GTCGGCTTTG
GGCGTGCCCG CTGAGGAACG CGCCGCGTTT CTGGCCGCGG CGCGCGGCCT CGCGTCGATC
GAGCGCCTGC CGGTCGCCGT GGGACCGCTG GCCGCGCCCG CGCAGCCGAT ACCGCCCGCC
ACCCCGCACA ACCTGCCTGC GCCGCTGACC TCCTTCGTGA ACCGCACCTC CGATGTCGCC
GCGGTGACAG CCCTGCTGCG CCGGCCCGAC GTGCGCCTGC TCACCCTCGT CGGCCCGCCC
GGCGTCGGCA AGACGCGGCT CAGCATCCAC GCTGCGGAGG CGTTGCTGCC CGCTTTCCCC
GACGGCGTCT GGTTCGTTGA ACTGGCGCCC ATCAGCGACC CGGCGCTGGC GCTCGCGGCC
ATTGCGCGGC AGGCCGGGGT GTCTAAGCGG GCCGACGAGC CGTTGGCTCA GCGCTTTCGC
GCCGCCTTGG GCGGGAAGCG GGTGCTGTTG GTGCTGGACA ACTGCGAGCA GGTGGTGGAG
GTTGCGGCAG AGGTGAGCGA GTTGTTGCGG GCCTGCAAGG GGCTGAAGGT CCTAGCCACC
AGCCGCGTGC CGTTGCACGT CTACGGCGAG CACGAATACG CAGCCTCGCC GCTGTCGCTG
CCGCCGCCGG GTGCGCTGCC CGACCAGATG CTGGAGTTCG AGGCGGTGCA GTTGTTCGTG
GCCTGCACAC GGCAGCATCA GCCGGCTTTC GCCGTGGACG CAGCCAACGC CGGGGACGTG
GCCGACATCC TGCGCAAGCT GGAGGGCATT CCGCTGGCCA TCGAGCTGGC TGCCGTGGCC
TTGCGCCGCC TGTCCGTGGC CGGGTTGGCC GCCATGCTGT GTTGCGAGGC GAACTGGGTG
GACGCGATCC AGGCCACGGC GCGCGATCTG CCGCCGCGCC AGCGCACCCT GGCCAGCGCC
ATCGCCTGGA GCTACGATCT GCTCGACGCG GACAGCCAGC GCTGTTTGCG GCGCCTGGCC
GTCTTAGTCG GCCCTTTCAC CCTCGATGCC GCGGTTGCCA TCTGCACGGA GACGGCCGAT
CCGGCCGACC GCGCACGGGC GCAGGCGTGG CTGACGGCCC TGGCCGATCA CAGCCTGCTG
TCGCCCGAAC CGGGCCGCTG GCGGCTGCTG GAGATGGTGC GCGAGTTCGC CTGGAACCGG
CTCGACCCGC AAGAGCGCGA CCTGGCGCAG CGCAGGCACG CGGATTACTT CCGCAACCTG
TTGGCGCAGT CCGCGAGCGA CATGGCAGCC ATCGAGCGCG ACCACAACAA CTACCTGGCC
GCCCTGCGGT GGTTCATCGA ACGGGGGGAA GCCTCCGCTG CGCTGCGCAT GTGCGCTGAC
CTGGCCTGGT TCTGGGAGAC GCACGGTTAC GTGCACGAGG GCCAGGCGCT GATCCGCCGC
AGCCTGGCGC TCGGGGGAGA AGTGGCTGTC GAGCAGCGCA TCGGCCTCCT GTTTAAGGCA
GCCAACCTCT CCTGGCAGCG CCACGACTTC GCCAGCGCCG ACGAGTTTGC CGGTCAGGCC
ATCGCCATCG CCGAGGACGA ACGGCCCGAG GAGCTGGCGG CGCTGTTCAA CCTGGTGGGA
CGCATGGAGA TCGAGCGCGG GGAGTTTGCC CAGGCAGAAG CGGCGTTGCG ACGCAGCGCG
GCGCTGGCGC GGCAGCGGCC GGAGCTGCTC AACCCCGGCT TCCCCCTCTT GCAGTTGGGC
GAGGTCGCGT GGGCGCGCGG GGACCTGACG CAGGCTCAGG CGTTGTTCGA CGAGGCGGCC
GCGCTGCTGC CCGACGCCGG GCCGGAGCTG GCGCGCGCCA TCCTGCACAC CGACCGGGCC
GAGATCGCCC TAGCGCGCGG GGATGTGGTG GCGGCGCGGG GGGAACTGCT GCTGGCGCTG
CCCCACGTCC GCCAGCATGT GCGCCGCGTG CGCTTCTGGC TGGTCAGCCT GGCCGGTTGG
CTGCTGGCCG ACGGATCGCC GGAGGACGCC GCCTTAGCTG TGCAATGCCT AGCTGCCGAG
GAGGGACTGG GCGAGCGCGG CGGTCCTCTT TCCCCCATTT ACCATGCGCT GATCGCCCAA
CGCCGGCGCA CCGCGTGCGA CTTGGTGGAG GCAGCGACCT GGTCAGCGCT TTGGCGCGCC
GCGCGCGTGT GGACGGCGCA AGAGGCGCTG GCGCGGGCCG AGAGCTGGTT GGGCCGCTGA
 
Protein sequence
MDELTEFSFG YWVRRRRLAL DLTQAELGRR AGVSAAAIRK IEADERRPSC ELAELLASAL 
GVPAEERAAF LAAARGLASI ERLPVAVGPL AAPAQPIPPA TPHNLPAPLT SFVNRTSDVA
AVTALLRRPD VRLLTLVGPP GVGKTRLSIH AAEALLPAFP DGVWFVELAP ISDPALALAA
IARQAGVSKR ADEPLAQRFR AALGGKRVLL VLDNCEQVVE VAAEVSELLR ACKGLKVLAT
SRVPLHVYGE HEYAASPLSL PPPGALPDQM LEFEAVQLFV ACTRQHQPAF AVDAANAGDV
ADILRKLEGI PLAIELAAVA LRRLSVAGLA AMLCCEANWV DAIQATARDL PPRQRTLASA
IAWSYDLLDA DSQRCLRRLA VLVGPFTLDA AVAICTETAD PADRARAQAW LTALADHSLL
SPEPGRWRLL EMVREFAWNR LDPQERDLAQ RRHADYFRNL LAQSASDMAA IERDHNNYLA
ALRWFIERGE ASAALRMCAD LAWFWETHGY VHEGQALIRR SLALGGEVAV EQRIGLLFKA
ANLSWQRHDF ASADEFAGQA IAIAEDERPE ELAALFNLVG RMEIERGEFA QAEAALRRSA
ALARQRPELL NPGFPLLQLG EVAWARGDLT QAQALFDEAA ALLPDAGPEL ARAILHTDRA
EIALARGDVV AARGELLLAL PHVRQHVRRV RFWLVSLAGW LLADGSPEDA ALAVQCLAAE
EGLGERGGPL SPIYHALIAQ RRRTACDLVE AATWSALWRA ARVWTAQEAL ARAESWLGR