Gene Cagg_0230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0230 
Symbol 
ID7269144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp287200 
End bp288570 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content60% 
IMG OID643565099 
Productputative transcriptional regulator 
Protein accessionYP_002461614 
Protein GI219847181 
COG category[K] Transcription 
COG ID[COG2865] Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCGT CTCCCATTGC CGCTTTGCCC GGCCAGGCGC CAGGGCCGCG CCTAGCCTTT 
GCCGGTGATC GCCAACGCCC CGACGAAATC GCCGAGTCGC TGGCGGCGCT AGCTAATGCT
CACGGCGGCG CGCTGGTGAT CAGCGGCGGA CGCGGCTCGC AACTCGCCCC CCTGCGCGAT
CCGGCCGCCG CTATTGAGCT GGCGTTATCG GCAGCGCTGG CTTGCACGCC TCCACTGATT
ATCCCACTGC CCCAAGTCGT TGTCTACCAT GATATGCCCG TTGTCGTCGT GGAAGTTCCT
GCCGGATTAC CTTATGTCTA TTCTATCAAT GGCCGCTACT TACGCCGCGA AGGGGACAGC
AACCAGCCGT TGTCACCGGC AGCACTCCAC CGCCTGTTTA GCGAGCGGGC AGAACTGGGC
TGGGAACGGC AGGTGCCACT CGGCGCCTCC TTCGCCGAAC TCGATCCTGA CCTCATCACC
GCCTACGCGC GCCGAGTCGG TCCACCTGCC GGCGACGACC CAATGACTCT GCTAACCCGT
CGTGGCTGCC TCATCGATAA CCGGCCGACG AATGCCGGTC TTCTGCTCTT CGGACGCGAT
GTGGCTGTTC GTTTTCCCCA AGCCGAAATT ACCCTTGTGC GCTACCGCGG TCGTGAGCCG
GACGATGTCT TCGAGCGTGC CGATATTTGC GCGCCATTGC CTGACGCTAT CCGTCGTGCC
GAGCGTTGGC TTAACGATCA TATGCGCAAA GGTTCGCGGA TGATCGGCCT TGAACGCGAA
GACTGGACGC AATTCCCACC GGCGGCTGTG CGTGAGGCAT TGGTCAATGC GGTTGCCCAT
CGCGATTATG CAGCACGTGG CGAAGGAATT CGTATTACCC TCTTCAGCAA CCGACTCGAA
GTCTATTCAC CCGGTCGTCT CCCCGGGCAC GTCACTCTCG ATAATATTCG CGCCGAACGG
TTTTCGCGCA ACCCGGCTAT CGTGCAAGTC CTCGCCGATC TCGGTCTGGT CGAACGACTC
GGCTATGGTA TCGACCGCAT GCTGCGCCAC CTGGCTGCTG CCGGCTTACC ACCGGCTACT
TTCCACGAGA CTGCTGCCGG TTTTTTAGTG ATCTTGCCCG GTCACCCATT CGCCGAGGAA
CTCCCCGGTG GGATTGATAC GACGGCATGG CGACGAATGG GGTTGAATGA TCGTCAGATC
AGCGCGCTCC TCTTCGTCGT TGAACAGCAA CGGATCACCA ATCGCGATCT GCAAGAGATG
CATCCTGACG TTAGCCCAGA GACGATTCGC CGTGATCTAT CCGATCTTGT GGCCCGTGGG
TTACTTTTAA AGGTGGGAGA TAAACGCGCA ACCTATTATA TTCTGAAGTA A
 
Protein sequence
MESSPIAALP GQAPGPRLAF AGDRQRPDEI AESLAALANA HGGALVISGG RGSQLAPLRD 
PAAAIELALS AALACTPPLI IPLPQVVVYH DMPVVVVEVP AGLPYVYSIN GRYLRREGDS
NQPLSPAALH RLFSERAELG WERQVPLGAS FAELDPDLIT AYARRVGPPA GDDPMTLLTR
RGCLIDNRPT NAGLLLFGRD VAVRFPQAEI TLVRYRGREP DDVFERADIC APLPDAIRRA
ERWLNDHMRK GSRMIGLERE DWTQFPPAAV REALVNAVAH RDYAARGEGI RITLFSNRLE
VYSPGRLPGH VTLDNIRAER FSRNPAIVQV LADLGLVERL GYGIDRMLRH LAAAGLPPAT
FHETAAGFLV ILPGHPFAEE LPGGIDTTAW RRMGLNDRQI SALLFVVEQQ RITNRDLQEM
HPDVSPETIR RDLSDLVARG LLLKVGDKRA TYYILK