Gene Cagg_1079 details

Gene Information

Locus tag: Cagg_1079
Symbol:
ID: 7268531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1334259
End bp: 1336733
Gene Length: 2475 bp
Protein Length: 824 aa
Translation table: 11
GC content: 56%
IMG OID: 643565924
Product: ATP-dependent protease La
Protein accession: YP_002462429
Protein GI: 219847996
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
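
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the gene record above.
start_bp = 1334259
end_bp = 1336733

# Assumption: coordinates are 1-based and inclusive, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2475, matching "Gene Length"
print(protein_length)   # 824, matching "Protein Length"
```

Under these assumptions the 2475 bp gene corresponds to 825 codons, one of which is the stop codon, leaving the 824 aa protein given in the record.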


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.932404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.310729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAAC CAATGTCATT GTTTGATGAT CTGCCTGAGG AGCATGATGA ACCGCAAGAA 
GCGCCTGAGC GTCGGTTGCC AATGGTGGTG CTCGGCGAGA TGGTCATCAT GCCGCACATG
ACGATACCAT TGCAGGTGCC GCAAGGGAAA TCCTACCGTG CGATGGAGCG CGCGTGGGAA
GAAGATCGCG ATGTCTTGTT GATCTTCGTC CGCGAGCACC AGCTTGAAGG CTACAAGAGC
AATCAACCAC AGAATCTGCC ACCAATTGGT GTGATTGCTC AGTTGCAAGA GTTTGCCAAA
CTGAACGACG GTACTGCCCG CGTGATTCTG GAAGGCCAGA GTCGGGCGCA GATTATTGAG
GCGATCCAGA TCACACCCTT CTACCGGGTG CGCTGCCGGC CATACACTGA CCCGCCGGTG
AGTGGTCTGG AAGTAGAAGC CTTGATGGAG ACGGTCAAGC AGCAAGTTGA TGAGTTTGTC
GAGCATCTTG GTGAAGTGCC GCAAGAGGCC GTCCAGTTCG TCCATCGCAT TGACCGTCCC
GGCCACTTAG CCGACATTGT GACATGGGGA CCGGCGTTTG ATTTTAAAGA TCGGCTCGAG
GTTCTTAACA CACTCGACCC GGTTGAGCGC TTGCGCAAGG TCTATCTTGT GTTAGCGCGA
CAGCTTGAAC TCCTCAAGTT GCGGGTCAAG ATTCAGCAAG ATACCAAAGA GGTGCTCGAT
CAGAGCCAAC GTGAGTACTT TTTGCGCGAG CAGTTGCGGA TTATTCGCCG CGAGCTAGGT
GAAGATGAAG AGGGTGATGA TCCGATCGAC GAACTACGGC GCAAGATTCA CGAACTCGAT
GCGCCGGAGT ACGTGAAAAA TCAGGCGTTG CATGAGTTGA AGCGGTTGGC CCAGCAGGGG
ATGAACAACC CCGAGTCGGG GGTCATTCGC ACCTATCTCG ATTGGATCCT CTCGTTGCCA
TGGGCTGATG AGGAATTGCC CGAGATCAGC ATTACCGAGG CTCAGAAAGT GCTTGACGCC
GATCACTATG GGTTGGAAAA GGTAAAAGAG CGTATCCTCG AATATCTAGC CGTGCGCAAA
CTGGCCGGTG ATAAGATGCG TTCTCCCATT CTCTGCTTCG TTGGCCCACC CGGTGTCGGC
AAGACAAGCC TCGGTCGCAG TATTGCGCGC GCCTTGGGGC GCAAGTTTGT GCGCACCAGT
CTGGGTGGTG TACGGGATGA AGCTGAAATT CGTGGGCACC GTCGCACCTA CATCGGTGCT
ATGCCCGGTC GCATCATTCA GGCGATGAAG AATGCTAAGT CGAAGAGTCC GGTCTATATC
CTCGACGAAG TGGATAAGAT CGGGTTGGAT TTTCGCGGTG ATCCGACGTC GGCGCTGCTT
GAGGTGCTCG ATCCAGAGCA AAACAACGCC TTCAGCGATC ACTATCTCGA AATTCCGTTC
GATCTGAGCA AGGTGATCTT TATCGCGACG GCTAATCAGC TCGATCCGAT CCCGTTACCG
TTGCGCGACC GTATGGAGAT CATCGAGATC GGCGGTTACA CCGAGGACGA GAAGTTGGAA
ATTGCCCGCG GTTTCCTCAT TCCCAAGCAG CGTGAGTTCC ATGGGTTGAC AGAAGATCAG
ATCGAGTTTA CCGAGGGCGC GATTCTGAAG CTGATCCGCG AGTATACCCG CGAAGCCGGT
GTGCGCGGTC TTGAACGTGA GATCGCCAGC TTGTGCCGCA AAGTGGCTCG CCAAGTCGCC
GAGCAGACGG AAGCGAACGG CGAACTACCG CCGAAGTTTG TGATCGATGA AGCTGCCGTG
GTCAAGTACC TTGGCCCGGA GCGCTACACA TACGGGATCG CCGAAGAGCA AGACGAGGTT
GGTGTAGCAA CCGGCGTAGC GTGGACGAGT GCCGGCGGCG ATATTCTCAG TATCGAGGTG
TTGCCGTTTA AGGGCAAAGG TCAGTTGCAA CTGACCGGTC AGCTTGGTGA GGTGATGAAG
GAGAGTGCGC AAACGGCGGT CAGCTACGTG CGCTCGCGGG CTGCCGATTT TGGTATCGAT
CCCAATACCT TTGAGGAGAC GAATATTCAC ATTCACATTC CCGAAGGTGC GGTACCGAAG
GATGATCCCT CGGCGGGGAT TACGCTGACG ACGGCGCTGA TCAGTGCGCT CACCGGTACA
CCGGTGCGCC GCGATGTAGC CATGACCGGC GAGGTTACGC TGCGCGGTAA GGTGTTGCCG
ATTGGTGGTC TGAAAGAGAA GACACTGGCA GCACATCGCG CCGGGATTCG CACCTTCATC
TTACCCAAAG AGAATGCGAA GGATATCAGC GAGCTACCTG AAAAAGTCCG CCGTGAGTTG
AACCTGATCC CGGTCTCATC GATGGATGAG GTGCTGCGGA TTGCCTTGAG CCGGATGCCG
ACACCGGCCA ATAACCAGAA CGGATCTCAT ACCAACAACC GCGGTCAACC CTCGCCGGCT
CCCGCCGGTA CGTGA
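
The gene sequence above is laid out in blocks of ten bases, so it needs light cleanup before any computation. The following is a minimal sketch, not a tool used by the source database: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAACGAAC CAATGTCATT GTTTGATGAT CTGCCTGAGG AGCATGATGA ACCGCAAGAA"
last_line = "CCCGCCGGTA CGTGA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))   # 60 bases on a full line
print(first[:3])    # ATG, the start codon
print(last[-3:])    # TGA, a stop codon in translation table 11
print(gc_fraction(first))
```

Applying `gc_fraction` to the complete cleaned sequence should give a value comparable to the GC content listed in the record, although that figure was not recomputed here.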
 
Protein sequence
MNEPMSLFDD LPEEHDEPQE APERRLPMVV LGEMVIMPHM TIPLQVPQGK SYRAMERAWE 
EDRDVLLIFV REHQLEGYKS NQPQNLPPIG VIAQLQEFAK LNDGTARVIL EGQSRAQIIE
AIQITPFYRV RCRPYTDPPV SGLEVEALME TVKQQVDEFV EHLGEVPQEA VQFVHRIDRP
GHLADIVTWG PAFDFKDRLE VLNTLDPVER LRKVYLVLAR QLELLKLRVK IQQDTKEVLD
QSQREYFLRE QLRIIRRELG EDEEGDDPID ELRRKIHELD APEYVKNQAL HELKRLAQQG
MNNPESGVIR TYLDWILSLP WADEELPEIS ITEAQKVLDA DHYGLEKVKE RILEYLAVRK
LAGDKMRSPI LCFVGPPGVG KTSLGRSIAR ALGRKFVRTS LGGVRDEAEI RGHRRTYIGA
MPGRIIQAMK NAKSKSPVYI LDEVDKIGLD FRGDPTSALL EVLDPEQNNA FSDHYLEIPF
DLSKVIFIAT ANQLDPIPLP LRDRMEIIEI GGYTEDEKLE IARGFLIPKQ REFHGLTEDQ
IEFTEGAILK LIREYTREAG VRGLEREIAS LCRKVARQVA EQTEANGELP PKFVIDEAAV
VKYLGPERYT YGIAEEQDEV GVATGVAWTS AGGDILSIEV LPFKGKGQLQ LTGQLGEVMK
ESAQTAVSYV RSRAADFGID PNTFEETNIH IHIPEGAVPK DDPSAGITLT TALISALTGT
PVRRDVAMTG EVTLRGKVLP IGGLKEKTLA AHRAGIRTFI LPKENAKDIS ELPEKVRREL
NLIPVSSMDE VLRIALSRMP TPANNQNGSH TNNRGQPSPA PAGT