Gene Cagg_3615 details


Gene Information

Locus tag: Cagg_3615
Symbol:
ID: 7269759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4388876
End bp: 4391242
Gene Length: 2367 bp
Protein Length: 788 aa
Translation table: 11
GC content: 58%
IMG OID: 643568422
Product: ATP-dependent protease La
Protein accession: YP_002464888
Protein GI: 219850455
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
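
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table for Cagg_3615.
START_BP = 4388876
END_BP = 4391242
GENE_LENGTH_BP = 2367
PROTEIN_LENGTH_AA = 788


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print(computed_gene == GENE_LENGTH_BP)        # True
print(computed_protein == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 4391242 - 4388876 + 1 gives 2367 bp, and 2367 / 3 - 1 gives 788 aa, matching the table.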


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.395652
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGTGAGG CGAACGCTTT CCAACGGCCG GCAACGCCAG ACATTCCTGA AGTATTGCCG 
ATTCTGCCGA TCAATAATGC AGTGCTCTTC CCAGGCATGT TTTTGCCACT GGTCGTTAGT
GGCGACGCAT GGGTACGTCT GGTCGATGAA GCAGCACTGT CAACGAAGAT GATCGGTGTG
TTTCGTCGCG TGCAGGCCGG TGAGGAGTTT GAACCGTCAA TGTTAGCGCC TACCGGCACG
GCGGCAATGA TCGTGCGCAT GATGCGTTTG CCGCAGGGAG GAGTTCAACT GCTCTTGCAA
GGGCAGGCGC GGATCAAGGT ACAACACTGG GTGTCGATCA AGCCTTATCC CCAGGCACGG
GTGAGCATCT CTCGTGATCC GCACGAAACA TCACTGGAGA CATCGGGCTT GGCGCGGGCT
GCACTGGCCG GTTTTCAGCA AATTGTCGAG CTGAGTCCTA ATCTGCCCGA TGAATTAGCC
ATTGCCGCCG CCAATGCACC ACATCCCGGT ATGTTGGCCG ACCTCATTGC GGCCAATCTG
AACCTCAATC TCGACGACCA GCAGGCCGTG CTCGACATGC TCGATGTCAC CGAGCGGTTG
CAGCACGTGT TGCGCCTGCT CGATCGCGAG CGCGAAATTC TGATGATCGG GCGCAAGGCG
CAGGAAGAGG TCGCGAAGAA TCAGCGCGAG TATGTCTTGC GCCAGCAGCT TGAGGCGATC
AAGCGCGAGT TGGGCGAGAC TGATGATCAC GCTGTCGAGA TTGCTGAGTT GCGCCGGCGG
CTGGAAGAGG CCAACCTGCC CACCGAAGCG CGTCAAGAGG CCGAACGCGA GCTGTCGCGG
CTCGAACGCA TGCCCCCCGG TGCAGCCGAG TATACGGTAG CGCGTACCTA CCTCGATTGG
ATCCTCGATC TGCCGTGGCA TGCCAGTACC GAAGACAATC TCGATATTAC ACAGGCGCGG
CGTGTCCTCG ACGAGGATCA TTACGATCTA GATCGGATCA AAGAGCGAAT TATTGAATAC
CTTGCCGTGC GAAAACTTCG CCAAGAAGCG GGTGCCGGTA GCGAAACCCG CGGACCAATC
CTCTGTTTCG TCGGCCCACC CGGTGTCGGT AAGACGAGTC TGGGGGCTTC CATCGCCCGC
GCATTGGGGC GTAAATTTGT GCGTGTCGCA TTGGGCGGCG TGCGCGATGA AGCCGAAATC
CGCGGTCATC GCCGTACCTA CATCGGCGCC TTACCCGGAC GGATTATTCA AGGCTTAAGC
CGCGCCGGTA GCAACAACCC GGTCTTACTC CTCGATGAAG TTGATAAACT GAGTATCGGT
TTTCAGGGCG ACCCGGCGGC AGCGTTGCTC GAGGTGCTCG ACCCTGAGCA GAACGTCGCC
TTTGTCGATC GCTATCTCGA TGTGCCGTTC GATCTGAGCA AAGTGCTCTT TGTCTGCACC
GCCAATCGCG CCGACACCAT CCCGCCAGCG CTGCTCGACC GTATGGAGCT GTTGGAGTTG
GCCGGCTACA CTGAGCAAGA GAAACTGGAG ATCGCACGAC GCTACCTGAT CCCACGTCAG
CGCAACGAAC AGGGGTTGGC CGAACGCGGC CCCGAATTGA CGACTACTGC ATTACAACGG
CTCATCCGCG AGTACACCCA CGAAGCCGGG GTGCGCGATC TCGAACGGCG TATCGGAGCG
GTGTATCGCA AGATGGCTAC TCGCCTGGCC GAAGGTAAAG AGCTGCCGGC GCAAGTTGAT
GCAGCCGATC TTGATGATCT GCTTGGCCCA CCGCGTTTTC GCAGTGAGAC ACTGCTCGGC
GAAAATGAGG TTGGGGTAGT GACCGGCCTT GCGTGGACGC CGACCGGTGG TGATGTGCTC
TTCATTGAGG TGAGTGTGAT TCCGGGGAAC GGCCAACTGA TCCTCACCGG TCAGTTGGGT
GATGTGATGA AAGAATCGGC GCGGGCCGCA CTCACCTACG CCCGTTCGCG AGCTGCTGAG
CTAGGTATCG AAGCCGAGGT CTTCCAGAAG AGCGATATTC ATATTCACGT ACCTGCCGGC
GCCGTTCCCA AAGATGGCCC TTCAGCCGGT ATCACGATGG CGAGTGCGCT CATTTCGGCG
CTGACGAGAC GTGAAGTGGA TAAGCGAATT GCCATGACCG GTGAGGTCAC GTTGCGCGGA
AAAATCTTGC CGATTGGCGG GGTGAAAGAG AAGCTCCTGG CAGCGCAGCG GGCCGGTGTG
CGTAAAGTAT TGTTACCGAA AGAGAACGAG ATCGATCTGC GAGAGGTACC CGCCGAAGCC
AAAGAACAAC TGGAGATTGT GCTCGTTAAG CACATGGACG AAGTGCTGCG TGAGTTGGGC
CTTGTTGCTG CGCCGGTGAG TGAGTAA
 
Protein sequence
MSEANAFQRP ATPDIPEVLP ILPINNAVLF PGMFLPLVVS GDAWVRLVDE AALSTKMIGV 
FRRVQAGEEF EPSMLAPTGT AAMIVRMMRL PQGGVQLLLQ GQARIKVQHW VSIKPYPQAR
VSISRDPHET SLETSGLARA ALAGFQQIVE LSPNLPDELA IAAANAPHPG MLADLIAANL
NLNLDDQQAV LDMLDVTERL QHVLRLLDRE REILMIGRKA QEEVAKNQRE YVLRQQLEAI
KRELGETDDH AVEIAELRRR LEEANLPTEA RQEAERELSR LERMPPGAAE YTVARTYLDW
ILDLPWHAST EDNLDITQAR RVLDEDHYDL DRIKERIIEY LAVRKLRQEA GAGSETRGPI
LCFVGPPGVG KTSLGASIAR ALGRKFVRVA LGGVRDEAEI RGHRRTYIGA LPGRIIQGLS
RAGSNNPVLL LDEVDKLSIG FQGDPAAALL EVLDPEQNVA FVDRYLDVPF DLSKVLFVCT
ANRADTIPPA LLDRMELLEL AGYTEQEKLE IARRYLIPRQ RNEQGLAERG PELTTTALQR
LIREYTHEAG VRDLERRIGA VYRKMATRLA EGKELPAQVD AADLDDLLGP PRFRSETLLG
ENEVGVVTGL AWTPTGGDVL FIEVSVIPGN GQLILTGQLG DVMKESARAA LTYARSRAAE
LGIEAEVFQK SDIHIHVPAG AVPKDGPSAG ITMASALISA LTRREVDKRI AMTGEVTLRG
KILPIGGVKE KLLAAQRAGV RKVLLPKENE IDLREVPAEA KEQLEIVLVK HMDEVLRELG
LVAAPVSE
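
The gene sequence begins with GTG while the protein sequence begins with M. In translation table 11 (bacterial, archaeal and plant plastid code), GTG can act as an initiation codon, and an initiation codon is read as methionine even though GTG codes for valine elsewhere in a reading frame. The sketch below translates only the first six codons of the gene. The `CODONS` dictionary is a deliberately partial table covering just those codons, and `translate_prefix` is a hypothetical helper, not a function from any library.

```python
# Partial codon table: only the codons found in the first 18 bases of the gene.
CODONS = {
    "GTG": "V",  # valine internally; read as M when used as the start codon
    "AGT": "S",
    "GAG": "E",
    "GCG": "A",
    "AAC": "N",
    "GCT": "A",
}


def translate_prefix(dna: str, table: dict = CODONS) -> str:
    """Translate the start of a CDS, reading the first codon as Met."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [table[codon] for codon in codons]
    if residues:
        residues[0] = "M"  # the initiation codon is translated as methionine
    return "".join(residues)


# First 18 bases of the gene sequence, spacing as shown in the record.
print(translate_prefix("GTGAGTGAGG CGAACGCT"))  # MSEANA
```

The result matches the first six residues of the protein sequence, MSEANA.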