Gene Cagg_2338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2338 
Symbol 
ID7268688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2842298 
End bp2845567 
Gene Length3270 bp 
Protein Length1089 aa 
Translation table11 
GC content56% 
IMG OID643567167 
ProductTetratricopeptide TPR_4 
Protein accessionYP_002463652 
Protein GI219849219 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.878035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.026565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGCA ACCGGGCAAT CTTTGATCGG GCGATGGAAC AGTGCCGCGA GGCATCCGCA 
AAAGGACGCT GGGAAGACTC GTTACGCGCT GCAGTGCGCG CCTTACAAGA GTTTCCCCAA
GATATTGAAG CGCGCACTGC GGCAGCCGTT GCCTTGTTCC AAACGAATCG GTTGGATAAA
GCATTGCAGG CGTTTAGCGA TCTGTACGAG GCCGACCCAA ACAATGCGTT CTATCTCAAT
TACATCGCTC AGTGCTATCG CCGTCAAGGA AACATTCCTG CCGCCGTCGA GGCCTACAGT
GCCCTGGCCG ATCTGCATAG CGCCCAGCAG CGCGCGCTGC AAGCTACTGA AGCGTTGCGT
GAATTACTGA CGTTACAGCC TGAACGAGAC GATCAGCGTC GGCGCTTGGC CCAGCTCTAC
GAAGACCTTG GCGCAATCGC CGAATCAGCC GAGACTCATC TTGAACTGGC CCAACGCTTT
ATCGCACGGG GAGAAACGGC TGAAGCGTTG TCTGAGGTCG AGTGGGTTTT GCGCCTCGAT
GCCAATCACC GCGTCGCCCG TGAATTGGCG ACGCGCCTAC GCGAACAACT GGGCGGGTCG
CCGAATCCAC CCCTTCGTGG CACCAGTAGC TTACGGGCAG CTTACCCTAC CAGCGCCCTT
CGCGGTTCGC AAACGCAACC TGAGCAGTTG ATCGCTGAAG CGATGGCCTG TCAGCAAGCG
GGTGATGAAG AGCGGGCTAT CGCTTTGTAC GAACAGGCGG TGCAAGCCGG GATCGAACGG
GCTGATGTCC TCTATTCACT CGGGCTACTG TACCAGAGTC AAGGTAATCT TAAAGCGGCT
GTCAGCGTTC TCACGCGCGC GGCCGGTGAC CCCGAGTACG CCCTGTCGGC CCACTTTGCG
CTTGGGCAAG TCTATCGTGA CCTCGGTCAA TTACCGCAAG CAGCGCAAGA ATTTGAGACG
ACCATCGGAT TGGTTGATCT CGAAACGATC GGCAAGGCCG AGGTTGATGA CCTGATTGCA
ATGTACGAGA GCGCGGCAAC GATTTATGAG CAACTGGGCG ATCTGGCTCG TGCCTCACTG
CTCTATGGTA CCCTTGCCGA GTTTCTGACC AGCAAGCGAT GGGGACGCGA TCGTGCTGCC
GAGTTCAAAA ATCGGGCCAA GGAACTGGCC GATCGCAATA TGTTCGCGAA GCTACGCACG
ATCGGTACCG GTGTTCTCCA GCCAACACCT GCCACACCAC CACCATCATC TCCACCGATA
ATTGACCCCG GAAGTGAGCG ATGGGGTAAG CTGCGCCCGA TTACCGATGT CTTGCGGTCA
GGTGGGCGTT CTGAAGAAGA CGTAACACCG GAGGTCACAC CGGCAACGCT CGAACAATTG
CCGATTATCG AGCGAATGAG CGTACCGACC AGCCTACCCA CGCCGGCGTT TCCACCACCG
ACGCCGCTCG ATCCGTCTGG TCTTGATGAG GTGACGGCCG GCTGGCTCGA ACTTAGTGGT
ACCTATCTCG AACAGGGTCT GCTCGATGCC GCTCTCGACG CTTGTATGGA GGTGATTCAT
CATAATGTCG AATATCTGCC CATCCACTTA CGGATGGGCG AGATCTACGA ACGGCAGGGG
CGTCCTGAAG AGGCATTGGC AAAATATCAG TTACTGATCG AGACGTATCA GGTGCGAGGC
GAAGCCGAGA AGGCGATTGA TGTCTATTTT CGCTTTATCG AGCTGTCACC CGATTCCATA
AATGCACGCT CGAAGTTGGC CGAGATACTA CGGCAGACCG GCCGTATCGA TGAAGCTGTC
GACCAGTCGT TACAGGTTGC CAATACCTAT TTTCGATTGG GTCAAACGAA TAAAGCGCTC
GAAGAGTTTC GCCGTTTGTT ACAATGGGCA CCCAAACATC GCGAGGGACA CGCTCAATAT
GGATTGGCCT TGCTCAAACT GGAGCGATTT GAAGCCGCAC TCGATGAATT TCGGCAGGCA
CTCGAACTCG GCTCACCGGA CGATCCGGTC GCGCTAGCAC GGCTCAACAT CACGCTCGCC
CTGATGGGCG AACAACCAAA TGTCATTTGG GACTCACTGG CCACCGTTCT AGAGCAATTA
CGCAAGCATC CCACTGAATT TGCTGCCGTT CAAGCGGAAT ACCGGGCTGC CCTGTTGATC
GACGACCGCG CGTTGCTCCA TTATATGCTG GCGATCATCC AACAACAGCA CGAGCAACAC
CATTTGGCGC TCCTTGAGCT TGAACAAGCT CAAATCTTGC TCAACAGCGA ACCCGACCCG
ATGCTCCCGC CAGCCCTCAT GTATCGCGCA ACCGCCGACA GTTATATTGC GTTAGGACAA
GCCGAGCAGG CACTCGAACA GTTACGCAAA GGGCAAGCCG TTGCCGAGCA GACAACGCCC
AACTCATCGA TCCGCCATCC ATTCGCCATT CCACCGTCGC GTGGCGATAT GGTGCGTCGC
ATGGCCGAAG CGTATGCGGT CAGTGATGAT TTAGTCGGGG CAGAGCAAGC GTTACTGGAA
GCGAAAAAGT TATTGCCATA TGACCGGGCG ATCTACACCA AGCTGGCGAA CGTTTATTTT
CGGCAGGGTA AGCTCGCCGA GGCATTGGCC CAACTCGATG AGCTGGCAAC GTATTACGAA
GAACGCCAAC AATTGGATTT AGCCATCGAG CTTCTCGAAT TCGCAGTACA GCTTGCTCCT
AACCATATCG GCATGAGTAA CCGGTTGGCG CGCTTACAAC TGCGGGTTGG GAAACTCGAC
CAGGGTTTAG CCGGGCTGGT GCGGGTCAGT GAACTACAAA AACGTGCCGG TCAATTGAAA
GATGCCGTTG CGTCGTTACA GGAAGTCGCG CAAACGTATT GGATGTTGAG TGATCATGAA
CGTGCGCGCG AGATGTACGA CCGAATTGTA CAGATAGCAC CGAACGATGT TGATGCGCGT
CAATGGTTGG CCCTTATGCA CACTCTCTCG CGTCGCACCA AAGAAGCGAT CAGCGAAAAG
AAGCAGATCG CGCGCATTTT TGCCCAACAA CGCGATTATG ACAATGCGAT TGCCGAGCTG
CACCAGATTA TTGGCCTCGA TCAGAACGAT TTAGAAGCCT ACTTCATGCT CTACGACATG
CTCATGCGAC GCGAGGAATA CGGACAGGCC AGCCAACTGT GTCGGCGCAT GTTGAAGATG
CCGGGAATTG AAACGGAACG GGTGGAAGCG ATGTTGAGCG CCGCCAATCG TATGCTTGAG
CAACGCAAGC CGGCGCCGCC ACAGAGTTGA
 
Protein sequence
MAGNRAIFDR AMEQCREASA KGRWEDSLRA AVRALQEFPQ DIEARTAAAV ALFQTNRLDK 
ALQAFSDLYE ADPNNAFYLN YIAQCYRRQG NIPAAVEAYS ALADLHSAQQ RALQATEALR
ELLTLQPERD DQRRRLAQLY EDLGAIAESA ETHLELAQRF IARGETAEAL SEVEWVLRLD
ANHRVARELA TRLREQLGGS PNPPLRGTSS LRAAYPTSAL RGSQTQPEQL IAEAMACQQA
GDEERAIALY EQAVQAGIER ADVLYSLGLL YQSQGNLKAA VSVLTRAAGD PEYALSAHFA
LGQVYRDLGQ LPQAAQEFET TIGLVDLETI GKAEVDDLIA MYESAATIYE QLGDLARASL
LYGTLAEFLT SKRWGRDRAA EFKNRAKELA DRNMFAKLRT IGTGVLQPTP ATPPPSSPPI
IDPGSERWGK LRPITDVLRS GGRSEEDVTP EVTPATLEQL PIIERMSVPT SLPTPAFPPP
TPLDPSGLDE VTAGWLELSG TYLEQGLLDA ALDACMEVIH HNVEYLPIHL RMGEIYERQG
RPEEALAKYQ LLIETYQVRG EAEKAIDVYF RFIELSPDSI NARSKLAEIL RQTGRIDEAV
DQSLQVANTY FRLGQTNKAL EEFRRLLQWA PKHREGHAQY GLALLKLERF EAALDEFRQA
LELGSPDDPV ALARLNITLA LMGEQPNVIW DSLATVLEQL RKHPTEFAAV QAEYRAALLI
DDRALLHYML AIIQQQHEQH HLALLELEQA QILLNSEPDP MLPPALMYRA TADSYIALGQ
AEQALEQLRK GQAVAEQTTP NSSIRHPFAI PPSRGDMVRR MAEAYAVSDD LVGAEQALLE
AKKLLPYDRA IYTKLANVYF RQGKLAEALA QLDELATYYE ERQQLDLAIE LLEFAVQLAP
NHIGMSNRLA RLQLRVGKLD QGLAGLVRVS ELQKRAGQLK DAVASLQEVA QTYWMLSDHE
RAREMYDRIV QIAPNDVDAR QWLALMHTLS RRTKEAISEK KQIARIFAQQ RDYDNAIAEL
HQIIGLDQND LEAYFMLYDM LMRREEYGQA SQLCRRMLKM PGIETERVEA MLSAANRMLE
QRKPAPPQS