Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2338 |
Symbol | |
ID | 7268688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 2842298 |
End bp | 2845567 |
Gene Length | 3270 bp |
Protein Length | 1089 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643567167 |
Product | Tetratricopeptide TPR_4 |
Protein accession | YP_002463652 |
Protein GI | 219849219 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.878035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.026565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGGCA ACCGGGCAAT CTTTGATCGG GCGATGGAAC AGTGCCGCGA GGCATCCGCA AAAGGACGCT GGGAAGACTC GTTACGCGCT GCAGTGCGCG CCTTACAAGA GTTTCCCCAA GATATTGAAG CGCGCACTGC GGCAGCCGTT GCCTTGTTCC AAACGAATCG GTTGGATAAA GCATTGCAGG CGTTTAGCGA TCTGTACGAG GCCGACCCAA ACAATGCGTT CTATCTCAAT TACATCGCTC AGTGCTATCG CCGTCAAGGA AACATTCCTG CCGCCGTCGA GGCCTACAGT GCCCTGGCCG ATCTGCATAG CGCCCAGCAG CGCGCGCTGC AAGCTACTGA AGCGTTGCGT GAATTACTGA CGTTACAGCC TGAACGAGAC GATCAGCGTC GGCGCTTGGC CCAGCTCTAC GAAGACCTTG GCGCAATCGC CGAATCAGCC GAGACTCATC TTGAACTGGC CCAACGCTTT ATCGCACGGG GAGAAACGGC TGAAGCGTTG TCTGAGGTCG AGTGGGTTTT GCGCCTCGAT GCCAATCACC GCGTCGCCCG TGAATTGGCG ACGCGCCTAC GCGAACAACT GGGCGGGTCG CCGAATCCAC CCCTTCGTGG CACCAGTAGC TTACGGGCAG CTTACCCTAC CAGCGCCCTT CGCGGTTCGC AAACGCAACC TGAGCAGTTG ATCGCTGAAG CGATGGCCTG TCAGCAAGCG GGTGATGAAG AGCGGGCTAT CGCTTTGTAC GAACAGGCGG TGCAAGCCGG GATCGAACGG GCTGATGTCC TCTATTCACT CGGGCTACTG TACCAGAGTC AAGGTAATCT TAAAGCGGCT GTCAGCGTTC TCACGCGCGC GGCCGGTGAC CCCGAGTACG CCCTGTCGGC CCACTTTGCG CTTGGGCAAG TCTATCGTGA CCTCGGTCAA TTACCGCAAG CAGCGCAAGA ATTTGAGACG ACCATCGGAT TGGTTGATCT CGAAACGATC GGCAAGGCCG AGGTTGATGA CCTGATTGCA ATGTACGAGA GCGCGGCAAC GATTTATGAG CAACTGGGCG ATCTGGCTCG TGCCTCACTG CTCTATGGTA CCCTTGCCGA GTTTCTGACC AGCAAGCGAT GGGGACGCGA TCGTGCTGCC GAGTTCAAAA ATCGGGCCAA GGAACTGGCC GATCGCAATA TGTTCGCGAA GCTACGCACG ATCGGTACCG GTGTTCTCCA GCCAACACCT GCCACACCAC CACCATCATC TCCACCGATA ATTGACCCCG GAAGTGAGCG ATGGGGTAAG CTGCGCCCGA TTACCGATGT CTTGCGGTCA GGTGGGCGTT CTGAAGAAGA CGTAACACCG GAGGTCACAC CGGCAACGCT CGAACAATTG CCGATTATCG AGCGAATGAG CGTACCGACC AGCCTACCCA CGCCGGCGTT TCCACCACCG ACGCCGCTCG ATCCGTCTGG TCTTGATGAG GTGACGGCCG GCTGGCTCGA ACTTAGTGGT ACCTATCTCG AACAGGGTCT GCTCGATGCC GCTCTCGACG CTTGTATGGA GGTGATTCAT CATAATGTCG AATATCTGCC CATCCACTTA CGGATGGGCG AGATCTACGA ACGGCAGGGG CGTCCTGAAG AGGCATTGGC AAAATATCAG TTACTGATCG AGACGTATCA GGTGCGAGGC GAAGCCGAGA AGGCGATTGA TGTCTATTTT CGCTTTATCG AGCTGTCACC CGATTCCATA AATGCACGCT CGAAGTTGGC CGAGATACTA CGGCAGACCG GCCGTATCGA TGAAGCTGTC GACCAGTCGT TACAGGTTGC CAATACCTAT TTTCGATTGG GTCAAACGAA TAAAGCGCTC GAAGAGTTTC GCCGTTTGTT ACAATGGGCA CCCAAACATC GCGAGGGACA CGCTCAATAT GGATTGGCCT TGCTCAAACT GGAGCGATTT GAAGCCGCAC TCGATGAATT TCGGCAGGCA CTCGAACTCG GCTCACCGGA CGATCCGGTC GCGCTAGCAC GGCTCAACAT CACGCTCGCC CTGATGGGCG AACAACCAAA TGTCATTTGG GACTCACTGG CCACCGTTCT AGAGCAATTA CGCAAGCATC CCACTGAATT TGCTGCCGTT CAAGCGGAAT ACCGGGCTGC CCTGTTGATC GACGACCGCG CGTTGCTCCA TTATATGCTG GCGATCATCC AACAACAGCA CGAGCAACAC CATTTGGCGC TCCTTGAGCT TGAACAAGCT CAAATCTTGC TCAACAGCGA ACCCGACCCG ATGCTCCCGC CAGCCCTCAT GTATCGCGCA ACCGCCGACA GTTATATTGC GTTAGGACAA GCCGAGCAGG CACTCGAACA GTTACGCAAA GGGCAAGCCG TTGCCGAGCA GACAACGCCC AACTCATCGA TCCGCCATCC ATTCGCCATT CCACCGTCGC GTGGCGATAT GGTGCGTCGC ATGGCCGAAG CGTATGCGGT CAGTGATGAT TTAGTCGGGG CAGAGCAAGC GTTACTGGAA GCGAAAAAGT TATTGCCATA TGACCGGGCG ATCTACACCA AGCTGGCGAA CGTTTATTTT CGGCAGGGTA AGCTCGCCGA GGCATTGGCC CAACTCGATG AGCTGGCAAC GTATTACGAA GAACGCCAAC AATTGGATTT AGCCATCGAG CTTCTCGAAT TCGCAGTACA GCTTGCTCCT AACCATATCG GCATGAGTAA CCGGTTGGCG CGCTTACAAC TGCGGGTTGG GAAACTCGAC CAGGGTTTAG CCGGGCTGGT GCGGGTCAGT GAACTACAAA AACGTGCCGG TCAATTGAAA GATGCCGTTG CGTCGTTACA GGAAGTCGCG CAAACGTATT GGATGTTGAG TGATCATGAA CGTGCGCGCG AGATGTACGA CCGAATTGTA CAGATAGCAC CGAACGATGT TGATGCGCGT CAATGGTTGG CCCTTATGCA CACTCTCTCG CGTCGCACCA AAGAAGCGAT CAGCGAAAAG AAGCAGATCG CGCGCATTTT TGCCCAACAA CGCGATTATG ACAATGCGAT TGCCGAGCTG CACCAGATTA TTGGCCTCGA TCAGAACGAT TTAGAAGCCT ACTTCATGCT CTACGACATG CTCATGCGAC GCGAGGAATA CGGACAGGCC AGCCAACTGT GTCGGCGCAT GTTGAAGATG CCGGGAATTG AAACGGAACG GGTGGAAGCG ATGTTGAGCG CCGCCAATCG TATGCTTGAG CAACGCAAGC CGGCGCCGCC ACAGAGTTGA
|
Protein sequence | MAGNRAIFDR AMEQCREASA KGRWEDSLRA AVRALQEFPQ DIEARTAAAV ALFQTNRLDK ALQAFSDLYE ADPNNAFYLN YIAQCYRRQG NIPAAVEAYS ALADLHSAQQ RALQATEALR ELLTLQPERD DQRRRLAQLY EDLGAIAESA ETHLELAQRF IARGETAEAL SEVEWVLRLD ANHRVARELA TRLREQLGGS PNPPLRGTSS LRAAYPTSAL RGSQTQPEQL IAEAMACQQA GDEERAIALY EQAVQAGIER ADVLYSLGLL YQSQGNLKAA VSVLTRAAGD PEYALSAHFA LGQVYRDLGQ LPQAAQEFET TIGLVDLETI GKAEVDDLIA MYESAATIYE QLGDLARASL LYGTLAEFLT SKRWGRDRAA EFKNRAKELA DRNMFAKLRT IGTGVLQPTP ATPPPSSPPI IDPGSERWGK LRPITDVLRS GGRSEEDVTP EVTPATLEQL PIIERMSVPT SLPTPAFPPP TPLDPSGLDE VTAGWLELSG TYLEQGLLDA ALDACMEVIH HNVEYLPIHL RMGEIYERQG RPEEALAKYQ LLIETYQVRG EAEKAIDVYF RFIELSPDSI NARSKLAEIL RQTGRIDEAV DQSLQVANTY FRLGQTNKAL EEFRRLLQWA PKHREGHAQY GLALLKLERF EAALDEFRQA LELGSPDDPV ALARLNITLA LMGEQPNVIW DSLATVLEQL RKHPTEFAAV QAEYRAALLI DDRALLHYML AIIQQQHEQH HLALLELEQA QILLNSEPDP MLPPALMYRA TADSYIALGQ AEQALEQLRK GQAVAEQTTP NSSIRHPFAI PPSRGDMVRR MAEAYAVSDD LVGAEQALLE AKKLLPYDRA IYTKLANVYF RQGKLAEALA QLDELATYYE ERQQLDLAIE LLEFAVQLAP NHIGMSNRLA RLQLRVGKLD QGLAGLVRVS ELQKRAGQLK DAVASLQEVA QTYWMLSDHE RAREMYDRIV QIAPNDVDAR QWLALMHTLS RRTKEAISEK KQIARIFAQQ RDYDNAIAEL HQIIGLDQND LEAYFMLYDM LMRREEYGQA SQLCRRMLKM PGIETERVEA MLSAANRMLE QRKPAPPQS
|
| |