Gene Information |
Locus tag | Cagg_2951 |
Symbol | |
ID | 7268824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3614183 |
End bp | 3617278 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643567773 |
Product | SMC domain protein |
Protein accession | YP_002464247 |
Protein GI | 219849814 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
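The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single untranslated stop codon; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3614183
end_bp = 3617278

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 3096
print(protein_length_aa)  # 1031
```

Under those assumptions the results agree with the listed Gene Length (3096 bp) and Protein Length (1031 aa).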
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00128287 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000000102622 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGATACCAA TCCAGCTTTC ACTGCGAAAC TTTATGTGTT ACCGTACCGA CGACGGTAAA CCACTCCGCC TCGAACTTGA TGGTCTCCAC GTGCTGTGTT TATCAGGCGA GAATGGGGCC GGCAAATCGA CCCTGCTTGA CGCGATCACG TGGGCTTTGT GGGGAAAAGC CCGTAGTGCC GACGATGACC TGATCACACA AGGTGAGACC GAGATGATGG TCGAGCTGGT ATTCGCCCTT GATGGTCGCA CATATCGGGT GATTCGTCAG CACCAACGTG GGCGGAGCAC CGGCAAAGGT ACGAGTGCCG GAAAGACGTG GCTCGATCTG CAAATACTAG ACGGCACGCA GTGGCGACCA ATCGGCGAAA ATACCGTCCG CGAGACGCAG GCCAAGATCG ACGCTCTGTT GCGAATGTCG TACCGCACCT TCATCAATGC GTCGTTCCTG TTGCAAGGCC AGGCCGATAA GTTCACCAGT GCTCCAGCCG CCGAGCGTAA ACAGGTGCTG GCCGAGATTC TCGGTCTGGA CGAATATGCC GAGCTTGAAC AACGTGCCCG CGAACGGGTA CGTATCCTCG ATGCCGAGAC GATCAGGGTA CGTGGTCAGC TTGAATCGTT GCAACCGACT GCAGCCAAAG TACCGTTCTG GCAAGAGGCG GTGGTCAATG CTGAACAACA ACGGCGGCGT TTGCAAGCGG CTTACGCTGA GTTAGAGGCG GAATATACCG TTGCCGTCAA TCGGCTCCGC GAGTTGGAAG CCCTCGCCCA ACGTCACCGT GAACTCCTCG ACCGGATTAC GTCTTTGCAT ACCGACATTC AGCGTTACAA CCGTGAATTG AATGAATTAG CCCAACGGAT CAGTCACGAC GAAAGCATCA TCGCCCGTCG CTCGGTTATC CAAGCCGGTC TCACCGAACT GACTACCGCT CGTGCCGAAT TGGAACGTCT TAGGCAAGTG CGCGACCAGT ACAATACGCT GATGATGCGC CGAACCGAGC TGAAACAGGA ACTCAAGACG GCGTTCTACG AATTGCGCGA GCGTCTATCT CGGGCCGAAC AAGAACGTGA ACGCCTGCAC ACTGCGGTAA CCCGCTTCGC CGAATTACAA CAGCAAGTCG CTACCTTGCA GCACCGCCTA TACGAGTTGG CACCGGCGCA TGCGCGCATG GCACACCTCC AAGACCAACG GATCGCCATT GAACAACAAC TATCGCACCT GAAAGAACTC ACATACCGAC AAACGGTGCT CAAAGATCAG CTTGATCAGC GCCGCGTTGC GCTCAAGAAC GAGCAGGATC GTTTGCAGCA AGATCGGCAA CGCCTCGACC GTCAACTCGC CGACGTTGCC CGGTGGCGGG TGGCGCTTCA GGAAGCGCAA ATGGCCCTTG CCGCTTTACG TGCTCTGGAA GAACAGCAGT TGCTGCACCG TCGGCGTGAA CAAGAGATTG TTGAAACACT TGGGAAGGCA CGGGCTATTG CGATGCAGGC ACAACAAGCG ATGGATAAAC TACGGGCAAA TCAGGCTCTG CTGGCTACCG GTAGTGGTGA ATGCCCGGTT TGCCGTCACC GGCTCGATCC TGCTGAAACT GAACACGTGA TGGCTCACTA CGCCCACGAG CTAGCAGCGC TTCGGCACGA GGAAGCACGT GCATTGGCAA CGGCGCAAAC CGCCGAACAA GCACTGGCAA CCGTGCGCGC CACGATTGCG GACAACGAGC AAGAACTGGA CAAGTTACGC CGACAAGCTG CCGCTATCGA GACGCTGGAA CGTCAACTTG CGCAAGCTAC TGCTTGGGAA 
CAAGAACGGA ACGATATTGT CCGGCGACTC ACAGCTCTTG AAGCGAAACT TGCCACCGAC GAGATCGATC CGCCGCTCCA AGCCGAACTC ACTGCGGTCA CCGCACAACT GACACAGTTT GACCACATAA CCGGTCTCCA AAACGACTTG GCGATGATCA ACGACGAACT AACCGCCTGT GAGCGCCAAC TGCGTGAACA GAGCCGCCTC GAAGGTGAAC TCGATAGCTG TCAACGCGAG CTAGAACGCT TGCAAGATGC CTCGGCCAAA CTTCCCGACG TCGAGGCAGT CGTCGCCGAA CTCCAACGCC AAATCGAGAC TAACGACTTC GCCCACGAGA TTCGGAGCGC CGGACGGCAA GTGGAAGCCG AGATTGCAGC TCTCAACTAC CAACCTGAAT TGCTCGAGAT GGCCGAGGCG AAGGTTCGTT CTTTAGCTCA TTGGGAACAG GCAGAACGCG AATTGATATT GGCTGAACAA CGCTACGCTG GCGAACTAAA ACTTCGTTCG CAGACGCAAA CGCTGCTCGC TCACGCCGAG CGTGAGCGAC AAACGCTGCA AGCCGAGGTG GATACATTAG CTAACGAACT AACCAAACTG CCGCTCGTGC AAACTACCGT TACCCAGATC AAACAACGTC TAGACGAAAC TGCACGCGCA TTGCAGATTG CCGAGCGCGA TCTGACCGAA AAGCAGACGT ACCTCCGGCA GGCAGAGGCT GCCGCTGCAC AATTAGAGAC ATTACAAGCC CAGGAACGAC AGCTCTGCGA ACGTAGCGCA CTCTTTGCCG AGCTGGCCGA AGCCTTTGGC AAAAAGGGGG TGCAGGCGAT GTTGATCGAA ACCGCCATCC CCCAGATCGA AGACGAAGCT AACAGCTTGC TGGCCCGCTT GACCGATGGG CAGATGCATC TGCGCTTCGA GATGCAGCGT GACACCAAGA AGGGTGATAC GGTTGAGACG CTCGATGTGC GTGTCGCCGA TGCCCTCGGT ACACGCGACT ACAAGACGTT CAGCGGTGGC GAGGCCATGC GGGTCAACTT CGCAATTCGG ATTGCGCTTT CTCGTCTGCT CGCTCACCGG GCCGGTGCGC GCCTTGAAAC ACTGGTAATC GATGAAGGAT TCGGTACTCT CGACGCCGAT GGCCGTGAGC GGATGGTAGA GGCAATTACG GCGATTCAAC AAGATTTTGC CCGGATTATC GTCATCACCC ACATTGACGA TCTCAAAGAT CGCTTTCCGG CAACACTTGA AATCCGCAAG ACACCTGCCG GTAGTCGGTG GGAATTGCGC GGGTAA
|
Protein sequence | MIPIQLSLRN FMCYRTDDGK PLRLELDGLH VLCLSGENGA GKSTLLDAIT WALWGKARSA DDDLITQGET EMMVELVFAL DGRTYRVIRQ HQRGRSTGKG TSAGKTWLDL QILDGTQWRP IGENTVRETQ AKIDALLRMS YRTFINASFL LQGQADKFTS APAAERKQVL AEILGLDEYA ELEQRARERV RILDAETIRV RGQLESLQPT AAKVPFWQEA VVNAEQQRRR LQAAYAELEA EYTVAVNRLR ELEALAQRHR ELLDRITSLH TDIQRYNREL NELAQRISHD ESIIARRSVI QAGLTELTTA RAELERLRQV RDQYNTLMMR RTELKQELKT AFYELRERLS RAEQERERLH TAVTRFAELQ QQVATLQHRL YELAPAHARM AHLQDQRIAI EQQLSHLKEL TYRQTVLKDQ LDQRRVALKN EQDRLQQDRQ RLDRQLADVA RWRVALQEAQ MALAALRALE EQQLLHRRRE QEIVETLGKA RAIAMQAQQA MDKLRANQAL LATGSGECPV CRHRLDPAET EHVMAHYAHE LAALRHEEAR ALATAQTAEQ ALATVRATIA DNEQELDKLR RQAAAIETLE RQLAQATAWE QERNDIVRRL TALEAKLATD EIDPPLQAEL TAVTAQLTQF DHITGLQNDL AMINDELTAC ERQLREQSRL EGELDSCQRE LERLQDASAK LPDVEAVVAE LQRQIETNDF AHEIRSAGRQ VEAEIAALNY QPELLEMAEA KVRSLAHWEQ AERELILAEQ RYAGELKLRS QTQTLLAHAE RERQTLQAEV DTLANELTKL PLVQTTVTQI KQRLDETARA LQIAERDLTE KQTYLRQAEA AAAQLETLQA QERQLCERSA LFAELAEAFG KKGVQAMLIE TAIPQIEDEA NSLLARLTDG QMHLRFEMQR DTKKGDTVET LDVRVADALG TRDYKTFSGG EAMRVNFAIR IALSRLLAHR AGARLETLVI DEGFGTLDAD GRERMVEAIT AIQQDFARII VITHIDDLKD RFPATLEIRK TPAGSRWELR G
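The gene sequence is displayed in space-separated groups of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way the GC content field might be recomputed from the displayed sequence; `gc_content` is a hypothetical helper rather than part of any database tooling, and the example string is only the first two groups of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used for display, normalise the case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base groups of the gene sequence, used as a short example.
example = "ATGATACCAA TCCAGCTTTC"
print(round(gc_content(example), 1))  # 40.0
```

Passing the full gene sequence to the same function would give a value to compare with the GC content field of the record.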
|
| |