Gene Cagg_2951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2951 
Symbol 
ID7268824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3614183 
End bp3617278 
Gene Length3096 bp 
Protein Length1031 aa 
Translation table11 
GC content56% 
IMG OID643567773 
ProductSMC domain protein 
Protein accessionYP_002464247 
Protein GI219849814 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00128287 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000102622 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGATACCAA TCCAGCTTTC ACTGCGAAAC TTTATGTGTT ACCGTACCGA CGACGGTAAA 
CCACTCCGCC TCGAACTTGA TGGTCTCCAC GTGCTGTGTT TATCAGGCGA GAATGGGGCC
GGCAAATCGA CCCTGCTTGA CGCGATCACG TGGGCTTTGT GGGGAAAAGC CCGTAGTGCC
GACGATGACC TGATCACACA AGGTGAGACC GAGATGATGG TCGAGCTGGT ATTCGCCCTT
GATGGTCGCA CATATCGGGT GATTCGTCAG CACCAACGTG GGCGGAGCAC CGGCAAAGGT
ACGAGTGCCG GAAAGACGTG GCTCGATCTG CAAATACTAG ACGGCACGCA GTGGCGACCA
ATCGGCGAAA ATACCGTCCG CGAGACGCAG GCCAAGATCG ACGCTCTGTT GCGAATGTCG
TACCGCACCT TCATCAATGC GTCGTTCCTG TTGCAAGGCC AGGCCGATAA GTTCACCAGT
GCTCCAGCCG CCGAGCGTAA ACAGGTGCTG GCCGAGATTC TCGGTCTGGA CGAATATGCC
GAGCTTGAAC AACGTGCCCG CGAACGGGTA CGTATCCTCG ATGCCGAGAC GATCAGGGTA
CGTGGTCAGC TTGAATCGTT GCAACCGACT GCAGCCAAAG TACCGTTCTG GCAAGAGGCG
GTGGTCAATG CTGAACAACA ACGGCGGCGT TTGCAAGCGG CTTACGCTGA GTTAGAGGCG
GAATATACCG TTGCCGTCAA TCGGCTCCGC GAGTTGGAAG CCCTCGCCCA ACGTCACCGT
GAACTCCTCG ACCGGATTAC GTCTTTGCAT ACCGACATTC AGCGTTACAA CCGTGAATTG
AATGAATTAG CCCAACGGAT CAGTCACGAC GAAAGCATCA TCGCCCGTCG CTCGGTTATC
CAAGCCGGTC TCACCGAACT GACTACCGCT CGTGCCGAAT TGGAACGTCT TAGGCAAGTG
CGCGACCAGT ACAATACGCT GATGATGCGC CGAACCGAGC TGAAACAGGA ACTCAAGACG
GCGTTCTACG AATTGCGCGA GCGTCTATCT CGGGCCGAAC AAGAACGTGA ACGCCTGCAC
ACTGCGGTAA CCCGCTTCGC CGAATTACAA CAGCAAGTCG CTACCTTGCA GCACCGCCTA
TACGAGTTGG CACCGGCGCA TGCGCGCATG GCACACCTCC AAGACCAACG GATCGCCATT
GAACAACAAC TATCGCACCT GAAAGAACTC ACATACCGAC AAACGGTGCT CAAAGATCAG
CTTGATCAGC GCCGCGTTGC GCTCAAGAAC GAGCAGGATC GTTTGCAGCA AGATCGGCAA
CGCCTCGACC GTCAACTCGC CGACGTTGCC CGGTGGCGGG TGGCGCTTCA GGAAGCGCAA
ATGGCCCTTG CCGCTTTACG TGCTCTGGAA GAACAGCAGT TGCTGCACCG TCGGCGTGAA
CAAGAGATTG TTGAAACACT TGGGAAGGCA CGGGCTATTG CGATGCAGGC ACAACAAGCG
ATGGATAAAC TACGGGCAAA TCAGGCTCTG CTGGCTACCG GTAGTGGTGA ATGCCCGGTT
TGCCGTCACC GGCTCGATCC TGCTGAAACT GAACACGTGA TGGCTCACTA CGCCCACGAG
CTAGCAGCGC TTCGGCACGA GGAAGCACGT GCATTGGCAA CGGCGCAAAC CGCCGAACAA
GCACTGGCAA CCGTGCGCGC CACGATTGCG GACAACGAGC AAGAACTGGA CAAGTTACGC
CGACAAGCTG CCGCTATCGA GACGCTGGAA CGTCAACTTG CGCAAGCTAC TGCTTGGGAA
CAAGAACGGA ACGATATTGT CCGGCGACTC ACAGCTCTTG AAGCGAAACT TGCCACCGAC
GAGATCGATC CGCCGCTCCA AGCCGAACTC ACTGCGGTCA CCGCACAACT GACACAGTTT
GACCACATAA CCGGTCTCCA AAACGACTTG GCGATGATCA ACGACGAACT AACCGCCTGT
GAGCGCCAAC TGCGTGAACA GAGCCGCCTC GAAGGTGAAC TCGATAGCTG TCAACGCGAG
CTAGAACGCT TGCAAGATGC CTCGGCCAAA CTTCCCGACG TCGAGGCAGT CGTCGCCGAA
CTCCAACGCC AAATCGAGAC TAACGACTTC GCCCACGAGA TTCGGAGCGC CGGACGGCAA
GTGGAAGCCG AGATTGCAGC TCTCAACTAC CAACCTGAAT TGCTCGAGAT GGCCGAGGCG
AAGGTTCGTT CTTTAGCTCA TTGGGAACAG GCAGAACGCG AATTGATATT GGCTGAACAA
CGCTACGCTG GCGAACTAAA ACTTCGTTCG CAGACGCAAA CGCTGCTCGC TCACGCCGAG
CGTGAGCGAC AAACGCTGCA AGCCGAGGTG GATACATTAG CTAACGAACT AACCAAACTG
CCGCTCGTGC AAACTACCGT TACCCAGATC AAACAACGTC TAGACGAAAC TGCACGCGCA
TTGCAGATTG CCGAGCGCGA TCTGACCGAA AAGCAGACGT ACCTCCGGCA GGCAGAGGCT
GCCGCTGCAC AATTAGAGAC ATTACAAGCC CAGGAACGAC AGCTCTGCGA ACGTAGCGCA
CTCTTTGCCG AGCTGGCCGA AGCCTTTGGC AAAAAGGGGG TGCAGGCGAT GTTGATCGAA
ACCGCCATCC CCCAGATCGA AGACGAAGCT AACAGCTTGC TGGCCCGCTT GACCGATGGG
CAGATGCATC TGCGCTTCGA GATGCAGCGT GACACCAAGA AGGGTGATAC GGTTGAGACG
CTCGATGTGC GTGTCGCCGA TGCCCTCGGT ACACGCGACT ACAAGACGTT CAGCGGTGGC
GAGGCCATGC GGGTCAACTT CGCAATTCGG ATTGCGCTTT CTCGTCTGCT CGCTCACCGG
GCCGGTGCGC GCCTTGAAAC ACTGGTAATC GATGAAGGAT TCGGTACTCT CGACGCCGAT
GGCCGTGAGC GGATGGTAGA GGCAATTACG GCGATTCAAC AAGATTTTGC CCGGATTATC
GTCATCACCC ACATTGACGA TCTCAAAGAT CGCTTTCCGG CAACACTTGA AATCCGCAAG
ACACCTGCCG GTAGTCGGTG GGAATTGCGC GGGTAA
 
Protein sequence
MIPIQLSLRN FMCYRTDDGK PLRLELDGLH VLCLSGENGA GKSTLLDAIT WALWGKARSA 
DDDLITQGET EMMVELVFAL DGRTYRVIRQ HQRGRSTGKG TSAGKTWLDL QILDGTQWRP
IGENTVRETQ AKIDALLRMS YRTFINASFL LQGQADKFTS APAAERKQVL AEILGLDEYA
ELEQRARERV RILDAETIRV RGQLESLQPT AAKVPFWQEA VVNAEQQRRR LQAAYAELEA
EYTVAVNRLR ELEALAQRHR ELLDRITSLH TDIQRYNREL NELAQRISHD ESIIARRSVI
QAGLTELTTA RAELERLRQV RDQYNTLMMR RTELKQELKT AFYELRERLS RAEQERERLH
TAVTRFAELQ QQVATLQHRL YELAPAHARM AHLQDQRIAI EQQLSHLKEL TYRQTVLKDQ
LDQRRVALKN EQDRLQQDRQ RLDRQLADVA RWRVALQEAQ MALAALRALE EQQLLHRRRE
QEIVETLGKA RAIAMQAQQA MDKLRANQAL LATGSGECPV CRHRLDPAET EHVMAHYAHE
LAALRHEEAR ALATAQTAEQ ALATVRATIA DNEQELDKLR RQAAAIETLE RQLAQATAWE
QERNDIVRRL TALEAKLATD EIDPPLQAEL TAVTAQLTQF DHITGLQNDL AMINDELTAC
ERQLREQSRL EGELDSCQRE LERLQDASAK LPDVEAVVAE LQRQIETNDF AHEIRSAGRQ
VEAEIAALNY QPELLEMAEA KVRSLAHWEQ AERELILAEQ RYAGELKLRS QTQTLLAHAE
RERQTLQAEV DTLANELTKL PLVQTTVTQI KQRLDETARA LQIAERDLTE KQTYLRQAEA
AAAQLETLQA QERQLCERSA LFAELAEAFG KKGVQAMLIE TAIPQIEDEA NSLLARLTDG
QMHLRFEMQR DTKKGDTVET LDVRVADALG TRDYKTFSGG EAMRVNFAIR IALSRLLAHR
AGARLETLVI DEGFGTLDAD GRERMVEAIT AIQQDFARII VITHIDDLKD RFPATLEIRK
TPAGSRWELR G