Gene Cagg_1416 details


Gene Information

Locus tag: Cagg_1416
Symbol:
ID: 7269248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1742949
End bp: 1744943
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 54%
IMG OID: 643566259
Product: excinuclease ABC subunit B
Protein accession: YP_002462759
Protein GI: 219848326
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
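
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions are consistent with the values listed, but are not stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1742949, 1744943)
print(length_bp)                  # 1995
print(protein_length(length_bp))  # 664
```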


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTTC GGATTGAAGC GCCCTATCAG CCGACCGGTG ATCAGCCACA AGCGATAGAG 
AAACTTGTGG CCGGGTTGCG GGCCGGCTAT CGTCATCAGA CGCTGCTCGG TGCAACCGGC
ACCGGCAAAA CGTTTGTGAT GGCGCATATT TTTGCCCAAA TTCAGCGCCC TACGCTCGTG
CTTGCCCATA ACAAGACGCT GGTCTCGCAA CTATGGGCCG AGTTCCGTGA GTTTTTGCCT
GATGCCGCAG TCGAGATGTT TATCTCGTAC TACGATGAAT ATACGCCCGA AGCCTATGTG
CCCTCAAAAG ACCTCTACAT CGAGAAAGAG GCTAGCATTA ATGAAGAAAT CGACCGTCTG
CGTCACGCGG CGACCCAAGC ACTGCTGACC CGCCGCGATG TTCTGATTGT CGCTTCGGTT
TCGGCGATCT TTGGTCTCGG TTCACCTCAC GACTACGGTC AAGAGAAAAT TCACCTCCGT
ACCGGTGAAG TCCGTAACCG CGATAAACTG TTACGCCAAC TGATCGATTT ACAATTCGAG
CGTAACGATG TCGATTTTCA ACGCGGTACC TTCCGCGTTC GTGGCGATAC CCTCGATATT
ATTCCCGCGA ACGCTGAAAC CGCCATCCGG GTTGAGTTCT GGGGCGATGA GATTGAGCGG
ATTGTCGAAC TTGACCCGCT TACCGGAGAG GTGTTGCTGA AGCATACGGC GGTTGAAATT
TATCCGGCTA AGCACTTCGT CACCACCAAA GAGAAGCTAC AATTGGCGAT TGTGAGCATC
CAAGCCGAGC TGAACGAACG GTTACAAGAA TTAGAAGCTG CCGGCAAGTT ACTCGAAGCG
CAACGGCTGA AACAACGCAC GCTGTACGAC CTCGAAATGC TGAGTGAAGT GGGCTACTGT
AGCGGGATTG AGAACTATTC GCGCCATCTC GACGGACGCG CACCCGGTCA GACACCGTGG
ACATTGCTCG ATTATTTTCC CGATGATTTT TTGATGTTTA TTGACGAAAG TCACATCACC
ATTCCCCAGT TGCGCGGTAT GTACAACGGG GACCGTCAGC GTAAACAGAC ATTGGTTGAT
TACGGTTTTC GCTTACCGTC GGCACTCGAT AACCGCCCAC TCAAGTTTGA AGAGTTTGAG
CAGCACGTCT ATCAGGTGAT CTACGTCTCG GCAACCCCCG GTCCCTACGA GCGAGAAAAG
AGTGAACAGA TTGTTGAGCA GATTATCCGT CCGACCGGTC TGCTCGATCC GGAAATCGAG
GTGCGCCCGA CGCGCGGGCA AATCGATGAT CTGCTCGGCG AGATACGGCG ACGGGTAGAA
CGTAAACAGC GCGTGTTGGT AACGACTCTC ACCAAACGCA TGGCCGAAGA TTTGGCCGAC
TACCTCAAAG AGATGGGGGT ACGCACGATG TACCTCCACG CCGATATCGA TACCATCGAG
CGGGTAGAGA TTCTGCGTGA TCTGCGGCTT GGAGTGTATG ATGTAGTCGT GGGGATTAAC
CTGTTGCGTG AGGGGCTTGA TCTCCCTGAA GTGAGTTTGG TGGCGATCCT CGATGCGGAT
AAAGAGGGGT ATCTTCGCTC AGAGACATCG CTTATTCAGA TCATTGGTCG AGCGGCGCGT
CATATCGAGG GTAAAGTGAT CATGTACGCT GATACCATCA CCCGTTCGAT GGAAGTCGCC
ATTCGTGAGA CGCAACGCCG ACGTGAGATT CAGATGGCGC ACAATGTGCG GCACGGCATT
ACGCCGCAAG GAATCGCAAA AGGGGTACGT GATTTGACCG ACCGCATTCG CAAGGTGGCT
GAAGAGCGCG GTGAGTACGT GACCACTCCA GAGACCGCCG TACCGGTTGA TTTGCCACGT
GATGAAGTGT TGAAGCTGAT CAAAGACCTC GAGAAGCAGA TGAAACAGGC CGCCAAAGCG
CTTGCGTTTG AGAAAGCAGC GGCGTTGCGT GATCAGATCG TCGAGCTGCG ACAGGCCTTA
GCACTGAGTG AGTAG
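
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as a demonstration, so its result is not expected to equal the 54% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (demonstration input).
first_line = "ATGCCGTTTC GGATTGAAGC GCCCTATCAG CCGACCGGTG ATCAGCCACA AGCGATAGAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full 1995 bp sequence into `clean_sequence` gives the input needed to compare against the reported GC content.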
 
Protein sequence
MPFRIEAPYQ PTGDQPQAIE KLVAGLRAGY RHQTLLGATG TGKTFVMAHI FAQIQRPTLV 
LAHNKTLVSQ LWAEFREFLP DAAVEMFISY YDEYTPEAYV PSKDLYIEKE ASINEEIDRL
RHAATQALLT RRDVLIVASV SAIFGLGSPH DYGQEKIHLR TGEVRNRDKL LRQLIDLQFE
RNDVDFQRGT FRVRGDTLDI IPANAETAIR VEFWGDEIER IVELDPLTGE VLLKHTAVEI
YPAKHFVTTK EKLQLAIVSI QAELNERLQE LEAAGKLLEA QRLKQRTLYD LEMLSEVGYC
SGIENYSRHL DGRAPGQTPW TLLDYFPDDF LMFIDESHIT IPQLRGMYNG DRQRKQTLVD
YGFRLPSALD NRPLKFEEFE QHVYQVIYVS ATPGPYEREK SEQIVEQIIR PTGLLDPEIE
VRPTRGQIDD LLGEIRRRVE RKQRVLVTTL TKRMAEDLAD YLKEMGVRTM YLHADIDTIE
RVEILRDLRL GVYDVVVGIN LLREGLDLPE VSLVAILDAD KEGYLRSETS LIQIIGRAAR
HIEGKVIMYA DTITRSMEVA IRETQRRREI QMAHNVRHGI TPQGIAKGVR DLTDRIRKVA
EERGEYVTTP ETAVPVDLPR DEVLKLIKDL EKQMKQAAKA LAFEKAAALR DQIVELRQAL
ALSE
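
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact 64-letter amino acid string used for the standard genetic code, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which codons may act as start codons). The function name `translate` is hypothetical, and the inputs are only the first 30 bases and the last 15 bases of the gene sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X'.
    Alternative start codons are not converted to M here; this gene
    begins with ATG, so that case does not arise.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGCCGTTTC GGATTGAAGC GCCCTATCAG"))  # MPFRIEAPYQ
print(translate("GCACTGAGTG AGTAG"))                  # ALSE
```

The two outputs match the first ten and last four residues of the protein sequence listed above.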