Gene Cagg_0015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0015 
Symbol 
ID7269011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp23981 
End bp26836 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content57% 
IMG OID643564887 
Productexcinuclease ABC, A subunit 
Protein accessionYP_002461404 
Protein GI219846971 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATGGA TACGGATCCG CGGCGCACGC ACTCACAATC TCAAACAGAT CGATCTCGAC 
ATCCCACGCG GTAAATTGGT CGTGATGACC GGCGTTTCGG GATCGGGGAA GAGTTCACTC
GCCTTCGATA CCATCTTTGC CGAAGGGCAA CGCCGTTATG TCGAGAGCTT ATCGGTGTAC
GCTCGTCAAC TGCTCGGTCA ACTCGAAAAG CCCGATGTAG ACCTGATCGA AGGTCTATCA
CCGGCGATTG CGATTGATCA AAAAGGCGCC GCCCGTAACC CCCGTTCCAC CGTTGGTACA
GTCACCGAAG TGTATGATTT TTTGCGCTTG CTCTTTGCCC GTATCGGTCA ACCACACTGC
CCACACTGCG CTACACCGCT CCGTCGCTAT ACGCCGCAGC AGATGGTTGA TTTTATCGTT
GAACTGCCGG CCGACCAACG GGTACAGCTC CTTGCCCCGC TGACCGGTGA TGCTGAAGCA
GTGCTGGCCG AGCTGCGGCG ACGCGGTTTC GTTCGTGCTC GGATCGACGG TGTGGTGCAT
GAACTTGATG AGCCGATCCG GTTAGATAAG TACCGAAAGC CACTGATCGA AGCGGTTGTC
GACCGATTGA TCGTTCGTCG TACCGCCGAC GGAACTCCAG CACTTGACCG TGTGCGGGTC
GCCGATTCGG TTGAGACGGC CCTGAAGTTG AGTGGTGGTC AACTGATCGT GCAAGTCATC
GACGGCGATG AGTGGCTGCT CAGTGAACGG TTTGTTTGTC CGCAACACGG TTCGATTGAC
TTAGGTGAGC TTGCACCGCG CGATTTTTCG TTCAACAACC CGCACGGGGC TTGTCCAACC
TGCGATGGCC TGGGTGTTGT ACCCGAAATC GATCCTACCC TCGTTGTGCC TGACCGCCGC
TTACCACTCC TCGAAGCGAT TGCACCGTGG CGAGAAGGAG ATAGTACCGC CCAGCGCTAC
TATCAAGACG TGTTGCGATC TTTTGCTGCT CACTTCGGCA TTGACCCACT CACACCGATC
AATGTACTGT CGCCCGAAGT CTTAAGTGCG TTACTGTACG GTACCGGTGG TGAACCGATT
ACCTTGCAGT ACCACCATCA GGGGCGAGTC CATAGTGTTG AAACTGAATT TGAGGGTATC
ATCCCCAATC TACGCCGCCG TCTCAACGAA CAGCGTAACG GCAACGACCC ATCGCCACTT
GAGCAGTACA CTTCACCACG CCCCTGCCCT GACTGTGGTG GCACCCGCCT GCGCCCCGCT
GCTCGCGCCG TCACGGTAGC CGGGGCATCG ATTGCCGACA TTCACCGGAT GACCGTCGCC
GATGCGTTGG CATGGGCCAG TGCTCTGCTC GCAGACCGCG AATTGAGCGA ACGGGAACGA
ACCATTGCGC GCCCAATCGT GCGTGAGATT ACCCAGCGTT TGCGATTTTT ATGTGAAGTT
GGACTATCTT ATCTCACGCT GGATCGCACA GCAATGACCC TCAGCGGCGG TGAGATGCAG
CGCGTGCGAT TGGCTACGCA AGTAGGCGCC GGCCTATCTG GTGTTCTTTA CGTGCTCGAC
GAACCGAGCA GCGGTTTACA TTCACGTGAT CACGATCGCC TGCTCACTAC GTTACTGCAA
TTACGCGATC TCGGTAACAG TGTGATTGTG GTCGAACACG ACGAAGCAAC TATCCGTGCT
GCCGATTGGC TGGTTGATAT TGGGCCGGGA GCTGGGCCGC ACGGTGGCGA AGTATTGGCA
AGCGGAACGT TGAACGACAT CGTCGCCTGC CCACGTTCGC TGACCGGGCA ATATCTGAGT
GGAAAGCGCC AGATCCCGAT TCCCGACCGG CGCCGTCCGG CCAACGGGCC ATGGCTCGAA
CTGCGCGGAT GCCGGGCCAA TAATCTCAAG AATATCGATG TGCGTATTCC TCTTGGCTGC
TTTGTCGCAG TGAGTGGTGT CAGTGGTAGT GGCAAGAGTA GTCTCATCGG AGACACGCTT
GCGCCGCGCT TAATGCAGTT GCTGCATGGC GGCAACATCC GCGCCGGCGA TCACGATGCC
ATTCTTGGTG TTGAGCATCT TGAACGAGTG ATCGTCGTCG ATCAAACCCC GATCGGACGC
ACGCCACGCT CGAACCCGGC TACCTACTGT CGGATTTTCG ATCCCATTCG CAACTTATTC
GCTGCGACCA ATGAAGCCAA AGCTCGCGGG TATGATGCCT CGCGCTTCAG CTTTAACATC
AAGGGCGGTC GGTGTGAGCA CTGCGCCGGT GAAGGGTTGA TGCGGGTCGA AATGCAGTTT
CTCCCCGATA TTTTCGTACC ATGTGATATT TGCGGCGGAA CGCGGTATAA TCGAGAGACG
TTAGATATTC GTTACCGTGG TCTCAATATT GCCGAAGTAT TAGAACTAAC GGTAGCAGAA
GCACTTGACT TCTTCGCGCG TGTACCGGCT ATTGCCGAAC GGTTGCAAGC ACTTTACGAT
GTCGGCCTTG GCTATCTCAA ACTCGGTCAG CCGGCGCCGA CACTGAGCGG TGGCGAGGCA
CAGCGGATCA AACTGGCCGC CGAACTGAGC CGGCGGAGTA GCGGACGGAC GCTCTACATT
CTCGATGAGC CAACGACCGG CCTACACTTC GCCGACATCG AACGATTGGT AACCGTCTTA
CAACGTCTCG TTGAGGCGGG CAATACGGTA CTCATGGTTG AACATCACAT TGATCTGATC
GCCGCTGCCG ACTGGGTGAT CGAACTCGGC CCTGAAGGCG GCGACAACGG CGGGTACCTG
ATCGGCACCG GCCCACCCGA AACTATTGCT ATGCTGGCCG AATCGGCAAC CGGTCCGTAT
CTGAAAAATC GCCTATATCG CGCATTGATG CGATAA
 
Protein sequence
MEWIRIRGAR THNLKQIDLD IPRGKLVVMT GVSGSGKSSL AFDTIFAEGQ RRYVESLSVY 
ARQLLGQLEK PDVDLIEGLS PAIAIDQKGA ARNPRSTVGT VTEVYDFLRL LFARIGQPHC
PHCATPLRRY TPQQMVDFIV ELPADQRVQL LAPLTGDAEA VLAELRRRGF VRARIDGVVH
ELDEPIRLDK YRKPLIEAVV DRLIVRRTAD GTPALDRVRV ADSVETALKL SGGQLIVQVI
DGDEWLLSER FVCPQHGSID LGELAPRDFS FNNPHGACPT CDGLGVVPEI DPTLVVPDRR
LPLLEAIAPW REGDSTAQRY YQDVLRSFAA HFGIDPLTPI NVLSPEVLSA LLYGTGGEPI
TLQYHHQGRV HSVETEFEGI IPNLRRRLNE QRNGNDPSPL EQYTSPRPCP DCGGTRLRPA
ARAVTVAGAS IADIHRMTVA DALAWASALL ADRELSERER TIARPIVREI TQRLRFLCEV
GLSYLTLDRT AMTLSGGEMQ RVRLATQVGA GLSGVLYVLD EPSSGLHSRD HDRLLTTLLQ
LRDLGNSVIV VEHDEATIRA ADWLVDIGPG AGPHGGEVLA SGTLNDIVAC PRSLTGQYLS
GKRQIPIPDR RRPANGPWLE LRGCRANNLK NIDVRIPLGC FVAVSGVSGS GKSSLIGDTL
APRLMQLLHG GNIRAGDHDA ILGVEHLERV IVVDQTPIGR TPRSNPATYC RIFDPIRNLF
AATNEAKARG YDASRFSFNI KGGRCEHCAG EGLMRVEMQF LPDIFVPCDI CGGTRYNRET
LDIRYRGLNI AEVLELTVAE ALDFFARVPA IAERLQALYD VGLGYLKLGQ PAPTLSGGEA
QRIKLAAELS RRSSGRTLYI LDEPTTGLHF ADIERLVTVL QRLVEAGNTV LMVEHHIDLI
AAADWVIELG PEGGDNGGYL IGTGPPETIA MLAESATGPY LKNRLYRALM R