Gene Cagg_2343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2343 
Symbol 
ID7268693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2849538 
End bp2852552 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content55% 
IMG OID643567172 
Productexcinuclease ABC, A subunit 
Protein accessionYP_002463657 
Protein GI219849224 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.684635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00406706 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCCGCA CTCGGTGCGG CTCGCAATAT GATACGGCGT CGCCGGTGGG GCGGCGCTTT 
CGCTTCTACG GCATCAGCGT GGAGACGGTC GTGGCTAAGT CGACAATCGT GATTAAAGGT
GCGCGGGAAC ATAATCTCAA GGGGATCGAT CTTGAGATTC CCCGTGATCG GTTGGTGGTA
TTGACCGGAG TGAGCGGTTC GGGCAAGAGT TCGCTGGCGT TTGATACCCT CTACGCCGAA
GGCCAGCGGC GCTACGTTGA GAGCCTGTCG GCCTATGCGC GCCAATTCCT TGGGCAAATG
GAGAAGCCAC AGGTCGATCT GATCGAGGGT TTGTCACCCG CTATCGCCAT TGAACAGAAG
AGCGCCAGCA AGAATCCCCG TTCGACGGTT GGCACCGTTA CCGAGATTTA CGACTATCTG
CGGTTGTTGT TTGCCCGTGT CGGTCAGCAG TATTGTCACC GCTGTGGGCA ACCGGTGGCG
GCTCAATCGG CCCAGCAGAT GGTGGATCGG ATCTTGACCT TGCCGCCGGG GACACGCTTT
GTAATACTCG CACCGGTTGT ATCGCAGCGT AAGGGTGAGT ATAAAGATAT TTTCGCCGAA
GCTAAGGCCG AAGGTTTTAT CCGTGTGCGT GTTGATGGCG AAATTCGCTC TCTTGATGAA
GAAATTAAGC TGAATAAGAA GGTAAAACAC TCCATCGAGA TTGTTGTCGA TCGTTTAGCC
ATACCCTCAA CCACCGATAA CGGTGACCGC GACGGGTTCA CGACGCGCCT CACCGATAGT
ATTGAGACGG CATTGCGGGT TGGTGAAGGC AAAGTTCTGA TCAGCCTCGC CGATCAACCG
TCTGATCCCG ATCAGCCACG CGAATGGATG ATGAGTGAGA GCAATACCTG TCTCGCGTGT
GGCATTTCTT TCCCCGAACT GACGCCTCAG ATGTTCTCGT TTAACTCGCC GCAAGGCGCG
TGTCCGACCT GTACCGGGCT TGGCTTCCGG CTTGAGGTCG ATCCCGCCCT GCTCGTCCCC
AATGGTGAAC TGTCCATTCA CGATGGTGCG GTGACGTATT GGGGAGAGAT GCGTAAAAAA
CAGGACACGT GGGCCTACAA GGCGTTACAG GCCATAGCCG CCCATTACCG CATTGATCTC
GATGCACCGT GGGATTCCCT TAGCCAGGCT CAGCGCGATG TGATTATCTA CGGTAGTGGC
ACCGAACGGA TCCGCTTTAC ATGGGTACAC GAGGGCGGTT CGCGTGGTGA GTACTACCGC
CCATGGGAAG GATTGGCCGG CGAGATTCGG CGGCGCTATA TGCAGAGCGG TTCGGATGCG
ATGCAAGAGC ACTATGCTCA GTACATGAGT GAGCAGCCGT GTCCTGATTG TCAGGGTGCT
CGGCTGCGTC CAGAAAGTCT GGCTGTGCGC GTGGCCGGTC GCTCGATCCG TGATGTGACG
CGCATGAATA TTTCCCAGGC CCTTGATTGG GCGCGAGAGT TGCCAAATTG TCTCAGCGAG
ACTCAACGGC ACATCGTTGA TGATGTGTTG AAGGAGATCC GTGAACGACT CGGCTTTTTG
CACAATGTTG GCTTGCACTA TCTCACCCTC GACCGTGCTG CACCAACCTT GTCCGGCGGT
GAAGCGCAAC GCATCCGCCT CGCTTCGCAG ATCGGCTCAG GGTTGGTCGG TGTGATGTAC
ATCCTCGATG AGCCGAGCAT TGGTCTTCAC CAGCGTGATA ACCGTAAACT CCTCGACTCA
CTGCTGCGTC TGCGCGATCT AGGTAATACC TTGATCGTCG TTGAACATGA CCTTGAAACG
ATGCAGGCCG CCGATTGGAT TATTGACTTT GGTCCCGGTG CCGGTGTGAA GGGTGGCCAA
GTCGTGACGG CCGGTACGCC AGAGCAGGTG GCGCAACATC CGACATCGCT CACCGGGCAG
TATTTGTCGG GACGCCTCAC TATTCCGGTA CCGACGACCC GTCGCCGTCC TGATAACGGC
TGGTTGACGA TTGAAGGAGC TACGCTCAAT AATTTGCGCG ATGTGACGGT AAGTTTTCCA
CTCGGCTGTT TTATTGCCGT GACCGGTGTT TCCGGTTCGG GTAAATCGTC GCTGATCACC
GAGACGCTCT ATCCGGCATT GGCCAATCGG CTTAACCGTG CTCAACTTAA GCCCGGCCCC
TTCCGTGCGC TCCACGGTCT CGAACGACTC GATAAGGTGA TCAATATCGA TCAGCAGCCC
ATCGGGCGTA CACCTCGTTC CAATCCGGCT ACTTACGTGA AGCTGTTCGA TCTGCTCCGT
GAGCTGTTTG CCGAAACACC CGAAGCGAAG CTGCGTGGCT ACGGTCCCGG ACGGTTCAGT
TTTAACTTGC GTGGTGGACG GTGCGAGGCC TGTGAAGGGA ACGGTGAAAT TAAGATCGAC
ATGCAGTTCC TCGCCGATGT GTGGGTGCGG TGTGCTGAAT GTAAGGGAAA GCGCTATAAT
CGCGAGACGT TACAGGTTAA GTATAAGGGT AAGACGATTG CCGACGTGCT TGAGATGGAT
GTGCAAACCG CTCTTGAGTT TTTTGCCAAT GTCCCACGAG TACGTCGTAT CTTGCAGACC
TTACACGATG TTGGTCTCGA TTATATCAAA CTCGGTCAGC CGGCCACTAC CCTGTCTGGC
GGTGAGGCGC AGCGGGTAAA GTTAGCAAAG GAATTGGCCC GCGTTGCCAC CGGTCGAACC
ATTTACATTC TTGATGAGCC AACGACCGGT TTGCACTTCG CCGATATTCA ACATTTGTTG
CGCGTTTTGC ATCGCTTGGT TGATGCCGGC AATACGGTGA TTGTGATCGA GCATAACCTT
GATGTGATTA AGACGGCTGA CTATGTGATC GATATGGGGC CAGAGGGTGG TGATGGTGGT
GGTGAGGTGG TGGCGCTTGG CACACCAGAA GAGGTTGCAC GTCATCCGTC TTCGCACACG
GGGCGATTCT TGCGTGAGAT CCTCGAAGCG GTCGGTTTGG TGGGTGTTGG CGATAGCCAA
ACGTATGTGG ATTAA
 
Protein sequence
MSRTRCGSQY DTASPVGRRF RFYGISVETV VAKSTIVIKG AREHNLKGID LEIPRDRLVV 
LTGVSGSGKS SLAFDTLYAE GQRRYVESLS AYARQFLGQM EKPQVDLIEG LSPAIAIEQK
SASKNPRSTV GTVTEIYDYL RLLFARVGQQ YCHRCGQPVA AQSAQQMVDR ILTLPPGTRF
VILAPVVSQR KGEYKDIFAE AKAEGFIRVR VDGEIRSLDE EIKLNKKVKH SIEIVVDRLA
IPSTTDNGDR DGFTTRLTDS IETALRVGEG KVLISLADQP SDPDQPREWM MSESNTCLAC
GISFPELTPQ MFSFNSPQGA CPTCTGLGFR LEVDPALLVP NGELSIHDGA VTYWGEMRKK
QDTWAYKALQ AIAAHYRIDL DAPWDSLSQA QRDVIIYGSG TERIRFTWVH EGGSRGEYYR
PWEGLAGEIR RRYMQSGSDA MQEHYAQYMS EQPCPDCQGA RLRPESLAVR VAGRSIRDVT
RMNISQALDW ARELPNCLSE TQRHIVDDVL KEIRERLGFL HNVGLHYLTL DRAAPTLSGG
EAQRIRLASQ IGSGLVGVMY ILDEPSIGLH QRDNRKLLDS LLRLRDLGNT LIVVEHDLET
MQAADWIIDF GPGAGVKGGQ VVTAGTPEQV AQHPTSLTGQ YLSGRLTIPV PTTRRRPDNG
WLTIEGATLN NLRDVTVSFP LGCFIAVTGV SGSGKSSLIT ETLYPALANR LNRAQLKPGP
FRALHGLERL DKVINIDQQP IGRTPRSNPA TYVKLFDLLR ELFAETPEAK LRGYGPGRFS
FNLRGGRCEA CEGNGEIKID MQFLADVWVR CAECKGKRYN RETLQVKYKG KTIADVLEMD
VQTALEFFAN VPRVRRILQT LHDVGLDYIK LGQPATTLSG GEAQRVKLAK ELARVATGRT
IYILDEPTTG LHFADIQHLL RVLHRLVDAG NTVIVIEHNL DVIKTADYVI DMGPEGGDGG
GEVVALGTPE EVARHPSSHT GRFLREILEA VGLVGVGDSQ TYVD