Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2343 |
Symbol | |
ID | 7268693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2849538 |
End bp | 2852552 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643567172 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_002463657 |
Protein GI | 219849224 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.684635 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00406706 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGCCGCA CTCGGTGCGG CTCGCAATAT GATACGGCGT CGCCGGTGGG GCGGCGCTTT CGCTTCTACG GCATCAGCGT GGAGACGGTC GTGGCTAAGT CGACAATCGT GATTAAAGGT GCGCGGGAAC ATAATCTCAA GGGGATCGAT CTTGAGATTC CCCGTGATCG GTTGGTGGTA TTGACCGGAG TGAGCGGTTC GGGCAAGAGT TCGCTGGCGT TTGATACCCT CTACGCCGAA GGCCAGCGGC GCTACGTTGA GAGCCTGTCG GCCTATGCGC GCCAATTCCT TGGGCAAATG GAGAAGCCAC AGGTCGATCT GATCGAGGGT TTGTCACCCG CTATCGCCAT TGAACAGAAG AGCGCCAGCA AGAATCCCCG TTCGACGGTT GGCACCGTTA CCGAGATTTA CGACTATCTG CGGTTGTTGT TTGCCCGTGT CGGTCAGCAG TATTGTCACC GCTGTGGGCA ACCGGTGGCG GCTCAATCGG CCCAGCAGAT GGTGGATCGG ATCTTGACCT TGCCGCCGGG GACACGCTTT GTAATACTCG CACCGGTTGT ATCGCAGCGT AAGGGTGAGT ATAAAGATAT TTTCGCCGAA GCTAAGGCCG AAGGTTTTAT CCGTGTGCGT GTTGATGGCG AAATTCGCTC TCTTGATGAA GAAATTAAGC TGAATAAGAA GGTAAAACAC TCCATCGAGA TTGTTGTCGA TCGTTTAGCC ATACCCTCAA CCACCGATAA CGGTGACCGC GACGGGTTCA CGACGCGCCT CACCGATAGT ATTGAGACGG CATTGCGGGT TGGTGAAGGC AAAGTTCTGA TCAGCCTCGC CGATCAACCG TCTGATCCCG ATCAGCCACG CGAATGGATG ATGAGTGAGA GCAATACCTG TCTCGCGTGT GGCATTTCTT TCCCCGAACT GACGCCTCAG ATGTTCTCGT TTAACTCGCC GCAAGGCGCG TGTCCGACCT GTACCGGGCT TGGCTTCCGG CTTGAGGTCG ATCCCGCCCT GCTCGTCCCC AATGGTGAAC TGTCCATTCA CGATGGTGCG GTGACGTATT GGGGAGAGAT GCGTAAAAAA CAGGACACGT GGGCCTACAA GGCGTTACAG GCCATAGCCG CCCATTACCG CATTGATCTC GATGCACCGT GGGATTCCCT TAGCCAGGCT CAGCGCGATG TGATTATCTA CGGTAGTGGC ACCGAACGGA TCCGCTTTAC ATGGGTACAC GAGGGCGGTT CGCGTGGTGA GTACTACCGC CCATGGGAAG GATTGGCCGG CGAGATTCGG CGGCGCTATA TGCAGAGCGG TTCGGATGCG ATGCAAGAGC ACTATGCTCA GTACATGAGT GAGCAGCCGT GTCCTGATTG TCAGGGTGCT CGGCTGCGTC CAGAAAGTCT GGCTGTGCGC GTGGCCGGTC GCTCGATCCG TGATGTGACG CGCATGAATA TTTCCCAGGC CCTTGATTGG GCGCGAGAGT TGCCAAATTG TCTCAGCGAG ACTCAACGGC ACATCGTTGA TGATGTGTTG AAGGAGATCC GTGAACGACT CGGCTTTTTG CACAATGTTG GCTTGCACTA TCTCACCCTC GACCGTGCTG CACCAACCTT GTCCGGCGGT GAAGCGCAAC GCATCCGCCT CGCTTCGCAG ATCGGCTCAG GGTTGGTCGG TGTGATGTAC ATCCTCGATG AGCCGAGCAT TGGTCTTCAC CAGCGTGATA ACCGTAAACT CCTCGACTCA CTGCTGCGTC TGCGCGATCT AGGTAATACC TTGATCGTCG TTGAACATGA CCTTGAAACG ATGCAGGCCG CCGATTGGAT TATTGACTTT GGTCCCGGTG CCGGTGTGAA GGGTGGCCAA GTCGTGACGG CCGGTACGCC AGAGCAGGTG GCGCAACATC CGACATCGCT CACCGGGCAG TATTTGTCGG GACGCCTCAC TATTCCGGTA CCGACGACCC GTCGCCGTCC TGATAACGGC TGGTTGACGA TTGAAGGAGC TACGCTCAAT AATTTGCGCG ATGTGACGGT AAGTTTTCCA CTCGGCTGTT TTATTGCCGT GACCGGTGTT TCCGGTTCGG GTAAATCGTC GCTGATCACC GAGACGCTCT ATCCGGCATT GGCCAATCGG CTTAACCGTG CTCAACTTAA GCCCGGCCCC TTCCGTGCGC TCCACGGTCT CGAACGACTC GATAAGGTGA TCAATATCGA TCAGCAGCCC ATCGGGCGTA CACCTCGTTC CAATCCGGCT ACTTACGTGA AGCTGTTCGA TCTGCTCCGT GAGCTGTTTG CCGAAACACC CGAAGCGAAG CTGCGTGGCT ACGGTCCCGG ACGGTTCAGT TTTAACTTGC GTGGTGGACG GTGCGAGGCC TGTGAAGGGA ACGGTGAAAT TAAGATCGAC ATGCAGTTCC TCGCCGATGT GTGGGTGCGG TGTGCTGAAT GTAAGGGAAA GCGCTATAAT CGCGAGACGT TACAGGTTAA GTATAAGGGT AAGACGATTG CCGACGTGCT TGAGATGGAT GTGCAAACCG CTCTTGAGTT TTTTGCCAAT GTCCCACGAG TACGTCGTAT CTTGCAGACC TTACACGATG TTGGTCTCGA TTATATCAAA CTCGGTCAGC CGGCCACTAC CCTGTCTGGC GGTGAGGCGC AGCGGGTAAA GTTAGCAAAG GAATTGGCCC GCGTTGCCAC CGGTCGAACC ATTTACATTC TTGATGAGCC AACGACCGGT TTGCACTTCG CCGATATTCA ACATTTGTTG CGCGTTTTGC ATCGCTTGGT TGATGCCGGC AATACGGTGA TTGTGATCGA GCATAACCTT GATGTGATTA AGACGGCTGA CTATGTGATC GATATGGGGC CAGAGGGTGG TGATGGTGGT GGTGAGGTGG TGGCGCTTGG CACACCAGAA GAGGTTGCAC GTCATCCGTC TTCGCACACG GGGCGATTCT TGCGTGAGAT CCTCGAAGCG GTCGGTTTGG TGGGTGTTGG CGATAGCCAA ACGTATGTGG ATTAA
|
Protein sequence | MSRTRCGSQY DTASPVGRRF RFYGISVETV VAKSTIVIKG AREHNLKGID LEIPRDRLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS AYARQFLGQM EKPQVDLIEG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RLLFARVGQQ YCHRCGQPVA AQSAQQMVDR ILTLPPGTRF VILAPVVSQR KGEYKDIFAE AKAEGFIRVR VDGEIRSLDE EIKLNKKVKH SIEIVVDRLA IPSTTDNGDR DGFTTRLTDS IETALRVGEG KVLISLADQP SDPDQPREWM MSESNTCLAC GISFPELTPQ MFSFNSPQGA CPTCTGLGFR LEVDPALLVP NGELSIHDGA VTYWGEMRKK QDTWAYKALQ AIAAHYRIDL DAPWDSLSQA QRDVIIYGSG TERIRFTWVH EGGSRGEYYR PWEGLAGEIR RRYMQSGSDA MQEHYAQYMS EQPCPDCQGA RLRPESLAVR VAGRSIRDVT RMNISQALDW ARELPNCLSE TQRHIVDDVL KEIRERLGFL HNVGLHYLTL DRAAPTLSGG EAQRIRLASQ IGSGLVGVMY ILDEPSIGLH QRDNRKLLDS LLRLRDLGNT LIVVEHDLET MQAADWIIDF GPGAGVKGGQ VVTAGTPEQV AQHPTSLTGQ YLSGRLTIPV PTTRRRPDNG WLTIEGATLN NLRDVTVSFP LGCFIAVTGV SGSGKSSLIT ETLYPALANR LNRAQLKPGP FRALHGLERL DKVINIDQQP IGRTPRSNPA TYVKLFDLLR ELFAETPEAK LRGYGPGRFS FNLRGGRCEA CEGNGEIKID MQFLADVWVR CAECKGKRYN RETLQVKYKG KTIADVLEMD VQTALEFFAN VPRVRRILQT LHDVGLDYIK LGQPATTLSG GEAQRVKLAK ELARVATGRT IYILDEPTTG LHFADIQHLL RVLHRLVDAG NTVIVIEHNL DVIKTADYVI DMGPEGGDGG GEVVALGTPE EVARHPSSHT GRFLREILEA VGLVGVGDSQ TYVD
|
| |