Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0776 |
Symbol | |
ID | 4069521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 955815 |
End bp | 958760 |
Gene Length | 2946 bp |
Protein Length | 981 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982782 |
Product | excinuclease ABC subunit A |
Protein accession | YP_589855 |
Protein GI | 94967807 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.839433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0723742 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACT GTGGGGCCCC GACAATTACC AGAGGCCATG TTCCCGGCGG AATTCGGCCA GCAACAAGGG TTATAGCGTT TCACACGATG GCGATTTCCA AGATCACGGT GCGCGGGGCG CGCCAGCACA ATCTCAAAAA CATCACGGTC GAAATCCCGC GGAACACGCT CACCGTCATT ACCGGTTTGA GCGGCTCCGG AAAATCGTCG CTCGCCTTCG ATACGATCTA CGCTGAGGGC CAGCGGCGCT ACGTTGAAAC GCTCTCGGCG TACGCGCGCC AGTTCCTCGA CCAGATGGAA CGTCCCGATG TGGATTCGAT TGACGGCCTC AGCCCTGCGA TCTCCATCGA GCAGAAGACT ACCAGCCGCA GCCCGCGCTC CACAGTCGGC ACTATCACCG AGATTTACGA CTATCTTCGC CTTCTCTATT CGTCGATCGG ACTACCCCAC TGTCCGCAGT GCGGGCGCGC AATTTCGCGG CAGTCTGTCG AACAGATCGT AGCGCGAGTG TTGGAGCTGA AACCCGAGGA CCGCGTCATG CTGATGGCGC CGATCGTCCG CGGCCGCAAA GGCGAGTTTA AGAAAGAGAT GGAAAAACTC GCGCAGCACG GCTTCACGCG CGCGCGCATT GACGGCGAAC TGCGCAACAT TGCCGACGAA GAGATCAAGC TCGACAAGCG CAAGAACCAC ACCATCGAAG TCGTGATTGA CCGCCTGCTG GTAAAACCCG GAATCGAGAA GCGCCTCGCA GCGTCGGTCG AGCTCGCGAT GAAACTAGGC AGCGGGCTGG TGCAGGTGGC CGTGGTTGGT GGTGATGAGC ATCTCTTCTC GTCGCGACTG GCCTGCCCGG AATGCGGCAT CAGCGTTCCG CAACTCGAGC CGCGCTCGTT CTCGTTCAAC AGCGTGTATG GCGCTTGTCC GGAGTGCCAC GGCCTGGGCA ACAAGTACGA TTTCGATCCC GCGAAGATCA TCACTGACTG GTCAAAGCCG CTGCTTGATG GCGGCCTTGG TCCAGGTTCG GCCTCCGGCA ATCTCATCCG CATGGTGGAG ATCGCCGCCG CCGCGAACGA TATTGATCTC AAGCTGCCCT TCGAACAGCT CCCGGAGAAG CAGCAGAACC TGCTGCTCTA CGGCGCGACG AATGGCAACG GCCGCAGCGG CTTCAAAGGC GTTCTTGCCT ACTTGAAGCA GAACCTCGAC GAGAGCACCA GCGAAGGCTA TCGCGACTGG CTGCTCGCTT ACATGTCACC CACCGAATGT CCGGTGTGCC ACGGCAAACG ACTGCGCCCG GAATCGCTCG CGGTAAAAGT GAATGGCATG TCCATCGCCG ACTTCACCGC GCTTCCGGTC TCACGTTCGG TAGATGCGGT GAAAGACATC AAGCTCAACG AACGTGAAGA TCGCATTGCC GGCCGCGTGC TGCGCGAAAT CGGCGAACGG CTCGGCTTCC TGAACCATGT CGGGTTGGGA TACATCTCGC TCAGCCGCTC GGCGGCAACG CTCTCCGGTG GCGAAGGGCA GCGCATCCGC CTCGCGACGC AGATTGGGTC GAAGCTCCGC GGCGTGCTCT ACGTTCTCGA CGAGCCATCC ATCGGCCTGC ATCATCGCGA TAACGAGCGC CTGATCACCG CGCTCGAGGA GCTTCGCGAT CTCGGCAACA CGGTGCTCGT CGTCGAGCAC GACGAAGAAA CCATCCGTCG CGCCAACTAC GTCGTAGATC TTGGTCCCGG CGCCGGACGC CACGGCGGCG AACTGGTTGC TCACGGCACG CCATCCGATA TCGAAGCTGC GCCCGAGTCG CTGACAGGGC AATACATCTC CGGCCGCCGC GCCATCGGCA TTCGTCACGA ACGCCGCGCG GTCACCGACA AAGGGATCGC CATCCTCGGA GCGCGCGAGA ACAACCTCAA GAACGTGGAC GTCAGCTTCC CGCTGGGCGT GATGACGGTC GTCACCGGTG TCTCCGGCTC AGGCAAATCC ACGCTGGTGA ACGACATCCT CTACCGCGCG CTCGCCCAGA AGCTTTATCG CTCGCGCGAG GAGGCCGGCC AGCACAAGTC CATCAGCGGC ACCGAGAACA TCGACAAGGT CATCCGCATT GACCAATCGC CCATCGGACG CACTCCGCGT TCGAATCCGG CGACCTACAC CGGCGTGTTC TCCAACATCC GCGACCTCTA CGCCATGCTG CCGGAATCGC GCGAGCGTGG CTACAAAGCC GGACGATTCT CGTTCAACGT TGCCGGCGGA CGCTGCGAGG CCTGCCAGGG CGAAGGCCAG CGCCGCATCG AGATGAATTT CCTTCCCGAC GTCTACGTGC AATGCGAGGT CTGCAACGGT CGCCGCTACA ATCACGAGAC TCTCGCCGTG AAGTACAAGG GCCACAGCAT CGCCGACCTG CTGGAGCTTC CAGTCGCCGA CGCGCTCGCC GTGCTCGAAG CCATTCCTCA GGTGAAGCAG CGCCTTCAGA CTTTAGTGGA TGTCGGCCTC GGCTATATTC ATCTCGGTCA ATCTGCCGTA ACTCTCTCCG GCGGCGAGGC CCAGCGCATC AAACTGGCGA GGGAATTGAG CAAGCGCCAG ACCGGCAAAA CGTTGTACCT GCTCGACGAA CCGACCACCG GCCTTCACTT CGAAGACGTT AACAAACTGC TCGACGTGCT GCATCGTCTG ACCGATCTCG GAAACACGAT CATCATCATC GAGCACAACA TGGATGTCAT CCGGAACGCC GACTGGATTA TTGACCTCGG GCCGGAGGGT GGCGAAGACG GTGGAAAAAT TGTGGCGCAA GGGACCCCCG AAGCGGTGTC TAAGGTAAAG AAGAGTTATA CCGGCCAGGC GCTCGCCCAG TCGCTGAAGA ACAGCGTGGT GCGTGCGCTG CCCGCGAAGG TCGCAGCAGA AATTGCTCTG CCACGACCGA CACGAGAATC TAGATCAGAC GGATAA
|
Protein sequence | MAHCGAPTIT RGHVPGGIRP ATRVIAFHTM AISKITVRGA RQHNLKNITV EIPRNTLTVI TGLSGSGKSS LAFDTIYAEG QRRYVETLSA YARQFLDQME RPDVDSIDGL SPAISIEQKT TSRSPRSTVG TITEIYDYLR LLYSSIGLPH CPQCGRAISR QSVEQIVARV LELKPEDRVM LMAPIVRGRK GEFKKEMEKL AQHGFTRARI DGELRNIADE EIKLDKRKNH TIEVVIDRLL VKPGIEKRLA ASVELAMKLG SGLVQVAVVG GDEHLFSSRL ACPECGISVP QLEPRSFSFN SVYGACPECH GLGNKYDFDP AKIITDWSKP LLDGGLGPGS ASGNLIRMVE IAAAANDIDL KLPFEQLPEK QQNLLLYGAT NGNGRSGFKG VLAYLKQNLD ESTSEGYRDW LLAYMSPTEC PVCHGKRLRP ESLAVKVNGM SIADFTALPV SRSVDAVKDI KLNEREDRIA GRVLREIGER LGFLNHVGLG YISLSRSAAT LSGGEGQRIR LATQIGSKLR GVLYVLDEPS IGLHHRDNER LITALEELRD LGNTVLVVEH DEETIRRANY VVDLGPGAGR HGGELVAHGT PSDIEAAPES LTGQYISGRR AIGIRHERRA VTDKGIAILG ARENNLKNVD VSFPLGVMTV VTGVSGSGKS TLVNDILYRA LAQKLYRSRE EAGQHKSISG TENIDKVIRI DQSPIGRTPR SNPATYTGVF SNIRDLYAML PESRERGYKA GRFSFNVAGG RCEACQGEGQ RRIEMNFLPD VYVQCEVCNG RRYNHETLAV KYKGHSIADL LELPVADALA VLEAIPQVKQ RLQTLVDVGL GYIHLGQSAV TLSGGEAQRI KLARELSKRQ TGKTLYLLDE PTTGLHFEDV NKLLDVLHRL TDLGNTIIII EHNMDVIRNA DWIIDLGPEG GEDGGKIVAQ GTPEAVSKVK KSYTGQALAQ SLKNSVVRAL PAKVAAEIAL PRPTRESRSD G
|
| |