Gene Information |
Locus tag | Acid345_2022 |
Symbol | |
ID | 4070352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2420423 |
End bp | 2423344 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984036 |
Product | excinuclease ABC subunit A |
Protein accession | YP_591097 |
Protein GI | 94969049 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
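The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record does not state them explicitly. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2420423, 2423344)
print(length_bp)                  # 2922, matching "Gene Length"
print(protein_length(length_bp))  # 973, matching "Protein Length"
```

The strand field does not affect this calculation, since the length of the interval is the same on either strand.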
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAACG AGAGCATTAT TGTCCGCGGT GCGCGGGTAC ACAATTTAAA GAACATTGAC GTGGAGATTC CGCACAACCA GCTCACGGTT GTGACCGGGG TTTCTGGCTC GGGCAAGTCC TCCCTGGCCT TTGACACGAT TTATGCCGAG GGGCAGAGGC GGTATGTGGA GTCGCTGTCG GCGTATGCGC GGCAGTTCCT GGAGCGCATT GAGAAGCCGG ATGCCGACCT GATTGACGGT ATCGCACCGG CGGTAGCGAT CAAGCAGAAA AATAGCACCC GCAATCCGCG CTCGACGGTG GCGACGGCGA CGGAGATCTA CGACTATCTG CGGCTGCTCT TCGCCCGTGT GGGCCGGACC TACTGCGACA ACTGCGGCGG CGAGGTCAAG AAAGATACCG TCGACGAGAT CGCCGACCGG TTGCTGGCGA TGCCGGAGGG GACGCGCTTT AATGTGCTCT TTCCTCTGGT GCAGGCTCCG GCCCCGGTCG AACCGGAAAA GAAGCCGAAA GGTCGCAAGC CTAAGAAGCA AACTGCACCG GCACAGGATG AGTTGACGAA AGAGCGGCTG TTCGAGCTAC GCAAGCGGGG ATTCAATCGG CTCTTCCAGA CGGGACAGAT TTTCGAATTC TCAACGCCGG AGTCGCTGCT CGATATCGAT TTCTCGAAGC CCGTTTATCT GCTGGTGGAT CGAATTGCGA CGGCTCCGGA TAACCGTTCG CGCATTGTGG ATGCGATTGA GTCGGCGTAC CGCGAAGCCG GCGAAGTCAT TTTTGAAACC GCGCCGCGTG AAGAGGGCGG AGCGCCAGAA CGCCTTCGAT TTGCGCAGCG ATTTGAGTGC AAGAACTGTC ATGTGAAGTA CGACGAACCG GAGCCGCGGC TGTTTTCGTT CAACAATCCG TATGGGGCGT GTCCGAAGTG CCAGGGGTTT GGGAACACCA TCGATTTCGA TATGGACCTG GTGGTGCCCG ATCCAACGCT GACCTTGAAT GGGGGCGCGA TTGAGCCGTG GACGAAGCCG AAGTATCGTC CGCTGGGGAC GGAGATGAAG CGTTATGCGC GGAGCGTGGG TATTCCATTG GATACGCCCT GGCGCGAGCT GACCAAAGAG CAACGTGATG TGCTGATCGA GGGCGACGGC AAGTATCCCG GGGTGCGGGG GTTCTTCAAT CATCTCGAGC GCAAGAAGTA CAAGCTGCAC GTGCGGGTGT TCCTGAGCCG GTATCGCGGG TATTCGCAGT GTTCCAGTTG CGGCGGAGCG CGGCTGCGCA CCGAGGCACG CAATGTACGC GTCGCGGGGA AGAACATCTG CGAAGTGACG GCGATGACGG TCGAGGAGGC AACGAAGTTC TTTTCGACGA TCCAGCTCAC CCGCGAAGAG ACGGAGATCG CGGGCAAGCT GCTGGAAGAG ATCCAGAGCT TGCTGCGCTT CCTGAACGAA GTTGGACTGG AGTATTTGAG CCTCAACCGG CTGGCTTCGA CGCTGAGCGG GGGCGAAGCA CAGCGGATTC AATTGGCGAC TTCGCTGGGA TCACGGCTAG TGGGGACGCT GTATGTGTTG GATGAGCCAT CGATCGGGCT GCACAGTCGA GATACGAACC GGTTGATCCA CATCCTGCAT GACCTGCGCG ATCTGGGAAA CACTATCCTG GTGGTGGAGC ATGATCCGGA GATCATGCAG ACGGCCGACC GCATTCTCGA CCTAGGGCCG GGCGCCGGGG AAAATGGCGG CAAGTTGGTG GCGGCGGGGA CCTACAACGA GATCAAGAAG AACTCGGCAT CGCTGACGGG GCGGTATCTT 
GCGGACGAGT TGCATATTCC GATGCCGACG CAGCGGCGGG AGCCGAACTC GCGGAAGATC GTGGTGAAGA ACGCCTACGC TCACAATCTC AAGGGGATCG ATGTCGAGAT TCCGCTGGGG ATGATGGTGG TGATCACGGG CGTGTCGGGG AGCGGGAAAT CTACTCTGGT GCATGACATC CTGTACCAAG GGCTGGCGAC CGAGAAGCGG CAGGTGACCG GGCTGCAACT CAGCGGGTTC GAGAGCATCG AAGGCGCCGA GTACATCGAC GAAGTTGTGC TGGTGGACCA GTCGCCGATC GGGCGCACCC CGCGATCGAA CCCCATCACC TACATCAAGG CGTTTGACGC GATCCGCGAA CACTTCGCTT CCCTGCCTGA GTCGCAGAAG CGCGGTTACG CGGCGGGACA TTTCTCGTTC AATATTCCGG GCGGGCGTTG CGAAAACTGC CAGGGCGACG GAACTGTGAC GGTCGAAATG CAGTTCCTCG CCGATGTGGA ACTAATCTGC GAGGAGTGCA AGGGGACGCG GTACAAGCCG GAGATTCTTG AGATTCGGTA TCACGGGAAG AACATCCACG AGGTGCTGGA TCTGACGGTG AAGTCGGCGC TGCAGTTCTT CAGCGGATCG CCGAAGATCG TGGACAAGCT GCGTGTGCTC GACGAAGTGG GGCTGGGATA TTTGAGGCTG GGGCAGTCGG CAACCACGTT GAGTGGTGGC GAGGCGCAGC GCATGAAGCT GGCGCTGCAT CTGCAGCCGA AGATGAGGGA CGTCGGCCGT CCGGCGACGA CCGAGGACGG CAAACCGATT CGGCGGCATC CACGGATGCT CTACATCTTC GATGAACCGA CGACGGGGCT GCACTTCGAC GACGTGAGCA AACTTCTGGC GGCGTTCAAG AAGCTGATCG ACGCCGGCGG GTCGATTATC GTGATCGAAC ATAACCTCGA CGTGGTGAAG ACGGCGGATT GGGTGATCGA CCTAGGGCCG GAGGGCGGAA ATCGCGGCGG AAACCTGGTC GTAACCGGAA CACCGGAAAA GGTCGCGAAG ACCAAAGGCT CGTATACCGG CCAGTGGCTG GCGAAATATC TTCCGATCCA CGGAAATGGG TCGCATGACT GA
|
Protein sequence | MSNESIIVRG ARVHNLKNID VEIPHNQLTV VTGVSGSGKS SLAFDTIYAE GQRRYVESLS AYARQFLERI EKPDADLIDG IAPAVAIKQK NSTRNPRSTV ATATEIYDYL RLLFARVGRT YCDNCGGEVK KDTVDEIADR LLAMPEGTRF NVLFPLVQAP APVEPEKKPK GRKPKKQTAP AQDELTKERL FELRKRGFNR LFQTGQIFEF STPESLLDID FSKPVYLLVD RIATAPDNRS RIVDAIESAY REAGEVIFET APREEGGAPE RLRFAQRFEC KNCHVKYDEP EPRLFSFNNP YGACPKCQGF GNTIDFDMDL VVPDPTLTLN GGAIEPWTKP KYRPLGTEMK RYARSVGIPL DTPWRELTKE QRDVLIEGDG KYPGVRGFFN HLERKKYKLH VRVFLSRYRG YSQCSSCGGA RLRTEARNVR VAGKNICEVT AMTVEEATKF FSTIQLTREE TEIAGKLLEE IQSLLRFLNE VGLEYLSLNR LASTLSGGEA QRIQLATSLG SRLVGTLYVL DEPSIGLHSR DTNRLIHILH DLRDLGNTIL VVEHDPEIMQ TADRILDLGP GAGENGGKLV AAGTYNEIKK NSASLTGRYL ADELHIPMPT QRREPNSRKI VVKNAYAHNL KGIDVEIPLG MMVVITGVSG SGKSTLVHDI LYQGLATEKR QVTGLQLSGF ESIEGAEYID EVVLVDQSPI GRTPRSNPIT YIKAFDAIRE HFASLPESQK RGYAAGHFSF NIPGGRCENC QGDGTVTVEM QFLADVELIC EECKGTRYKP EILEIRYHGK NIHEVLDLTV KSALQFFSGS PKIVDKLRVL DEVGLGYLRL GQSATTLSGG EAQRMKLALH LQPKMRDVGR PATTEDGKPI RRHPRMLYIF DEPTTGLHFD DVSKLLAAFK KLIDAGGSII VIEHNLDVVK TADWVIDLGP EGGNRGGNLV VTGTPEKVAK TKGSYTGQWL AKYLPIHGNG SHD
|
| |
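The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. All function names are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); "*" marks a stop codon.
# Table 11 differs from the standard code only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Simplification: an alternative start codon (for example GTG or TTG) is
    not converted to M here; the gene in this record starts with ATG.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence from this record.
prefix = clean_sequence("ATGTCGAACG AGAGCATTAT TGTCCGCGGT")
print(translate(prefix))  # MSNESIIVRG, the first block of the protein sequence
```

Applying `clean_sequence` to the full gene sequence and then `translate` to the result would allow the whole protein sequence field to be compared in the same way.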