Gene Acid345_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2022 
Symbol 
ID4070352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2420423 
End bp2423344 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content60% 
IMG OID637984036 
Productexcinuclease ABC subunit A 
Protein accessionYP_591097 
Protein GI94969049 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACG AGAGCATTAT TGTCCGCGGT GCGCGGGTAC ACAATTTAAA GAACATTGAC 
GTGGAGATTC CGCACAACCA GCTCACGGTT GTGACCGGGG TTTCTGGCTC GGGCAAGTCC
TCCCTGGCCT TTGACACGAT TTATGCCGAG GGGCAGAGGC GGTATGTGGA GTCGCTGTCG
GCGTATGCGC GGCAGTTCCT GGAGCGCATT GAGAAGCCGG ATGCCGACCT GATTGACGGT
ATCGCACCGG CGGTAGCGAT CAAGCAGAAA AATAGCACCC GCAATCCGCG CTCGACGGTG
GCGACGGCGA CGGAGATCTA CGACTATCTG CGGCTGCTCT TCGCCCGTGT GGGCCGGACC
TACTGCGACA ACTGCGGCGG CGAGGTCAAG AAAGATACCG TCGACGAGAT CGCCGACCGG
TTGCTGGCGA TGCCGGAGGG GACGCGCTTT AATGTGCTCT TTCCTCTGGT GCAGGCTCCG
GCCCCGGTCG AACCGGAAAA GAAGCCGAAA GGTCGCAAGC CTAAGAAGCA AACTGCACCG
GCACAGGATG AGTTGACGAA AGAGCGGCTG TTCGAGCTAC GCAAGCGGGG ATTCAATCGG
CTCTTCCAGA CGGGACAGAT TTTCGAATTC TCAACGCCGG AGTCGCTGCT CGATATCGAT
TTCTCGAAGC CCGTTTATCT GCTGGTGGAT CGAATTGCGA CGGCTCCGGA TAACCGTTCG
CGCATTGTGG ATGCGATTGA GTCGGCGTAC CGCGAAGCCG GCGAAGTCAT TTTTGAAACC
GCGCCGCGTG AAGAGGGCGG AGCGCCAGAA CGCCTTCGAT TTGCGCAGCG ATTTGAGTGC
AAGAACTGTC ATGTGAAGTA CGACGAACCG GAGCCGCGGC TGTTTTCGTT CAACAATCCG
TATGGGGCGT GTCCGAAGTG CCAGGGGTTT GGGAACACCA TCGATTTCGA TATGGACCTG
GTGGTGCCCG ATCCAACGCT GACCTTGAAT GGGGGCGCGA TTGAGCCGTG GACGAAGCCG
AAGTATCGTC CGCTGGGGAC GGAGATGAAG CGTTATGCGC GGAGCGTGGG TATTCCATTG
GATACGCCCT GGCGCGAGCT GACCAAAGAG CAACGTGATG TGCTGATCGA GGGCGACGGC
AAGTATCCCG GGGTGCGGGG GTTCTTCAAT CATCTCGAGC GCAAGAAGTA CAAGCTGCAC
GTGCGGGTGT TCCTGAGCCG GTATCGCGGG TATTCGCAGT GTTCCAGTTG CGGCGGAGCG
CGGCTGCGCA CCGAGGCACG CAATGTACGC GTCGCGGGGA AGAACATCTG CGAAGTGACG
GCGATGACGG TCGAGGAGGC AACGAAGTTC TTTTCGACGA TCCAGCTCAC CCGCGAAGAG
ACGGAGATCG CGGGCAAGCT GCTGGAAGAG ATCCAGAGCT TGCTGCGCTT CCTGAACGAA
GTTGGACTGG AGTATTTGAG CCTCAACCGG CTGGCTTCGA CGCTGAGCGG GGGCGAAGCA
CAGCGGATTC AATTGGCGAC TTCGCTGGGA TCACGGCTAG TGGGGACGCT GTATGTGTTG
GATGAGCCAT CGATCGGGCT GCACAGTCGA GATACGAACC GGTTGATCCA CATCCTGCAT
GACCTGCGCG ATCTGGGAAA CACTATCCTG GTGGTGGAGC ATGATCCGGA GATCATGCAG
ACGGCCGACC GCATTCTCGA CCTAGGGCCG GGCGCCGGGG AAAATGGCGG CAAGTTGGTG
GCGGCGGGGA CCTACAACGA GATCAAGAAG AACTCGGCAT CGCTGACGGG GCGGTATCTT
GCGGACGAGT TGCATATTCC GATGCCGACG CAGCGGCGGG AGCCGAACTC GCGGAAGATC
GTGGTGAAGA ACGCCTACGC TCACAATCTC AAGGGGATCG ATGTCGAGAT TCCGCTGGGG
ATGATGGTGG TGATCACGGG CGTGTCGGGG AGCGGGAAAT CTACTCTGGT GCATGACATC
CTGTACCAAG GGCTGGCGAC CGAGAAGCGG CAGGTGACCG GGCTGCAACT CAGCGGGTTC
GAGAGCATCG AAGGCGCCGA GTACATCGAC GAAGTTGTGC TGGTGGACCA GTCGCCGATC
GGGCGCACCC CGCGATCGAA CCCCATCACC TACATCAAGG CGTTTGACGC GATCCGCGAA
CACTTCGCTT CCCTGCCTGA GTCGCAGAAG CGCGGTTACG CGGCGGGACA TTTCTCGTTC
AATATTCCGG GCGGGCGTTG CGAAAACTGC CAGGGCGACG GAACTGTGAC GGTCGAAATG
CAGTTCCTCG CCGATGTGGA ACTAATCTGC GAGGAGTGCA AGGGGACGCG GTACAAGCCG
GAGATTCTTG AGATTCGGTA TCACGGGAAG AACATCCACG AGGTGCTGGA TCTGACGGTG
AAGTCGGCGC TGCAGTTCTT CAGCGGATCG CCGAAGATCG TGGACAAGCT GCGTGTGCTC
GACGAAGTGG GGCTGGGATA TTTGAGGCTG GGGCAGTCGG CAACCACGTT GAGTGGTGGC
GAGGCGCAGC GCATGAAGCT GGCGCTGCAT CTGCAGCCGA AGATGAGGGA CGTCGGCCGT
CCGGCGACGA CCGAGGACGG CAAACCGATT CGGCGGCATC CACGGATGCT CTACATCTTC
GATGAACCGA CGACGGGGCT GCACTTCGAC GACGTGAGCA AACTTCTGGC GGCGTTCAAG
AAGCTGATCG ACGCCGGCGG GTCGATTATC GTGATCGAAC ATAACCTCGA CGTGGTGAAG
ACGGCGGATT GGGTGATCGA CCTAGGGCCG GAGGGCGGAA ATCGCGGCGG AAACCTGGTC
GTAACCGGAA CACCGGAAAA GGTCGCGAAG ACCAAAGGCT CGTATACCGG CCAGTGGCTG
GCGAAATATC TTCCGATCCA CGGAAATGGG TCGCATGACT GA
 
Protein sequence
MSNESIIVRG ARVHNLKNID VEIPHNQLTV VTGVSGSGKS SLAFDTIYAE GQRRYVESLS 
AYARQFLERI EKPDADLIDG IAPAVAIKQK NSTRNPRSTV ATATEIYDYL RLLFARVGRT
YCDNCGGEVK KDTVDEIADR LLAMPEGTRF NVLFPLVQAP APVEPEKKPK GRKPKKQTAP
AQDELTKERL FELRKRGFNR LFQTGQIFEF STPESLLDID FSKPVYLLVD RIATAPDNRS
RIVDAIESAY REAGEVIFET APREEGGAPE RLRFAQRFEC KNCHVKYDEP EPRLFSFNNP
YGACPKCQGF GNTIDFDMDL VVPDPTLTLN GGAIEPWTKP KYRPLGTEMK RYARSVGIPL
DTPWRELTKE QRDVLIEGDG KYPGVRGFFN HLERKKYKLH VRVFLSRYRG YSQCSSCGGA
RLRTEARNVR VAGKNICEVT AMTVEEATKF FSTIQLTREE TEIAGKLLEE IQSLLRFLNE
VGLEYLSLNR LASTLSGGEA QRIQLATSLG SRLVGTLYVL DEPSIGLHSR DTNRLIHILH
DLRDLGNTIL VVEHDPEIMQ TADRILDLGP GAGENGGKLV AAGTYNEIKK NSASLTGRYL
ADELHIPMPT QRREPNSRKI VVKNAYAHNL KGIDVEIPLG MMVVITGVSG SGKSTLVHDI
LYQGLATEKR QVTGLQLSGF ESIEGAEYID EVVLVDQSPI GRTPRSNPIT YIKAFDAIRE
HFASLPESQK RGYAAGHFSF NIPGGRCENC QGDGTVTVEM QFLADVELIC EECKGTRYKP
EILEIRYHGK NIHEVLDLTV KSALQFFSGS PKIVDKLRVL DEVGLGYLRL GQSATTLSGG
EAQRMKLALH LQPKMRDVGR PATTEDGKPI RRHPRMLYIF DEPTTGLHFD DVSKLLAAFK
KLIDAGGSII VIEHNLDVVK TADWVIDLGP EGGNRGGNLV VTGTPEKVAK TKGSYTGQWL
AKYLPIHGNG SHD