Gene Acid345_0776 details


Gene Information

Locus tag: Acid345_0776
Symbol:
ID: 4069521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 955815
End bp: 958760
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 11
GC content: 61%
IMG OID: 637982782
Product: excinuclease ABC subunit A
Protein accession: YP_589855
Protein GI: 94967807
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
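
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (consistent with the gene being unspliced and the gene sequence ending in TAA). The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 955815
END_BP = 958760


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2946 981
```

Both values agree with the Gene Length (2946 bp) and Protein Length (981 aa) fields in the record.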


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.839433
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0723742
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCACT GTGGGGCCCC GACAATTACC AGAGGCCATG TTCCCGGCGG AATTCGGCCA 
GCAACAAGGG TTATAGCGTT TCACACGATG GCGATTTCCA AGATCACGGT GCGCGGGGCG
CGCCAGCACA ATCTCAAAAA CATCACGGTC GAAATCCCGC GGAACACGCT CACCGTCATT
ACCGGTTTGA GCGGCTCCGG AAAATCGTCG CTCGCCTTCG ATACGATCTA CGCTGAGGGC
CAGCGGCGCT ACGTTGAAAC GCTCTCGGCG TACGCGCGCC AGTTCCTCGA CCAGATGGAA
CGTCCCGATG TGGATTCGAT TGACGGCCTC AGCCCTGCGA TCTCCATCGA GCAGAAGACT
ACCAGCCGCA GCCCGCGCTC CACAGTCGGC ACTATCACCG AGATTTACGA CTATCTTCGC
CTTCTCTATT CGTCGATCGG ACTACCCCAC TGTCCGCAGT GCGGGCGCGC AATTTCGCGG
CAGTCTGTCG AACAGATCGT AGCGCGAGTG TTGGAGCTGA AACCCGAGGA CCGCGTCATG
CTGATGGCGC CGATCGTCCG CGGCCGCAAA GGCGAGTTTA AGAAAGAGAT GGAAAAACTC
GCGCAGCACG GCTTCACGCG CGCGCGCATT GACGGCGAAC TGCGCAACAT TGCCGACGAA
GAGATCAAGC TCGACAAGCG CAAGAACCAC ACCATCGAAG TCGTGATTGA CCGCCTGCTG
GTAAAACCCG GAATCGAGAA GCGCCTCGCA GCGTCGGTCG AGCTCGCGAT GAAACTAGGC
AGCGGGCTGG TGCAGGTGGC CGTGGTTGGT GGTGATGAGC ATCTCTTCTC GTCGCGACTG
GCCTGCCCGG AATGCGGCAT CAGCGTTCCG CAACTCGAGC CGCGCTCGTT CTCGTTCAAC
AGCGTGTATG GCGCTTGTCC GGAGTGCCAC GGCCTGGGCA ACAAGTACGA TTTCGATCCC
GCGAAGATCA TCACTGACTG GTCAAAGCCG CTGCTTGATG GCGGCCTTGG TCCAGGTTCG
GCCTCCGGCA ATCTCATCCG CATGGTGGAG ATCGCCGCCG CCGCGAACGA TATTGATCTC
AAGCTGCCCT TCGAACAGCT CCCGGAGAAG CAGCAGAACC TGCTGCTCTA CGGCGCGACG
AATGGCAACG GCCGCAGCGG CTTCAAAGGC GTTCTTGCCT ACTTGAAGCA GAACCTCGAC
GAGAGCACCA GCGAAGGCTA TCGCGACTGG CTGCTCGCTT ACATGTCACC CACCGAATGT
CCGGTGTGCC ACGGCAAACG ACTGCGCCCG GAATCGCTCG CGGTAAAAGT GAATGGCATG
TCCATCGCCG ACTTCACCGC GCTTCCGGTC TCACGTTCGG TAGATGCGGT GAAAGACATC
AAGCTCAACG AACGTGAAGA TCGCATTGCC GGCCGCGTGC TGCGCGAAAT CGGCGAACGG
CTCGGCTTCC TGAACCATGT CGGGTTGGGA TACATCTCGC TCAGCCGCTC GGCGGCAACG
CTCTCCGGTG GCGAAGGGCA GCGCATCCGC CTCGCGACGC AGATTGGGTC GAAGCTCCGC
GGCGTGCTCT ACGTTCTCGA CGAGCCATCC ATCGGCCTGC ATCATCGCGA TAACGAGCGC
CTGATCACCG CGCTCGAGGA GCTTCGCGAT CTCGGCAACA CGGTGCTCGT CGTCGAGCAC
GACGAAGAAA CCATCCGTCG CGCCAACTAC GTCGTAGATC TTGGTCCCGG CGCCGGACGC
CACGGCGGCG AACTGGTTGC TCACGGCACG CCATCCGATA TCGAAGCTGC GCCCGAGTCG
CTGACAGGGC AATACATCTC CGGCCGCCGC GCCATCGGCA TTCGTCACGA ACGCCGCGCG
GTCACCGACA AAGGGATCGC CATCCTCGGA GCGCGCGAGA ACAACCTCAA GAACGTGGAC
GTCAGCTTCC CGCTGGGCGT GATGACGGTC GTCACCGGTG TCTCCGGCTC AGGCAAATCC
ACGCTGGTGA ACGACATCCT CTACCGCGCG CTCGCCCAGA AGCTTTATCG CTCGCGCGAG
GAGGCCGGCC AGCACAAGTC CATCAGCGGC ACCGAGAACA TCGACAAGGT CATCCGCATT
GACCAATCGC CCATCGGACG CACTCCGCGT TCGAATCCGG CGACCTACAC CGGCGTGTTC
TCCAACATCC GCGACCTCTA CGCCATGCTG CCGGAATCGC GCGAGCGTGG CTACAAAGCC
GGACGATTCT CGTTCAACGT TGCCGGCGGA CGCTGCGAGG CCTGCCAGGG CGAAGGCCAG
CGCCGCATCG AGATGAATTT CCTTCCCGAC GTCTACGTGC AATGCGAGGT CTGCAACGGT
CGCCGCTACA ATCACGAGAC TCTCGCCGTG AAGTACAAGG GCCACAGCAT CGCCGACCTG
CTGGAGCTTC CAGTCGCCGA CGCGCTCGCC GTGCTCGAAG CCATTCCTCA GGTGAAGCAG
CGCCTTCAGA CTTTAGTGGA TGTCGGCCTC GGCTATATTC ATCTCGGTCA ATCTGCCGTA
ACTCTCTCCG GCGGCGAGGC CCAGCGCATC AAACTGGCGA GGGAATTGAG CAAGCGCCAG
ACCGGCAAAA CGTTGTACCT GCTCGACGAA CCGACCACCG GCCTTCACTT CGAAGACGTT
AACAAACTGC TCGACGTGCT GCATCGTCTG ACCGATCTCG GAAACACGAT CATCATCATC
GAGCACAACA TGGATGTCAT CCGGAACGCC GACTGGATTA TTGACCTCGG GCCGGAGGGT
GGCGAAGACG GTGGAAAAAT TGTGGCGCAA GGGACCCCCG AAGCGGTGTC TAAGGTAAAG
AAGAGTTATA CCGGCCAGGC GCTCGCCCAG TCGCTGAAGA ACAGCGTGGT GCGTGCGCTG
CCCGCGAAGG TCGCAGCAGA AATTGCTCTG CCACGACCGA CACGAGAATC TAGATCAGAC
GGATAA
 
Protein sequence
MAHCGAPTIT RGHVPGGIRP ATRVIAFHTM AISKITVRGA RQHNLKNITV EIPRNTLTVI 
TGLSGSGKSS LAFDTIYAEG QRRYVETLSA YARQFLDQME RPDVDSIDGL SPAISIEQKT
TSRSPRSTVG TITEIYDYLR LLYSSIGLPH CPQCGRAISR QSVEQIVARV LELKPEDRVM
LMAPIVRGRK GEFKKEMEKL AQHGFTRARI DGELRNIADE EIKLDKRKNH TIEVVIDRLL
VKPGIEKRLA ASVELAMKLG SGLVQVAVVG GDEHLFSSRL ACPECGISVP QLEPRSFSFN
SVYGACPECH GLGNKYDFDP AKIITDWSKP LLDGGLGPGS ASGNLIRMVE IAAAANDIDL
KLPFEQLPEK QQNLLLYGAT NGNGRSGFKG VLAYLKQNLD ESTSEGYRDW LLAYMSPTEC
PVCHGKRLRP ESLAVKVNGM SIADFTALPV SRSVDAVKDI KLNEREDRIA GRVLREIGER
LGFLNHVGLG YISLSRSAAT LSGGEGQRIR LATQIGSKLR GVLYVLDEPS IGLHHRDNER
LITALEELRD LGNTVLVVEH DEETIRRANY VVDLGPGAGR HGGELVAHGT PSDIEAAPES
LTGQYISGRR AIGIRHERRA VTDKGIAILG ARENNLKNVD VSFPLGVMTV VTGVSGSGKS
TLVNDILYRA LAQKLYRSRE EAGQHKSISG TENIDKVIRI DQSPIGRTPR SNPATYTGVF
SNIRDLYAML PESRERGYKA GRFSFNVAGG RCEACQGEGQ RRIEMNFLPD VYVQCEVCNG
RRYNHETLAV KYKGHSIADL LELPVADALA VLEAIPQVKQ RLQTLVDVGL GYIHLGQSAV
TLSGGEAQRI KLARELSKRQ TGKTLYLLDE PTTGLHFEDV NKLLDVLHRL TDLGNTIIII
EHNMDVIRNA DWIIDLGPEG GEDGGKIVAQ GTPEAVSKVK KSYTGQALAQ SLKNSVVRAL
PAKVAAEIAL PRPTRESRSD G
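
The sequences above are displayed in blocks of 10 characters, so they need to be joined before any analysis. The following is a rough sketch of that cleanup, plus a simple GC fraction calculation that could be run on the full gene sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence shown above.
first_line = "ATGGCTCACT GTGGGGCCCC GACAATTACC AGAGGCCATG TTCCCGGCGG AATTCGGCCA"
last_line = "GGATAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))               # 60 (six blocks of 10 bases)
print(head.startswith("ATG"))  # True: the sequence begins with a start codon
print(tail[-3:])               # TAA: the sequence ends with a stop codon
```

Passing the complete cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field in the record.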