Gene Acid345_3117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3117 
SymboluvrC 
ID4070231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3705552 
End bp3707384 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content59% 
IMG OID637985136 
Productexcinuclease ABC subunit C 
Protein accessionYP_592192 
Protein GI94970144 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTCC ACCAGAAAAT CCGTACCCTC CCTACCTCTC CCGGCGTGTA CCTCTACAAA 
AACGCCGAGG GCGAGATCAT TTACGTGGGG AAGGCTAAGA ACCTGCGCTC GCGGGTGGGG
TCGTATTTCG TTCGCGGGGC CGACGAAAAC TCCAAGACCG GCAGTCTCCT GCGCGAGGCG
GTGGACGTCG AGTACATCGT CGTCGATAAC GAGAAAGAAG CCCTCGCTCT CGAGAACAAC
CTCATCAAGC AGAAGAAGCC CCGCTTCAAC ATCCTTCTGC GCGACGACAA GACCTATCCC
TACATCAAGC TGACGATGGG CGAGAAGTGG CCGCGCGTCT ATGTCACCCG CCGCCTGAAG
AAAGACGGCT CCGAATACTA CGGTCCGTTC TTTCCGGCGA ACCTCGCCTA TCGCGTGGTG
GACTTGATCC ACCGCAACTT CCTCGTCCCA AGCTGTTATA TCGATCTCCG TCGATATCAT
CCGCGCCCGT GTCTGCAGCA CTACATCGGG CGCTGCCTCG GCCCATGCGT CGAAGGTCTG
ACGAATGAAG TGCAATACGG CGAAGCCGTA AAAGACGTAA AGCTCTTTCT CGAAGGCCGT
CACTCCGACT TGAAGCAGTC GCTCACCGCG CGCATGAATA AAGCCGCGGA AGGCATGCAG
TTTGAACTGG CGGCGAAGTA TCGCGACCTG ATCACAACCG TAGAAGACCT GCACCAGAAG
CAGCGCATCG CAGCCGCCGA GGGCGACGAC GCAGATGTCT TCGGCTACCA CTACGAGAAC
CACATGGTCG CCGTGAACCT CTTCCACATG CGCGGAGGCA AGGTCCTCGA CCGCCGCGAT
TTCTTCTTCG AAGACCTCGG TGAAATGGAA GCCACTGGCG GCCTCAACAC CGGTGAGTTT
TTCAGTACGC TCTTGCAACA GATCTATCTC GACAACAAGT ACGTGCCTCG CACCATCTAC
GTCCCGGTAG AGTTCGAAGA CCGCGAAGCC CTCTGCGAGA TTCTCAGCGA GCAGATGCAC
CGCAAGATCG ATATCAATGT CCCGCAGCGT GGCGACAAGC GCTCACTCAT CGACCTCGTT
GCTCAGAACG CCAAGCAGTC CTACGACCAG CGCTTCCGTG TTATGCGCCC GCAGACCGAC
GTCCTCAAGT CTGTCCTGCA AGACACGCTC GAGCTGCCCG AATTGCCGAA CCGCATCGAG
TGCTTCGACA TTTCGCACAT CCAGGGCGCC GAGACCGTAG CCAGCATGGT GGTGTGGGAA
GACGGCAAGA TGAAAAAGTC CGACTACCGG AAATTCATCA TCAAGACCGT GCAAGGTGTG
GACGACTTCG CTTCCATGCG CGAGGTCGTA ACCCGTCGCT ACAAGCGCAT CGTCGAGGAG
AACCAACCAA TGCCGAGCCT GGTCCTCATC GACGGCGGTG TCGGCCAACT CCACGCCGCG
GCTGGCGCCC TCGAAGCCAT CGGCATTACC AACCAGCCGC TCGCGTCGAT CGCCAAGCGC
GAAGAGATCA TCTACGTCCA CGGCCGCGAA GACGAACCTA TCCGCATCGA CCACCACTCG
CCGGTGCTGC ACATCATCCA GCTCATCCGA GACGAAGCCC ACCGTTTTGC GATCACTTTC
CATCGCAAAC GCCGCGAAAT CCGCGACCGC AGCAACGAGC TGTTGGAAAT CCCTGGCATC
GGCGAACAAG CCATGAAGCG CCTGCTCCGC CACTTCGGCA GCATTCAGTC GATCCGCACG
GCGAATGCAA CCAGCCTCGA AGCCGTGGTG AACCGCACCC AAGCCGAAGC GATACTCGCG
CACTTCCGCG CCGAAGAAAC CACACGTTCG TAG
 
Protein sequence
MDLHQKIRTL PTSPGVYLYK NAEGEIIYVG KAKNLRSRVG SYFVRGADEN SKTGSLLREA 
VDVEYIVVDN EKEALALENN LIKQKKPRFN ILLRDDKTYP YIKLTMGEKW PRVYVTRRLK
KDGSEYYGPF FPANLAYRVV DLIHRNFLVP SCYIDLRRYH PRPCLQHYIG RCLGPCVEGL
TNEVQYGEAV KDVKLFLEGR HSDLKQSLTA RMNKAAEGMQ FELAAKYRDL ITTVEDLHQK
QRIAAAEGDD ADVFGYHYEN HMVAVNLFHM RGGKVLDRRD FFFEDLGEME ATGGLNTGEF
FSTLLQQIYL DNKYVPRTIY VPVEFEDREA LCEILSEQMH RKIDINVPQR GDKRSLIDLV
AQNAKQSYDQ RFRVMRPQTD VLKSVLQDTL ELPELPNRIE CFDISHIQGA ETVASMVVWE
DGKMKKSDYR KFIIKTVQGV DDFASMREVV TRRYKRIVEE NQPMPSLVLI DGGVGQLHAA
AGALEAIGIT NQPLASIAKR EEIIYVHGRE DEPIRIDHHS PVLHIIQLIR DEAHRFAITF
HRKRREIRDR SNELLEIPGI GEQAMKRLLR HFGSIQSIRT ANATSLEAVV NRTQAEAILA
HFRAEETTRS