Gene Acid345_4083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4083 
Symbol 
ID4072505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4836675 
End bp4838375 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content58% 
IMG OID637986114 
ProductATP-dependent OLD family endonuclease 
Protein accessionYP_593157 
Protein GI94971109 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.186045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTCC ACTCCGTACT GGTCGAGAAC TTTCGTGGAA TTCGTCGCGC CGAAGTAAGT 
TTCAGCCGCG ACACCGTGCT TATTGGCGAA AACGACTCCG GAAAATCTCG GGTCATCGAA
GCCCTCTGCC TCGTTCTGAA CAGCTCTTCA GGCATCATCC CCTTCGAACC TTGTCACTTT
TGCGTTCCCG TCGAGGAAGC AAATCAGTTT TCGGTCCCCA TTCGAATCAC GATTACATTT
GCAGAACGCC ACAAAGGCGA ATGGTCGAGC GCCACATATC AGCCGATTCA GTCCTTGCTC
GCGCCCGAGA GTTCACGCCA GCGAGAACTC CGCCTCGAAG TCCACGCGGC CTCCGCCGAC
GCCAAGCCTC GCATCCGCTT CTTTTCCAGC GGCGGGGCGC TCGCTGGCTC GGAATCTCCC
GAATTGCTCG CATGGGTGCG CGCTAATAAC CCAATCATCC GACTCGAGCA GGGCCTGCTT
AATCAGGGCG GAAGAAATGG CCCGGTTCGC AACAAGGAGA TTCAGAGCTA TTCCGATCTT
GTTGACCGTC ACTACGCTGC GTTGGTTGGG GGCGCTTCCA CAAACGCAAC CCTCGACCTC
GAAGCCGGCT TCAAAGCGGC TCAAACCTTG GTCGCGCTCT CCTCGCAACA CCTTGATGCC
AACGCCCGCC GCACCAACTG GCTCCTCGAA GAGATCGCAG GTTCCACACT CAAACTTTCA
AGTCGTTTCG CAAACGACAG CACCTCCGCC CCACCCTCTG GAAATTCCTA CAAAATCGGG
CTGCTCCTTT TCGTTGGGGC ACTCTTGCGC GCACATTCGC AGCCCTTTGG TCCAGATGCC
GAACCCATCG TGATCTTTGA AGATCCGGAA GCGCACCTCC ATCCCCTGAC GTTGGCTTCC
GTTTGGTCGT TGATTGAGCG AATGCGCTGG CAAACGATCG TTTCCACCCA CTCCGGCGTC
CTCCTCACCG AGGCCCCACT CCACAGCATC CGTCGCCTGA TCCGTGTGAA CGATGAAATC
ATTGTTCATC GCGTGCGTGA CCGTGCGCTT ACCAAAACCG ACATGCGCAA GTTCCGCTAT
CACGTCCGCT CGCGTCGCGG GGCAGCCATG TTTGCCCGCT GCTGGCTGCT CGTTGAGGGT
GAAACCGAAT TCTGGCTCCT CCCCGAACTC GCGCGGCTCC TGGGATACAA CTTCGACGCC
GAGGGTATCT CCTGCGTTGA ATTCGCCCAA TGCGGCATTC CGCCTCTCGT CAAAGCCGCC
CGCGAGTTGG GCATCGAGTG GCACCTTCTC ACCGATGGCG ACAACTCCGG CGCCATCTAT
TCCGCCACCG CCGGACGTTT CTGCGGACCA AATGAGCTTC CAACCCGAAT TACCAGGCTC
CGCGAACCCG ACATCGAGCA CTGCTTCTGG CACAGCGGAT ACGCCAACGC AATTCTGAAG
CTAGCTGGCC GTACATCTTC CAGCCGTCCC AATCCTCGCT GGATCATCCG TGCAGCGATC
GACAAATCGT CAAAGCCCTA TCTCGCTCTG CAACTTCTAG AAGCCGTCGC CTCGAGCGGT
TCGGCTGGAG TTCCTCCCGC CCTGAAACGT GTCATCGAAT CCTGCGTTCG GATGGCTCGC
CGCACCAACG AACAATCTGC ACCCGATGTG ACCCAACCAG CTCTGAAAGA CAACCTGCTT
CACATGGAAG CGAGAAAGTG A
 
Protein sequence
MQLHSVLVEN FRGIRRAEVS FSRDTVLIGE NDSGKSRVIE ALCLVLNSSS GIIPFEPCHF 
CVPVEEANQF SVPIRITITF AERHKGEWSS ATYQPIQSLL APESSRQREL RLEVHAASAD
AKPRIRFFSS GGALAGSESP ELLAWVRANN PIIRLEQGLL NQGGRNGPVR NKEIQSYSDL
VDRHYAALVG GASTNATLDL EAGFKAAQTL VALSSQHLDA NARRTNWLLE EIAGSTLKLS
SRFANDSTSA PPSGNSYKIG LLLFVGALLR AHSQPFGPDA EPIVIFEDPE AHLHPLTLAS
VWSLIERMRW QTIVSTHSGV LLTEAPLHSI RRLIRVNDEI IVHRVRDRAL TKTDMRKFRY
HVRSRRGAAM FARCWLLVEG ETEFWLLPEL ARLLGYNFDA EGISCVEFAQ CGIPPLVKAA
RELGIEWHLL TDGDNSGAIY SATAGRFCGP NELPTRITRL REPDIEHCFW HSGYANAILK
LAGRTSSSRP NPRWIIRAAI DKSSKPYLAL QLLEAVASSG SAGVPPALKR VIESCVRMAR
RTNEQSAPDV TQPALKDNLL HMEARK