Gene Acid345_4656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4656 
Symbol 
ID4070813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5512291 
End bp5514312 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content60% 
IMG OID637986696 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_593730 
Protein GI94971682 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.203554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA CCAAAGATCC CGCAAAGCAA GCCGAAGACC TGCGCGAAAA GCTGCGTTAT 
CACGAACATC GCTATTACGT GCTCGACGAC CCGGAAATCT CGGACGCCGA CTATGACGTG
ATGATGAACG AGTTGAAGGC CTTGGAGGCC AAGCACCCCG AGCTGTTGAC CCCGGATTCG
CCCACCCAGC GCGTGGGCGG AAAGCCGCGC GAGGGCTTTG TAAAAGTGGC GCATTCCGCG
CCCATGCTGT CGCTGGACAA CGCCTACAAC GAGGAGGAAC TCCGCGACTG GGCCCGGCGC
GTAGAAGAAC TCAGCGGGAA GGCCGAGATC GAGTACGAGT GTGAGTTGAA GCTGGATGGG
CTCTCGATGG CGCTGCGCTA CCAGGATGCG CGCTTTGTGC TGGCCGTCAC CCGCGGCGAT
GGCTCCATCG GCGAAGACGT GACGCTCAAC TTGCGGACAG TGAAGTCGGT GCCGCTCGGC
GTCAGTTCCG CCACGTTGAA GAAGACCCAC ATGCTCGGCG ATTTCGAAGT GCGCGGCGAA
GTGATCTTCC CAACCAAATC GTTCGAGAAG ATGAACGAAG ACCGCGAAAA GCAGGGGCTG
GCGAAGTTTG CTAACCCGCG AAACGCGGCG GCCGGCGCCG TGCGCGTGCT GGAGCCCAAC
ATCACCGCGC AGAGGCGTCT GGATTTTTAT GCGTACTTCC TGCTGGTGGA CGGCCGCGTG
CATATCGATC GGCAATCCGA GGCGCTCGAC ACGTTGGAGA AACTTGGGTT CAAGGTGAAT
TCCAATCGCG CGGTCTTCAA GTCGATTGAT GACGTGCTGA AATTCATCCA CAAGAAAGAA
GAAGATCGCG AGAAGCTGCC TTACGAAATT GACGGCGTCG TGATCAAGGT CAACAGCACC
GCACTCTGGC AGCGCCTGGG CTTCACCGGC AAAGCGCCGC GTTGGGCGAT CGCTTACAAA
TACGCGGCGC GCGCGGCCGT TACGCAGGTG GAAGACATTC TTGTGCAGGT GGGACGCACC
GGGAAACTCA CGCCAGTCGC GGCTTTGAAG CCTGTGCCCA TCGGCGGCAC AACGGTGAGC
CGCGCCACCC TCCACAACAT GGACGAGATC GATCGCCTTG GATTGCTCAT CGGCGATTGG
GTGCAGGTCG AGCGCGGCGG CGATGTGATC CCCAAGGTCG TGAAGGTCAT CGACGACAAG
GATCACCCGC GCGGCAAGAA GAAATTCAAG ATGCCCGAAC GTTGCCCCGA ATGCGGCGGC
CACGTTGTAC GCACCGAGGG CGAGGCCGAC CATCGCTGTG TGAATGCGAA TTGTCCGGCG
AAACTGCGCG AGAGCATTCT GCACTTCGCG TCGCGCGGCG TGATGAACAT CGAGGGAATG
GGCGATTCGC TGGTCAACCA ACTCGTCGAC CGAGGGCTGG TAAAGAACGT GGCCGATATC
TACGAACTCG ACGAAGAGAA GCTTCTCTCG CTCGAGCGCA TGGGCAAGAA GTCAGCTCAG
AACATCCTCG ACGAGATTAA AGGCACGAAG AAGTTGCCGC TGGAGCGCGT GATCTACGGT
CTCGGCATCC GCATGGTAGG CGAGCGCACC GCGCAATTCC TCGCCGAACA CTTCGGTTCG
CTCGATGGCG TGATGAAAGC CACCGAAGAA GAGCTGCTGG AAGTCGAAGA AGTCGGGCCG
CGCATCGCGC AGAGTATTCA CGAGTTCTTC GCCGAGCCCA GCAATCGCGA ACTGGTAAAA
CGCCTCGAAG CCGCCGGGCT GCAATTCAAG GGCGTAAAGA AAGAGCGCGG CACCGCGCTC
GCCGGACAAA CCTTCGTCCT GACCGGCAGC TTACCGACCT ACTCGCGCGA TGAAGCCAAG
AAACTGATCG AAGATGCCGG CGGAAAAGTC AGTGGGTCGG TGAGCAAAAA AACCAACTAT
GTCGTCGCCG GCGAAGAGGC CGGATCGAAG CTCGACAAAG CCCGCGACCT GGGCGTTGCG
GTAATCGACG AAGATGCCCT GAAAAAACTG CTAGGGAAGT AG
 
Protein sequence
MSRTKDPAKQ AEDLREKLRY HEHRYYVLDD PEISDADYDV MMNELKALEA KHPELLTPDS 
PTQRVGGKPR EGFVKVAHSA PMLSLDNAYN EEELRDWARR VEELSGKAEI EYECELKLDG
LSMALRYQDA RFVLAVTRGD GSIGEDVTLN LRTVKSVPLG VSSATLKKTH MLGDFEVRGE
VIFPTKSFEK MNEDREKQGL AKFANPRNAA AGAVRVLEPN ITAQRRLDFY AYFLLVDGRV
HIDRQSEALD TLEKLGFKVN SNRAVFKSID DVLKFIHKKE EDREKLPYEI DGVVIKVNST
ALWQRLGFTG KAPRWAIAYK YAARAAVTQV EDILVQVGRT GKLTPVAALK PVPIGGTTVS
RATLHNMDEI DRLGLLIGDW VQVERGGDVI PKVVKVIDDK DHPRGKKKFK MPERCPECGG
HVVRTEGEAD HRCVNANCPA KLRESILHFA SRGVMNIEGM GDSLVNQLVD RGLVKNVADI
YELDEEKLLS LERMGKKSAQ NILDEIKGTK KLPLERVIYG LGIRMVGERT AQFLAEHFGS
LDGVMKATEE ELLEVEEVGP RIAQSIHEFF AEPSNRELVK RLEAAGLQFK GVKKERGTAL
AGQTFVLTGS LPTYSRDEAK KLIEDAGGKV SGSVSKKTNY VVAGEEAGSK LDKARDLGVA
VIDEDALKKL LGK