Gene Acid345_0779 details


Gene Information

Locus tag: Acid345_0779
Symbol:
ID: 4069524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 961481
End bp: 963307
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 60%
IMG OID: 637982785
Product: ATP dependent DNA ligase
Protein accession: YP_589858
Protein GI: 94967810
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR02776] DNA ligase D
            [TIGR02777] DNA ligase D, 3'-phosphoesterase domain
            [TIGR02779] DNA polymerase LigD, ligase domain
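
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 961481
END_BP = 963307
GENE_LENGTH_BP = 1827
PROTEIN_LENGTH_AA = 608


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced, complete CDS whose length is a multiple of 3.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under these assumptions the fields agree: 963307 - 961481 + 1 = 1827 bp, and 1827 / 3 - 1 = 608 aa.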


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.991922
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.120219
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACTCG AGGAATACAA AAAGAAACGT CGCTTTACTG ACACCCCCGA GCCTCCACCC 
TCGATCGACA AAAGCAAAGG TCATCGCTTC GTGGTGCAGA AGCACCACGC CTCACGTCTC
CACTACGACT TCCGCCTTGA GATGGACGGC GTGCTGAAGT CCTGGGCCGT GCCCAAAGGC
CCATCGCTCG ATCCCGCCGA CAAGCGCCTC GCCATGGCTG TCGAAGATCA TCCCGTCTCG
TATCTCAAAT TCGAAGGCAT CATCCCCGAG AACAACTACG GCGCAGGCAC CGTGATGGTC
TGGGACATCG GCACCTGGGA GCCGGTTGGC GACGCCGACG CCATGCTCGC CAAGGGCGAT
CTCAAATTCC GGTTGAAGGG CAAAAAGCTC AACGGCGAAT TCGCTCTGGT GCACATCAAG
TCGCGCCGCT CCGGCACCAA GGGTAACGAG TGGCTGCTGA TCAAGCACCG CGATGACGCC
GTCGTCCCTG GCTACGACAT CGACGAGTAC GACTTCTCCG CCCTCACCAA GCGCTCGCTC
GACGACATCG CCGGCGACCA GAAATCGGCC GAGTGGCAAA GCAATCGCGC CGGATCCTCC
AACATCCCGC AGAAAAGCGC GTGGCTCGCG GACGCCATCA AGAAGGCCGA CAAGAAAGCT
GCCGCAAAAA AGACCGCTGT AAAAACAAAG GCTCCAGCAA AAAAGTCCGC CAAAACCGCA
GCCAAGAAAG CTGTGAAGAC CACCGCAACC AAGAAACAGA AAGACGCGCA CCCGGCGTTC
GCCGATCTCA AAGGCGCGCG TCACGCCGCG ATGCCGTCGC AAATCCAGCC CATGCTCGCC
ACGCTCGTAG ATGAGCCCTT TGAAGACTCC CAGTGGCTCT ACGAGATCAA GTGGGACGGC
TATCGCGCCG TCACCTTCCT CAACGATGGC AAACTTCGCT TCGTCTCGCG CAACGGCAAC
GACCTAACTA ACGCCTATCC TGAACTGCAC GACATCGGCG GAAGCATCTC CGCGCAACGC
GCCATTCTCG ACGGCGAAAT CGTCGCCCTC GACGGCGAAG GCCGCTCCTC CTTCAGCCTG
ATGCAACAAC GCACTGGCAT CGGCGAGGGC GGACGCCGCA CCGGCAAGGG CAACGCCAAC
ATTCCCGTGC AGTATTACGC CTTCGACCTG CTCTACCTTG ACGGCTACGA CCTCACGCAC
GTCTCGCTCG AGGACCGCAA ACGGGTGCTC AGCGAAATCA TCTCGCCCAG CGACGTACTG
CGTGTCTCTG ACTCCTTCGA CGAGGGCCTG CCTCTCTACG AAGCCGCCCG CGCGCGCGGG
CTCGAAGGCA TCATCGCCAA GCGTCGCGAG AGTTGCTATC TCACCAAGCG CAGCCGCGAG
TGGCTGAAGA TCAAGATCAC GCAGCGCCAG GAGTGCGTGA TTGGCGGCTA TACCGAGCCC
AAGGGCAGTC GCGAAAATTT CGGCTCCGTC GTCCTCGGCC TCTACGACGA CAAAGGCCGC
CTCATCCCCG TTGGCCAGGC CGGCAGCGGT TTCACCGCCC AGTCAAACGC TGCCCTGTGG
AAGAAACTCC AGAAGCTCGA AACCAAAACT TCACCGTTCT TCGGCAAGCC CGACAGCCCG
CGCCAGGTCC ACTATGTCCG CCCTGAACTC GTTGCCGAAA TCAAGTTCAC CGAGTGGACG
CACCAAGGCC AAAGCGGCCA GGTCAGAATG CGCGCCCCTG TTTTCGAAGG GCTGCGCACT
GACAAATCGC CGAGCGAATG CGTCTTCGAT TTCGCGAAGC CAACAAAATT AGAAGTGAAA
AAAGCCGAAA GCGGCGACGC CGCGTAG
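
The gene sequence above is printed in blocks of 10 bases, so whitespace has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from such a block-formatted sequence. The helper name `gc_fraction` is hypothetical, and the sketch assumes the sequence contains only the letters A, C, G and T.

```python
def gc_fraction(block_formatted_seq):
    """Fraction of G and C bases in a sequence printed in spaced blocks.

    All whitespace (spaces and line breaks) is discarded first.
    """
    bases = "".join(block_formatted_seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return gc_count / len(bases)


# Toy input for illustration only; paste the full gene sequence
# from the record to compare against the listed GC content.
print(gc_fraction("ATGC GCGC"))  # 0.75
```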
 
Protein sequence
MALEEYKKKR RFTDTPEPPP SIDKSKGHRF VVQKHHASRL HYDFRLEMDG VLKSWAVPKG 
PSLDPADKRL AMAVEDHPVS YLKFEGIIPE NNYGAGTVMV WDIGTWEPVG DADAMLAKGD
LKFRLKGKKL NGEFALVHIK SRRSGTKGNE WLLIKHRDDA VVPGYDIDEY DFSALTKRSL
DDIAGDQKSA EWQSNRAGSS NIPQKSAWLA DAIKKADKKA AAKKTAVKTK APAKKSAKTA
AKKAVKTTAT KKQKDAHPAF ADLKGARHAA MPSQIQPMLA TLVDEPFEDS QWLYEIKWDG
YRAVTFLNDG KLRFVSRNGN DLTNAYPELH DIGGSISAQR AILDGEIVAL DGEGRSSFSL
MQQRTGIGEG GRRTGKGNAN IPVQYYAFDL LYLDGYDLTH VSLEDRKRVL SEIISPSDVL
RVSDSFDEGL PLYEAARARG LEGIIAKRRE SCYLTKRSRE WLKIKITQRQ ECVIGGYTEP
KGSRENFGSV VLGLYDDKGR LIPVGQAGSG FTAQSNAALW KKLQKLETKT SPFFGKPDSP
RQVHYVRPEL VAEIKFTEWT HQGQSGQVRM RAPVFEGLRT DKSPSECVFD FAKPTKLEVK
KAESGDAA
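
The protein sequence can be related back to the gene sequence by codon translation. The following is a minimal sketch, assuming the standard amino acid assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative. The inputs are the first 30 and the last 27 bases of the gene sequence in this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(block_formatted_seq):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace is removed first; any trailing partial codon is ignored.
    Raises KeyError on characters other than A, C, G, T.
    """
    bases = "".join(block_formatted_seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGCACTCG AGGAATACAA AAAGAAACGT"))  # MALEEYKKKR
print(translate("AAAGCCGAAA GCGGCGACGC CGCGTAG"))     # KAESGDAA
```

Both results match the start and end of the protein sequence listed above, and the final codon TAG is the stop codon that is excluded from the 608 aa protein length.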