Gene Acid345_4475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4475 
Symbol 
ID4070958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5309657 
End bp5311387 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content60% 
IMG OID637986514 
ProductDNA ligase I, ATP-dependent (dnl1) 
Protein accessionYP_593549 
Protein GI94971501 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCTTC TCGCCCAAAC CTGCGAGGCC ATAGCCGCCA CCTCTAAAAA GACGGAAAAG 
ATCGCCATCG TCGCGACGTA TTTACAGTCG CGGACGGTGC CGGAAGCTGC GCTGTCGACG
CTGTTTCTCT CGGGGCGGAC TTTTGCGGCG CACGAAGAGC GCACCCTGCA GGTCGGCGGA
TCGATCCTGT GGCGCGTGGT GGGCGAGCTT TCTGGTGCGA GTGAAGCCAA GATGACCGCG
GCTTACAAAC GGCACGGCGA TCTTGGCGAT GCCACGCTCG GGGTGTTGCG CGGGGTTGCG
CCTGAAGAAA GCACTCTCAC GCTGAAAGAA GTGGATTACA TGTTCCAACA GATTGCGGCT
GTCAGCGGTC CGGCAGCGAA ATCGCGGCTG ATCGTGACGC TGCTCGCGCG GGCGACGGCA
CCGGAAGCGA AGTACCTCGT CAAGTTCATC ACCGGCGAGT TGCGCATAGG ATTGAAGGAA
AGCCAAGTCG AAGAGGCGAT CGCCAAGGCG TATGGTCGCG AACTCGCGGA AGTGCGGCGC
GCCAACATGC TGGTCGGGGA CATCGGCGAA ACGCTGGTGC TGGCCGCGCA TGACAAGCTT
GCGACTGCGC GGATGCGACT CTTCCACCCG ATGGGATTCA TGCTCGCCAC TCCGGCAGAG
AGTGCGAACG AAGCCTTCGC GGAATTCGAG CACGCCATCG TCGAAGACAA GTACGACGGT
ATCCGCGCGC AAGCGCATAT CTCGCGAGAC AAAGTGCGGA TCTTTTCGCG AACTCTCGAC
GACATCACCG ATTCCTTCCC CGAACTCATT CCCGCCCTGA AAGCGATCGA GCACGAAGTC
ATCCTCGATG GCGAAATCCT CGCGTGGCGC TGCGGCCAGG CGCTAGCGTT CAGCGAATTG
CAGAAGAGAC TTGGGCGCAA GAACGTTTCG GCAGCCATGC AGCGCGAAGT GCCGGTGAGC
TACGTCACAT TCGATTTGCT TTATGCGAAG GGCCAGTTGG TGATTGATCG TCCGCTGCAG
GAGCGCGCGG CGATGCTGGA TGGAATCTTT TCGGAAGGTG CACCGCGATT GGTCAACGTT
GATCCGCACG GTCAGGCGTC ATTGATGTTT GCCGAAGTTA CACCAGAGCA AAGGGTATTG
CGCGCACCGC AGGCACGAGC GGATTCCCCC GAAGAGCTCG ATCGCCTGTT CGCAGCCGCT
CAGGAACGCG GCAACGAAGG GCTGATGATC AAAGACATTC ATTCGGCCTA CGCGGTCGGG
CGCCGCGGCA AATCGTGGCT GAAGCTGAAG CGCGAGCTGG CAATGCTGGA CGTCGTTGTG
ACCGCAGTCG AACTCGGACA CGGCAAACGG GCGGGCATCC TGAGCGACTA CACCTTCGCC
GTGCGCGGTG GTGAAGAACT ATTGAACATC GGCAAAGCCT ACTCCGGGCT TACGGACAAA
GAAATCGCGG AGATGGACGA GTGGTTTCGG GCTCACACCC TGGTCGATCA TGGCTTCGTT
CGTGAGGTCG AGCCCAAGAT CGTCATCGAA GTGGCGTTCA ACGCGGTGAT GAAGTCCGAT
CGCCATGCCA GCGGATTTGC CCTGCGGTTC CCGCGAATTC TGCGCATTCG CGATGATAAG
GGGGGCGAAG AGATCGACAC GCTGGAGCGT GCTGAGGAGA TTTACCGGTC GCAGTTTCAC
CAGCGCACGC GACGTATTCA CCGCGGAGAC ACAAAGGCGC AGAGTTCTTA G
 
Protein sequence
MQLLAQTCEA IAATSKKTEK IAIVATYLQS RTVPEAALST LFLSGRTFAA HEERTLQVGG 
SILWRVVGEL SGASEAKMTA AYKRHGDLGD ATLGVLRGVA PEESTLTLKE VDYMFQQIAA
VSGPAAKSRL IVTLLARATA PEAKYLVKFI TGELRIGLKE SQVEEAIAKA YGRELAEVRR
ANMLVGDIGE TLVLAAHDKL ATARMRLFHP MGFMLATPAE SANEAFAEFE HAIVEDKYDG
IRAQAHISRD KVRIFSRTLD DITDSFPELI PALKAIEHEV ILDGEILAWR CGQALAFSEL
QKRLGRKNVS AAMQREVPVS YVTFDLLYAK GQLVIDRPLQ ERAAMLDGIF SEGAPRLVNV
DPHGQASLMF AEVTPEQRVL RAPQARADSP EELDRLFAAA QERGNEGLMI KDIHSAYAVG
RRGKSWLKLK RELAMLDVVV TAVELGHGKR AGILSDYTFA VRGGEELLNI GKAYSGLTDK
EIAEMDEWFR AHTLVDHGFV REVEPKIVIE VAFNAVMKSD RHASGFALRF PRILRIRDDK
GGEEIDTLER AEEIYRSQFH QRTRRIHRGD TKAQSS