Gene EcSMS35_2566 details


Gene Information

Locus tag: EcSMS35_2566
Symbol: ligA
ID: 6145611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2619139
End bp: 2621154
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 54%
IMG OID: 641617437
Product: NAD-dependent DNA ligase LigA
Protein accession: YP_001744602
Protein GI: 170680446
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID: [TIGR00575] DNA ligase, NAD-dependent
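
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that the listed gene length supports), and the function names are hypothetical, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2619139, 2621154)
print(length)                     # 2016
print(protein_length_aa(length))  # 671
```

Both results agree with the Gene Length and Protein Length fields of this record.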


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000140501
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCAA TCGAACAACA ACTGACAGAA CTGCGAACGA CGCTTCGCCA TCATGAATAT 
CTTTATCATG TGATGGATGC GCCGGAAATT CCCGACGCGG AATACGACAG GTTGATGCGC
GAACTGCGCG AGCTGGAAAC CAAACATCCA GAACTGATTA CGCCTGATTC GCCTACCCAA
CGTGTAGGTG CTGCGCCGCT GGCGGCTTTC AGCCAGATCC GCCATGAAGT GCCAATGCTG
TCGCTGGATA ACGTTTTTGA TGAAGAAAGC TTTCTTGCCT TCAACAAACG AGTGCAGGAC
CGTCTGAAAA GCAACGAGAA AGTCACCTGG TGCTGTGAGC TGAAGCTGGA TGGTCTTGCC
GTCAGTATTC TGTATGAAAA TGGTGTTTTA GTCAGTGCCG CGACGCGTGG CGATGGCACC
ACCGGGGAAG ATATCACGTC TAATGTGCGT ACTATTCGCG CCATTCCGCT GAAGCTACAC
GGAGAGAATA TCCCGGCGCG TCTGGAAGTG CGTGGTGAAG TGTTCCTGCC GCAGGCGGGG
TTCGAAAAGA TTAACGAAGA TGCGCGACGC ACGGGCGGGA AAGTTTTTGC TAACCCACGT
AATGCGGCAG CTGGTTCACT GCGTCAGCTT GATCCGCGTA TTACAGCGAA GCGACCGCTC
ACTTTTTTCT GCTATGGCGT TGGTGTTCTG GAAGGTGGCG AGCTGCCGGA TACTCATCTT
GGCCGCTTAC TGCAATTTAA AAAGTGGGGG TTGCCGGTCA GCGATCGGGT AACGCTTTGT
GAATCGGCGG AAGAAGTGCT GGCGTTCTAT CACAAAGTGG AAGAAGACCG CCCGACGCTG
GGCTTTGATA TCGACGGCGT GGTGATTAAA GTCAACTCAC TGGCACAGCA GGAGCAGCTT
GGCTTTGTCG CGCGTGCCCC GCGCTGGGCG GTAGCGTTTA AATTTCCGGC GCAGGAACAG
ATGACCTTTG TGCGTGACGT CGAGTTTCAG GTTGGGCGTA CTGGCGCGAT TACGCCTGTT
GCGCGTCTGG AACCTGTCCA TGTTGCTGGC GTGCTGGTGA GTAACGCAAC CTTACACAAT
GCGGATGAAA TCGAACGTCT TGGTTTACGC ATTGGCGATA AAGTGGTGAT TCGCCGCGCT
GGCGACGTGA TCCCGCAGGT GGTTAACGTC GTGCTTTCTG AACGCCCGGA AGATACCCGT
GAGGTTGTAT TCCCGACGCA TTGTCCGGTA TGTGGTTCTG ACGTTGAGCG TGTGGAAGGT
GAAGCGGTTG CCCGCTGTAC CGGTGGCCTG ATTTGCGGTG CGCAGCGTAA AGAGTCGCTG
AAACACTTTG TTTCCCGCCG TGCGATGGAT GTTGACGGAA TGGGCGACAA AATCATCGAT
CAGCTGGTTG AAAAAGAATA TGTCCACACT CCGGCGGATC TGTTCAAACT CACCGCAGGC
AAACTGACCG GACTGGAGCG TATGGGGCCA AAATCGGCAC AAAACGTGGT TAACGCGCTG
GAAAAAGCGA AAGAAACCAC CTTTGCTCGC TTCCTCTATG CACTTGGCAT CCGTGAAGTC
GGCGAGGCCA CCGCAGCAGG TCTGGCGGCA TATTTCGGCA CGCTGGAAGC GCTGGAAGCC
GCTTCGATTG AAGAGCTGCA AAAGGTACCT GATGTTGGTA TTGTCGTTGC CTCCCACGTT
CACAACTTCT TTGCCGAAGA AAGCAACCGC AATGTCATCA GCGAGCTGTT GGCGGAAGGT
GTTCACTGGC CTGAGCCGAT CGTTATCAAT GCGGAAGAGA TTGACAGTCC GTTTGCTGGT
AAAACCGTGG TGCTTACGGG CAGCTTAAGC CAGATGTCGC GTGATGACGC TAAAGCTCGA
CTGGTCGAAC TGGGGGCGAA AGTCGCGGGC AGCGTGTCGA AGAAAACCGA TCTGGTGATA
GCGGGTGAAG CTGCAGGATC TAAACTGGCG AAGGCGCAGG AACTGGGCAT TGAAGTCATC
GACGAAACGG AAATGCTGCG TTTGCTGGGT AGCTGA
 
Protein sequence
MESIEQQLTE LRTTLRHHEY LYHVMDAPEI PDAEYDRLMR ELRELETKHP ELITPDSPTQ 
RVGAAPLAAF SQIRHEVPML SLDNVFDEES FLAFNKRVQD RLKSNEKVTW CCELKLDGLA
VSILYENGVL VSAATRGDGT TGEDITSNVR TIRAIPLKLH GENIPARLEV RGEVFLPQAG
FEKINEDARR TGGKVFANPR NAAAGSLRQL DPRITAKRPL TFFCYGVGVL EGGELPDTHL
GRLLQFKKWG LPVSDRVTLC ESAEEVLAFY HKVEEDRPTL GFDIDGVVIK VNSLAQQEQL
GFVARAPRWA VAFKFPAQEQ MTFVRDVEFQ VGRTGAITPV ARLEPVHVAG VLVSNATLHN
ADEIERLGLR IGDKVVIRRA GDVIPQVVNV VLSERPEDTR EVVFPTHCPV CGSDVERVEG
EAVARCTGGL ICGAQRKESL KHFVSRRAMD VDGMGDKIID QLVEKEYVHT PADLFKLTAG
KLTGLERMGP KSAQNVVNAL EKAKETTFAR FLYALGIREV GEATAAGLAA YFGTLEALEA
ASIEELQKVP DVGIVVASHV HNFFAEESNR NVISELLAEG VHWPEPIVIN AEEIDSPFAG
KTVVLTGSLS QMSRDDAKAR LVELGAKVAG SVSKKTDLVI AGEAAGSKLA KAQELGIEVI
DETEMLRLLG S
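
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the following strips that grouping, computes a GC fraction, and translates codons up to the first stop. The helper names are hypothetical. The codon map is the standard assignment, which translation table 11 shares for amino acids; table 11 differs in the start codons it permits, and that does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGAATCAA TCGAACAACA ACTGACAGAA CTGCGAACGA CGCTTCGCCA TCATGAATAT"
last_line = "GACGAAACGG AAATGCTGCG TTTGCTGGGT AGCTGA"
print(translate(first_line))  # MESIEQQLTELRTTLRHHEY
print(translate(last_line))   # DETEMLRLLGS
```

The two printed strings match the first twenty and last eleven residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.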