Gene Mlg_0681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0681 
Symbol 
ID4268476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp754660 
End bp756681 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content70% 
IMG OID638125430 
ProductDNA ligase, NAD-dependent 
Protein accessionYP_741525 
Protein GI114319842 
COG category[L] Replication, recombination and repair 
COG ID[COG0272] NAD-dependent DNA ligase (contains BRCT domain type II) 
TIGRFAM ID[TIGR00575] DNA ligase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.295891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.841285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAAA CGGGAACCGA CCCGCAGGCC CGCATCGAGG CCCTGCGCCG GGAGATCCGC 
GAGCACGACC ACCGCTATTA CGTGCTGGAT GCGCCGGTGA TCGCGGATGC CGAGTATGAC
GCCCTGATGG CCGAGCTCCA GGCCCTGGAG GCGGAGCACC CCGAGCTGAT CACCCCGGAC
TCCCCCTCCC AGCGGGTGGC CGGGCGCCCC GCCGAGGGGT TCGGCGAGGT GACGCATGCC
GAGCCTATGC TCTCGCTGGA CAACGCCTTC GAGGAGGCGG ACCTGGCGGA GTTCGACCGC
CGGGTACGCC AGGCCCTGGG CCTCGACCCC GTGGTCTATG TGGCCGAGCC CAAGCTGGAC
GGGCTGTCGG TGAGCATCCG CTATGAGGAC GGCCGGTTGG TGCGGGCCGG TACCCGCGGC
GACGGCCGGG TGGGGGAGGC GATCACCGAG AACGTCCGCA CCATCCGCAG TGTGCCCCTG
CGTCTGCGCG GCGAGGGCTG GCCACCGGTG ATGGAGGTGC GCGGCGAGGT GGTGATCCGC
CGCGCCGATT TCGAACGCCT GAACGAACAG CGCCTGGCGG ACGGCGAGCG CCCCTTCGCC
AATCCGCGTA ATGCCGCCGC GGGCAGCCTG CGTCAGTTGG ATCCGCGGAT CACCGCCCGC
CGCCGACTCA CCTTCTTCAC CTTCGGGGTC GCCGCCGCCG GGCGCCTGGC CGCCAGCCAC
CATGAGGTGC TGGACAAGCT GGCCGGGTGG GGCTTTCGGG TCAATGAGCG GGTGGAGCGG
GTGCGCGGCC TTGACGGCTG CCGCGAGTAT TACCAGCGAC TGCTGGCGGA TCGCGACGAA
CTCTCCTTCG AGATCGATGG GGTGGTCTAC AAGGTGGATG ACCTGGATGC CCGCGAGGAG
CTGGGCTTCA CCGCCCGCGC CCCGCGCTGG GCCATCGCCT GGAAACTGCC GGCGCAGGAG
GCGACCACGG TGGTGCGCCG GATCCTGCCG TCGGTGGGGC GCACCGGGGC GATCACCCCG
GTGGCGGAGC TGGAGCCGGT GGGGGTGGGC GGGGTCACGG TGAGCCGCGC CACGCTACAC
AACCTGGATG AGGTGCGGCG TAAGGATGTG CGTAAGGGCG ATACGGTGAT GGTGCGCCGG
GCCGGCGACG TGATCCCGGA GATTACGGCG GTGGTGACCG AGAAGCGGCC CGAAGGGGCG
GAGCCCTGGG CGATGCCGGC GGAGTGCCCG GTCTGTGGCT CGGAGGTGTT GCGCCTGGAT
GACGAGGCGG TGCACCGCTG CATGGGCGGT CTCTATTGCC CGGCGCAGCG CGAGGGCGCC
CTGCTGCATT TTGCCTCGCG CAAGGCCCTG GATATCGACG GCCTGGGCGA GAAGGTCGTC
AGTCAGTTGG TGGAGCGGGG CATGGTGCGC TCGCCGGCGG ATCTGTTCAC TCTGGAGCAC
TGCCAGCTCG CCGGCCTGGA GCGGATGGGG GACAAGTCAG CCGACAACCT GGTCGCGGCG
CTGGATAAGG CGCGGCGGAC AACGCTGCCG CGGTTTCTGT ATGCGCTGGG GATCCAGCAC
GTGGGGGAGG TGACCGCGCG GCGGCTGGCG GAGCACTTCG GGTCGCTGGA GGCGATCATG
AATGCCGATG AGTCGGCGCT GGCGGAGACC CCGGACGTGG GCCCGGTGGT GGCGCAGGCG
ATTGCCCATT TCTTTGCCGA GCCGCATAAC CGTGAGGTGG TGCAGGCGTT GCGTGCGGCC
GGTGTCACCT GGGAGGAGGT GGATCCGGCG GAGCGCGGCG AGCAGCCGCT GGCCGGCAGG
ACCTTTGTCC TGACCGGGAC GCTCTCGGGG ATGACCCGGG ATGAGGCGAA GGCGGCCCTG
GAGGCGTTGG GGGCGCGGGT GAGTGGCAGT GTCTCGAAGA AGACGGACTA TCTGGTGGCC
GGGGAGAAGG CGGGGAGCAA GCTGGCCAAG GCGGAGTCGC TGGGGGTGGA GGTGTTGGAC
GAGCAGGCCT TGCAGGCGCT GCTGCAGGAG CATGGTCGCT GA
 
Protein sequence
MAQTGTDPQA RIEALRREIR EHDHRYYVLD APVIADAEYD ALMAELQALE AEHPELITPD 
SPSQRVAGRP AEGFGEVTHA EPMLSLDNAF EEADLAEFDR RVRQALGLDP VVYVAEPKLD
GLSVSIRYED GRLVRAGTRG DGRVGEAITE NVRTIRSVPL RLRGEGWPPV MEVRGEVVIR
RADFERLNEQ RLADGERPFA NPRNAAAGSL RQLDPRITAR RRLTFFTFGV AAAGRLAASH
HEVLDKLAGW GFRVNERVER VRGLDGCREY YQRLLADRDE LSFEIDGVVY KVDDLDAREE
LGFTARAPRW AIAWKLPAQE ATTVVRRILP SVGRTGAITP VAELEPVGVG GVTVSRATLH
NLDEVRRKDV RKGDTVMVRR AGDVIPEITA VVTEKRPEGA EPWAMPAECP VCGSEVLRLD
DEAVHRCMGG LYCPAQREGA LLHFASRKAL DIDGLGEKVV SQLVERGMVR SPADLFTLEH
CQLAGLERMG DKSADNLVAA LDKARRTTLP RFLYALGIQH VGEVTARRLA EHFGSLEAIM
NADESALAET PDVGPVVAQA IAHFFAEPHN REVVQALRAA GVTWEEVDPA ERGEQPLAGR
TFVLTGTLSG MTRDEAKAAL EALGARVSGS VSKKTDYLVA GEKAGSKLAK AESLGVEVLD
EQALQALLQE HGR