Gene Mlg_1088 details


Gene Information

Locus tag: Mlg_1088
Symbol:
ID: 4270033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1268146
End bp: 1270173
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 67%
IMG OID: 638125840
Product: excinuclease ABC subunit B
Protein accession: YP_741930
Protein GI: 114320247
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
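
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, since the record does not state its coordinate convention) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1268146
end_bp = 1270173
gene_length_bp = 2028
protein_length_aa = 675

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 2028 676 675
print(span_bp == gene_length_bp, expected_protein_aa == protein_length_aa)
```

Under these assumptions the reported gene length (2028 bp) and protein length (675 aa) agree with the start and end positions.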


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCCA GGTTCCAGCT CCAAAGCCGC TTCCAACCCG CCGGCGACCA GCCCACGGCC 
ATCGCCTCGC TGGTCGAGGG CCTCAACGCC GGCGAACGCC ACCAGACCCT GCTCGGCGTC
ACCGGCTCCG GCAAAACCTT CACCATGGCC AACGTCATCC AGGCCGTGCA GCGGCCTGCC
ATGGTCCTGG CCCCCAACAA GACCCTCGCC GCCCAACTCT ACGGCGAGAT GCGCGAGTTC
TTCCCCAAGA ACCGCGTCGA GTACTTCGTC TCCTACTACG ACTACTACCA ACCCGAGGCC
TACGTCCCGG CCTCCGACAC CTTCATCGAG AAGGACGCCT CGGTGAACGA ACACATCGAA
CAGATGCGCC TGTCCGCCAC CAAGGCCATC CTGGAACGGC AGGACACCAT CATCGTCGCC
TCCGTCTCCT CCATCTACGG CCTGGGCGAC CCCCAGTCCT ACATGGCCAT GCTGCTGCAC
CTGGTGCGCG GCGAGACCAT CGACCAGCGC CAGATCCTGC GCCGCCTGGC CGAAATGCAA
TACACCCGCA ACGACATGGA CTTCACCCGC GGCACCTACC GCGCCCGGGG CGAGGTCATC
GACATCTTCC CGGCCGACGC CGAGGACGAG GCCGTGCGGG TGGAACTGTT CGACGACGAG
ATCGAAGAGC TGTCGCACTT CGACCCCCTC ACCGGCGAAG TGCTGCGCCG CGTGCCGCGC
CTGACCATCT ACCCCAAGAC CCACTACGTC ACCCCCCGCG GCCAGGTGCT GCGCGCCATC
GACGCCATTA AGGAGGAACT GCGCGACCGG CTGGCGGAGC TGCGCGCCGC CGACAAACTG
GTGGAGGCCC AGCGCTTGGA GCAGCGCACC TTGTTCGATC TGGAGATGAT GCACGAGCTG
GGCTACTGCA ACGGCATCGA GAACTACTCC CGGCACCTCT CCGGCCGCGC CCCCGGTGAG
CCGCCGCCCA CCCTGTTCGA CTACGTGCCT GAGGACGCCG TGGTCTTCAT CGACGAATCC
CACGTCACCA TTCCGCAGTT GGGCGGCATG TACAAAGGCG ACCGCAGCCG CAAGCAGACC
CTGGTGGACT ACGGCTTCCG CCTGCCCTCG GCCCTGGACA ACCGGCCGCT GAAATTCGAG
GAGTGGTTGC GCCTGGCACC CCAGTGCGTG CTGGTCTCCG CCACCCCCGG CCCCTGGGAG
CATGAGCACA GCCAGCGGGT GGTGGAGCAG GTGGTGCGCC CCACCGGCCT GCTCGATCCG
GAAGTGGAGG TGCGCCCGGC GCTCAGCCAG GTGGACGACG TCTACGGCGA GATCACCGAG
CGCGCCCGAC GCGACGAGAG GGTGCTGGTC ACCACCCTGA CCAAGCGCAT GGCCGAGGAC
CTGACCGAAT ACCTGCACGA GAACGGCGTG CGGGTGCGCT ACCTGCACTC CGATGTGGAC
ACCGTGGAGC GCACCGAGAT CATCCGCGAT CTACGCCTGG GCGAGTTTGA CGTGCTGGTG
GGCATCAACC TGCTGCGCGA GGGCCTGGAC ATCCCCGAGG TGTCGCTGGT GGCCATTTTG
GATGCGGACA AGGAGGGGTT CCTGCGTTCC GAGCGCTCGC TGATTCAGAC CATCGGCCGG
GCGGCGCGTA ACCTTGGCGG CAAGGCCATC CTTTACGGTG ATGAGATCAC CAACTCCATG
CGCCGGGCCA TTGACGAGAC CGAGCGGCGC CGCGCCAAGC AGCAGGCCCA CAACGAGGCC
CATGGCATCA CCCCGAAGGG GGTGCGCAAG GACGTGGCCG ATATCATGGA GCGGGGCGGC
GCCCCGATAC CGGGCGCGCC GCGGGGCCGT ATCGACAAGG TGGCGGAAGA GGCCGCCAAG
TACGGCCGTT ACACCCCGGC CGAGGCGGTC AAGCGCATCA AGCAGCTGGA GAAACAGATG
CGCGAGCACG CCCGCAACCT GGAGTTTGAA GAGGCCGCCC AGCTACGCGA CGAGATCAAG
CGCCTGGAGC GCTACGCCCT GGGCCGACCC GACGTGGCCA GCGCCTGA
 
Protein sequence
MTARFQLQSR FQPAGDQPTA IASLVEGLNA GERHQTLLGV TGSGKTFTMA NVIQAVQRPA 
MVLAPNKTLA AQLYGEMREF FPKNRVEYFV SYYDYYQPEA YVPASDTFIE KDASVNEHIE
QMRLSATKAI LERQDTIIVA SVSSIYGLGD PQSYMAMLLH LVRGETIDQR QILRRLAEMQ
YTRNDMDFTR GTYRARGEVI DIFPADAEDE AVRVELFDDE IEELSHFDPL TGEVLRRVPR
LTIYPKTHYV TPRGQVLRAI DAIKEELRDR LAELRAADKL VEAQRLEQRT LFDLEMMHEL
GYCNGIENYS RHLSGRAPGE PPPTLFDYVP EDAVVFIDES HVTIPQLGGM YKGDRSRKQT
LVDYGFRLPS ALDNRPLKFE EWLRLAPQCV LVSATPGPWE HEHSQRVVEQ VVRPTGLLDP
EVEVRPALSQ VDDVYGEITE RARRDERVLV TTLTKRMAED LTEYLHENGV RVRYLHSDVD
TVERTEIIRD LRLGEFDVLV GINLLREGLD IPEVSLVAIL DADKEGFLRS ERSLIQTIGR
AARNLGGKAI LYGDEITNSM RRAIDETERR RAKQQAHNEA HGITPKGVRK DVADIMERGG
APIPGAPRGR IDKVAEEAAK YGRYTPAEAV KRIKQLEKQM REHARNLEFE EAAQLRDEIK
RLERYALGRP DVASA
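
The relationship between the gene sequence and the protein sequence listed in this record can be illustrated with a short script. This is a minimal sketch, not the method used to generate the record: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for internal codons (table 11 differs mainly in its permitted start codons, which does not matter here because this gene starts with ATG). Only the first line of the gene sequence is used as input.

```python
# Standard codon assignments, ordered with T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGACCGCCA GGTTCCAGCT CCAAAGCCGC TTCCAACCCG CCGGCGACCA GCCCACGGCC"

print(translate(first_line))  # prints: MTARFQLQSRFQPAGDQPTA
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first two blocks of the protein sequence (MTARFQLQSR FQPAGDQPTA). Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field of the record.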