Gene Mlg_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0804 
Symbol 
ID4270635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp903716 
End bp907444 
Gene Length3729 bp 
Protein Length1242 aa 
Translation table11 
GC content73% 
IMG OID638125555 
ProductDNA helicase/exodeoxyribonuclease V, beta subunit 
Protein accessionYP_741648 
Protein GI114319965 
COG category[L] Replication, recombination and repair 
COG ID[COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) 
TIGRFAM ID[TIGR00609] exodeoxyribonuclease V, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.888497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA GTGCGCCCCG ACCGCTGGAG TTGCTGCGCC TGCCGCTGCA CGGCAGCCGC 
CTGATCGAGG CCAGCGCCGG CACCGGTAAG ACCTTCACCA TCGCCGCCCT CTACCTGCGC
CTGGTCCTTG GGCACGGCGA GCAGCGCGCG GGCGGCGGGC CGCTGGTGCC ACCGCAGATC
CTGGTGGTCA CCTTTACCGA GGCGGCCACC CGGGAGTTGC GCGACCGCAT CCGCGAACGG
CTCAGCCAGG CTGCAGCGGC CTTCCGCGAT CCGGCCCGGT ACCCGGACGA TCCGGTGCTG
CCGGCACTGC GCGCTGAATA TGACGAGCAC GAGCGGCCCG CCATGGCGCG CCGACTGGAG
TTGGCCGCTG AGTGGATGGA CGAGTCGGCG GTCTCCACCA TCCACAGCTG GTGCTACCGC
ATGCTGCGCG AGCACGCCTT CGACAGCGGT AGCCTCTTCA CCCAGGACCT GGAGGCAGAT
CAGACCGCGC TCCTGGCCGA GGCCGTGCGC GATTACTGGC GCAGCTTCCT CTATCCATTG
GCGCCGGAGG CCCTGGTCCT GGCGCGCCGT GCGCTGGGTG ACGACCCCGA GGCCCAGCGG
CAGAAGGTCC GGCCGCTGAT CGGCCAGGCG GCCCCGGTGG CGCCCGGACT GGACTCGCCG
GAGGGCTTTG CCGGGGTGCT CGACGACCGG TTGCAGGCCC TGGAGCGCCT CAAACGGCCC
TGGCGCGAGC AGTTCGAGGC CCTGGAGGCG CAGTTCCATG CCCTGCGCAA GGCGGTGCTC
AACGGCCGGA AGTACCAAAA GCCCGAGGCC CTGATGGCCG CCATGCGGGC CTGGGCGGAG
GACCCCGCGG CCCTGCAGCC GGAGCCGGTG GGCAGCACCA GTGTGCTGGG GCGGCTCTGC
GCCCGGGGGC TCGCCGCCGG TTGCCGGAAG GGACAGACGC TGCCGGAGGA CCTGCACCCC
GGCTTCACGG CGTTGGATGA CTACGAGCAC CTCTGCCAAC TGCCGGAGGC GGACCTGCTC
AATGCCGCGG CCAACTGGAT CGGTCAACGC TTCCACCAGG CCCAGCGGCG ACGGGCGCAG
ATGGGCTTCG ATGACCTGCT CACCCGGCTG GACCAGGCGC TGACCCAGGG CGAGGCGGGT
GAGCGTCTGG CCGCCACCCT GCGCCGGCAG TTCCCGGTGG CCCTGATCGA TGAGTTCCAG
GACACCGACC CGGTGCAGTA CCGCATCTTC CAGCGGGTCT ACCGGGTGGC GGACAACCCC
CGGGAACAGG CCCTGCTGCT CATCGGCGAC CCCAAACAGG CCATCTACAG CTTCCGCGGC
GCCGACATCC ACAGTTACCT GCAGGCGCGG CGGGCCACTG CTGGCCGCCA CTACACCCTG
CCGCGCAACT TCCGCTCCAC CGGGGCCATG GTGCGCGGGG TCAACCGGCT GTTCCAGCAT
GCCGAGCAGG CCTGGCCCGA GGGCGCCTTC CTGTTCCGGG TGGCGGAGGA AAACCCCGTG
CCCTTCCTGC CGGTGGCGGC CCAGGGCCGC GCGGAGGCGC TGGAGGTGCA CGGCGCCGCC
CAGCCGGCCC TCACTCTCTG GTGGGAGGAC GAGGGCGAGC CGGTGGCCGG GAAGCACTAC
CTGCCGCGCC AGGCGGCGGC CTGTGCCAGC CACATCGTGG CGTTGCTGAA CGCCGGTGCT
GCGCGCCAGG CCGGGTTCCG CGACCCGGAT GGAGGGTTGG TGTCGCTGCG GCCCCGGGAC
ATGGCGGTGC TGGTGCGCGA CTACAACGAG GCGCGCGCCA TCCAGCAGGC CCTGGCCCGG
CGCCGGGTGC GCAGCGTTTA CCTGTCCGAC CGGGAGTCCG TCTACCGCAC CGATGCGGCC
GGGGACCTGC TGCGCTGGCT GCGCGCCTGC GCCGAGCCCA CCGTGGAGCG GCTGTTACGG
GCCGCCCTGG CCACGCCCAC CCTGGGCCTG CCGGTGGCCG AGCTGCACCG CCTGACCGCC
GACGAGCTGC TTTGGGAGCG CCGGGTCGAG CAGTTCCGCG GATACCGCCG GCTCTGGCAG
GGGCAGGGGG TCCTGCCCAT GCTGCGCCGG CTGCTGCACG ATTTCGCGGT GCCGGCGCGG
CTGGTGGCGC GAGCGGACGG CGAACGGGTG CTGACCAACC TGCTGCATCT CAGCGAGCTG
CTGCAGCAGG CGGCCGCCGA ACTGGACGGC GAGCAGGCCC TGATCCGCCA CTTGGTGGAG
CATCGCAGCG GCAACGGCGA GGGAGGGGAT GAGCAGATCC TGCGCCTGGA GAGCGACGAG
GACCTGGTCA AGGTGGTCAC TGTGCACAAA GCCAAGGGCC TGGAGTACCC GCTGGTCTTC
CTCCCGTTCA TCTGCACCTT CAAACCGGTG GACGGGAAGC GCCTTCCCTA CCTCGACCGC
ACCACCGGCG GTGCCCCGCG CTGGTTGTTC GAGATGGACG AGGAGACCCG CGCGCGCGCC
GACCGCGCCC GCCTGGCCGA GGACCTGCGC CTGCTCTACG TGGCCATGAC CCGGGCCCGC
CACGCCTGCT GGCTGGGCCT GGCCCCCGTC AAGCGGGGCA CCCGCAAGTC CAACCAACTG
CACCAGAGCG CGGTGGGCTA CCTGCTGGCT GGCCCGGAGG GAGTGGTGGA CGGGGGGCTG
GAGGCCGCCC TGCTGCGGGC GCGCGGCGAG GAGTCGGCGA TTGCGGTCAC CCCGCTGCCG
GCGGCCAGCG ACGAGGGCTA TCGCGGTCCG GAGCACCGCC CGGCCCTCGG TCCGGCCCGG
GTGCCGCGGC GGCCGGCCTA CGAGCACTGG TGGATCGCCA GTTACAGCGC CCTGTCCCTG
CACGCGGAAC CCGCGGCCGG TGAGGCGGTG GGGGAGCCGG CCCGGGCGAG TGCATCGACG
GCGCCGGAGA GCGTGGGCCA GGAGATCATC CGCGAGACGG TCGCCGAGGC CGGAGAGAGC
GCCGGGCCCG ACAGCGCGGC CCTCGCATTG CACCGCTTCC CGCGCGGCCC GGGGCCGGGG
ACCTTCCTCC ACGGCCTGCT GGAATGGGCC GCGGACCAGG GGTTTGCCGG CCTGGCCGAA
CAGCCCGCCC TGTTACAGCG GGAAGTGGCC CGCCGCTGTC GCCGGCGCGG CTGGTCCGAG
TGGGCCGGGC CGGTGGCCGA CTGGCTGCTG CGTCTGCTGC AGGCCGACTA CCCCCTGCCG
GAGGGTGGGG CGCTGCGCCT CACGGCCCTG CCCAGCTATC AGCCGGAGCT GGAATTCCTG
TTCCAGACCC GCTGGTTGAA TGCCGCCCGC CTGGACCGGG CGGTGACCGC CGCGACCCTG
GACGCGGCGC CGCGGCCACC GGCCCGGGCG GCGCTGCTGA ACGGGATGCT CAAGGGCTTT
ATCGACCTGG TCTGCGAGCA GGACGGGCGC TACTACGTGG CCGATTACAA GTCCAACTGG
CTGGGCCAGG GGCTGGAGGA CTATGCGCCG GAGGCCCTGC GGGCGGCCGT GCTCGCCCGC
CGCTACGACC TGCAACTGGT GCTCTATACC CTGGCGCTGC ACCGGCTGCT ACGGGCGCGA
CTGCCCGGCT ACGACTACGA GCGCCACGTG GGCGGGGCGG TGTACCTCTT CCTGCGCGGC
CTGGATGCCC CGGGTCGGGG TCTGTTCCAC TGCCGGCCAC CGCGGGCGCT GATCGAACAA
CTGGACGACT GGCTGCAGCA GGGCCCCGAC CGGCCCGATG TGCAGGCCAC GGGGAGTGAT
GATGCCTGA
 
Protein sequence
MSQSAPRPLE LLRLPLHGSR LIEASAGTGK TFTIAALYLR LVLGHGEQRA GGGPLVPPQI 
LVVTFTEAAT RELRDRIRER LSQAAAAFRD PARYPDDPVL PALRAEYDEH ERPAMARRLE
LAAEWMDESA VSTIHSWCYR MLREHAFDSG SLFTQDLEAD QTALLAEAVR DYWRSFLYPL
APEALVLARR ALGDDPEAQR QKVRPLIGQA APVAPGLDSP EGFAGVLDDR LQALERLKRP
WREQFEALEA QFHALRKAVL NGRKYQKPEA LMAAMRAWAE DPAALQPEPV GSTSVLGRLC
ARGLAAGCRK GQTLPEDLHP GFTALDDYEH LCQLPEADLL NAAANWIGQR FHQAQRRRAQ
MGFDDLLTRL DQALTQGEAG ERLAATLRRQ FPVALIDEFQ DTDPVQYRIF QRVYRVADNP
REQALLLIGD PKQAIYSFRG ADIHSYLQAR RATAGRHYTL PRNFRSTGAM VRGVNRLFQH
AEQAWPEGAF LFRVAEENPV PFLPVAAQGR AEALEVHGAA QPALTLWWED EGEPVAGKHY
LPRQAAACAS HIVALLNAGA ARQAGFRDPD GGLVSLRPRD MAVLVRDYNE ARAIQQALAR
RRVRSVYLSD RESVYRTDAA GDLLRWLRAC AEPTVERLLR AALATPTLGL PVAELHRLTA
DELLWERRVE QFRGYRRLWQ GQGVLPMLRR LLHDFAVPAR LVARADGERV LTNLLHLSEL
LQQAAAELDG EQALIRHLVE HRSGNGEGGD EQILRLESDE DLVKVVTVHK AKGLEYPLVF
LPFICTFKPV DGKRLPYLDR TTGGAPRWLF EMDEETRARA DRARLAEDLR LLYVAMTRAR
HACWLGLAPV KRGTRKSNQL HQSAVGYLLA GPEGVVDGGL EAALLRARGE ESAIAVTPLP
AASDEGYRGP EHRPALGPAR VPRRPAYEHW WIASYSALSL HAEPAAGEAV GEPARASAST
APESVGQEII RETVAEAGES AGPDSAALAL HRFPRGPGPG TFLHGLLEWA ADQGFAGLAE
QPALLQREVA RRCRRRGWSE WAGPVADWLL RLLQADYPLP EGGALRLTAL PSYQPELEFL
FQTRWLNAAR LDRAVTAATL DAAPRPPARA ALLNGMLKGF IDLVCEQDGR YYVADYKSNW
LGQGLEDYAP EALRAAVLAR RYDLQLVLYT LALHRLLRAR LPGYDYERHV GGAVYLFLRG
LDAPGRGLFH CRPPRALIEQ LDDWLQQGPD RPDVQATGSD DA