Gene Information |
Locus tag | Mlg_0804 |
Symbol | |
ID | 4270635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 903716 |
End bp | 907444 |
Gene Length | 3729 bp |
Protein Length | 1242 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638125555 |
Product | DNA helicase/exodeoxyribonuclease V, beta subunit |
Protein accession | YP_741648 |
Protein GI | 114319965 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains) |
TIGRFAM ID | [TIGR00609] exodeoxyribonuclease V, beta subunit |
|
|
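The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 903716
end_bp = 907444

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3729, matching "Gene Length"
print(protein_length)  # 1242, matching "Protein Length"
```

Under these assumptions the gene length is a whole number of codons, as expected for an unspliced CDS.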
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.888497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA GTGCGCCCCG ACCGCTGGAG TTGCTGCGCC TGCCGCTGCA CGGCAGCCGC CTGATCGAGG CCAGCGCCGG CACCGGTAAG ACCTTCACCA TCGCCGCCCT CTACCTGCGC CTGGTCCTTG GGCACGGCGA GCAGCGCGCG GGCGGCGGGC CGCTGGTGCC ACCGCAGATC CTGGTGGTCA CCTTTACCGA GGCGGCCACC CGGGAGTTGC GCGACCGCAT CCGCGAACGG CTCAGCCAGG CTGCAGCGGC CTTCCGCGAT CCGGCCCGGT ACCCGGACGA TCCGGTGCTG CCGGCACTGC GCGCTGAATA TGACGAGCAC GAGCGGCCCG CCATGGCGCG CCGACTGGAG TTGGCCGCTG AGTGGATGGA CGAGTCGGCG GTCTCCACCA TCCACAGCTG GTGCTACCGC ATGCTGCGCG AGCACGCCTT CGACAGCGGT AGCCTCTTCA CCCAGGACCT GGAGGCAGAT CAGACCGCGC TCCTGGCCGA GGCCGTGCGC GATTACTGGC GCAGCTTCCT CTATCCATTG GCGCCGGAGG CCCTGGTCCT GGCGCGCCGT GCGCTGGGTG ACGACCCCGA GGCCCAGCGG CAGAAGGTCC GGCCGCTGAT CGGCCAGGCG GCCCCGGTGG CGCCCGGACT GGACTCGCCG GAGGGCTTTG CCGGGGTGCT CGACGACCGG TTGCAGGCCC TGGAGCGCCT CAAACGGCCC TGGCGCGAGC AGTTCGAGGC CCTGGAGGCG CAGTTCCATG CCCTGCGCAA GGCGGTGCTC AACGGCCGGA AGTACCAAAA GCCCGAGGCC CTGATGGCCG CCATGCGGGC CTGGGCGGAG GACCCCGCGG CCCTGCAGCC GGAGCCGGTG GGCAGCACCA GTGTGCTGGG GCGGCTCTGC GCCCGGGGGC TCGCCGCCGG TTGCCGGAAG GGACAGACGC TGCCGGAGGA CCTGCACCCC GGCTTCACGG CGTTGGATGA CTACGAGCAC CTCTGCCAAC TGCCGGAGGC GGACCTGCTC AATGCCGCGG CCAACTGGAT CGGTCAACGC TTCCACCAGG CCCAGCGGCG ACGGGCGCAG ATGGGCTTCG ATGACCTGCT CACCCGGCTG GACCAGGCGC TGACCCAGGG CGAGGCGGGT GAGCGTCTGG CCGCCACCCT GCGCCGGCAG TTCCCGGTGG CCCTGATCGA TGAGTTCCAG GACACCGACC CGGTGCAGTA CCGCATCTTC CAGCGGGTCT ACCGGGTGGC GGACAACCCC CGGGAACAGG CCCTGCTGCT CATCGGCGAC CCCAAACAGG CCATCTACAG CTTCCGCGGC GCCGACATCC ACAGTTACCT GCAGGCGCGG CGGGCCACTG CTGGCCGCCA CTACACCCTG CCGCGCAACT TCCGCTCCAC CGGGGCCATG GTGCGCGGGG TCAACCGGCT GTTCCAGCAT GCCGAGCAGG CCTGGCCCGA GGGCGCCTTC CTGTTCCGGG TGGCGGAGGA AAACCCCGTG CCCTTCCTGC CGGTGGCGGC CCAGGGCCGC GCGGAGGCGC TGGAGGTGCA CGGCGCCGCC CAGCCGGCCC TCACTCTCTG GTGGGAGGAC GAGGGCGAGC CGGTGGCCGG GAAGCACTAC CTGCCGCGCC AGGCGGCGGC CTGTGCCAGC CACATCGTGG CGTTGCTGAA CGCCGGTGCT GCGCGCCAGG CCGGGTTCCG CGACCCGGAT GGAGGGTTGG TGTCGCTGCG GCCCCGGGAC ATGGCGGTGC TGGTGCGCGA CTACAACGAG GCGCGCGCCA TCCAGCAGGC CCTGGCCCGG 
CGCCGGGTGC GCAGCGTTTA CCTGTCCGAC CGGGAGTCCG TCTACCGCAC CGATGCGGCC GGGGACCTGC TGCGCTGGCT GCGCGCCTGC GCCGAGCCCA CCGTGGAGCG GCTGTTACGG GCCGCCCTGG CCACGCCCAC CCTGGGCCTG CCGGTGGCCG AGCTGCACCG CCTGACCGCC GACGAGCTGC TTTGGGAGCG CCGGGTCGAG CAGTTCCGCG GATACCGCCG GCTCTGGCAG GGGCAGGGGG TCCTGCCCAT GCTGCGCCGG CTGCTGCACG ATTTCGCGGT GCCGGCGCGG CTGGTGGCGC GAGCGGACGG CGAACGGGTG CTGACCAACC TGCTGCATCT CAGCGAGCTG CTGCAGCAGG CGGCCGCCGA ACTGGACGGC GAGCAGGCCC TGATCCGCCA CTTGGTGGAG CATCGCAGCG GCAACGGCGA GGGAGGGGAT GAGCAGATCC TGCGCCTGGA GAGCGACGAG GACCTGGTCA AGGTGGTCAC TGTGCACAAA GCCAAGGGCC TGGAGTACCC GCTGGTCTTC CTCCCGTTCA TCTGCACCTT CAAACCGGTG GACGGGAAGC GCCTTCCCTA CCTCGACCGC ACCACCGGCG GTGCCCCGCG CTGGTTGTTC GAGATGGACG AGGAGACCCG CGCGCGCGCC GACCGCGCCC GCCTGGCCGA GGACCTGCGC CTGCTCTACG TGGCCATGAC CCGGGCCCGC CACGCCTGCT GGCTGGGCCT GGCCCCCGTC AAGCGGGGCA CCCGCAAGTC CAACCAACTG CACCAGAGCG CGGTGGGCTA CCTGCTGGCT GGCCCGGAGG GAGTGGTGGA CGGGGGGCTG GAGGCCGCCC TGCTGCGGGC GCGCGGCGAG GAGTCGGCGA TTGCGGTCAC CCCGCTGCCG GCGGCCAGCG ACGAGGGCTA TCGCGGTCCG GAGCACCGCC CGGCCCTCGG TCCGGCCCGG GTGCCGCGGC GGCCGGCCTA CGAGCACTGG TGGATCGCCA GTTACAGCGC CCTGTCCCTG CACGCGGAAC CCGCGGCCGG TGAGGCGGTG GGGGAGCCGG CCCGGGCGAG TGCATCGACG GCGCCGGAGA GCGTGGGCCA GGAGATCATC CGCGAGACGG TCGCCGAGGC CGGAGAGAGC GCCGGGCCCG ACAGCGCGGC CCTCGCATTG CACCGCTTCC CGCGCGGCCC GGGGCCGGGG ACCTTCCTCC ACGGCCTGCT GGAATGGGCC GCGGACCAGG GGTTTGCCGG CCTGGCCGAA CAGCCCGCCC TGTTACAGCG GGAAGTGGCC CGCCGCTGTC GCCGGCGCGG CTGGTCCGAG TGGGCCGGGC CGGTGGCCGA CTGGCTGCTG CGTCTGCTGC AGGCCGACTA CCCCCTGCCG GAGGGTGGGG CGCTGCGCCT CACGGCCCTG CCCAGCTATC AGCCGGAGCT GGAATTCCTG TTCCAGACCC GCTGGTTGAA TGCCGCCCGC CTGGACCGGG CGGTGACCGC CGCGACCCTG GACGCGGCGC CGCGGCCACC GGCCCGGGCG GCGCTGCTGA ACGGGATGCT CAAGGGCTTT ATCGACCTGG TCTGCGAGCA GGACGGGCGC TACTACGTGG CCGATTACAA GTCCAACTGG CTGGGCCAGG GGCTGGAGGA CTATGCGCCG GAGGCCCTGC GGGCGGCCGT GCTCGCCCGC CGCTACGACC TGCAACTGGT GCTCTATACC CTGGCGCTGC ACCGGCTGCT ACGGGCGCGA CTGCCCGGCT ACGACTACGA GCGCCACGTG GGCGGGGCGG TGTACCTCTT CCTGCGCGGC CTGGATGCCC 
CGGGTCGGGG TCTGTTCCAC TGCCGGCCAC CGCGGGCGCT GATCGAACAA CTGGACGACT GGCTGCAGCA GGGCCCCGAC CGGCCCGATG TGCAGGCCAC GGGGAGTGAT GATGCCTGA
|
Protein sequence | MSQSAPRPLE LLRLPLHGSR LIEASAGTGK TFTIAALYLR LVLGHGEQRA GGGPLVPPQI LVVTFTEAAT RELRDRIRER LSQAAAAFRD PARYPDDPVL PALRAEYDEH ERPAMARRLE LAAEWMDESA VSTIHSWCYR MLREHAFDSG SLFTQDLEAD QTALLAEAVR DYWRSFLYPL APEALVLARR ALGDDPEAQR QKVRPLIGQA APVAPGLDSP EGFAGVLDDR LQALERLKRP WREQFEALEA QFHALRKAVL NGRKYQKPEA LMAAMRAWAE DPAALQPEPV GSTSVLGRLC ARGLAAGCRK GQTLPEDLHP GFTALDDYEH LCQLPEADLL NAAANWIGQR FHQAQRRRAQ MGFDDLLTRL DQALTQGEAG ERLAATLRRQ FPVALIDEFQ DTDPVQYRIF QRVYRVADNP REQALLLIGD PKQAIYSFRG ADIHSYLQAR RATAGRHYTL PRNFRSTGAM VRGVNRLFQH AEQAWPEGAF LFRVAEENPV PFLPVAAQGR AEALEVHGAA QPALTLWWED EGEPVAGKHY LPRQAAACAS HIVALLNAGA ARQAGFRDPD GGLVSLRPRD MAVLVRDYNE ARAIQQALAR RRVRSVYLSD RESVYRTDAA GDLLRWLRAC AEPTVERLLR AALATPTLGL PVAELHRLTA DELLWERRVE QFRGYRRLWQ GQGVLPMLRR LLHDFAVPAR LVARADGERV LTNLLHLSEL LQQAAAELDG EQALIRHLVE HRSGNGEGGD EQILRLESDE DLVKVVTVHK AKGLEYPLVF LPFICTFKPV DGKRLPYLDR TTGGAPRWLF EMDEETRARA DRARLAEDLR LLYVAMTRAR HACWLGLAPV KRGTRKSNQL HQSAVGYLLA GPEGVVDGGL EAALLRARGE ESAIAVTPLP AASDEGYRGP EHRPALGPAR VPRRPAYEHW WIASYSALSL HAEPAAGEAV GEPARASAST APESVGQEII RETVAEAGES AGPDSAALAL HRFPRGPGPG TFLHGLLEWA ADQGFAGLAE QPALLQREVA RRCRRRGWSE WAGPVADWLL RLLQADYPLP EGGALRLTAL PSYQPELEFL FQTRWLNAAR LDRAVTAATL DAAPRPPARA ALLNGMLKGF IDLVCEQDGR YYVADYKSNW LGQGLEDYAP EALRAAVLAR RYDLQLVLYT LALHRLLRAR LPGYDYERHV GGAVYLFLRG LDAPGRGLFH CRPPRALIEQ LDDWLQQGPD RPDVQATGSD DA
|
| |
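The gene and protein sequences above can be related with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which match the standard genetic code (the tables differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_content` are hypothetical and not part of any database API; whitespace is stripped because the sequence is displayed in blocks of ten bases.

```python
# Codon table built in the conventional TCAG order.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three ten-base blocks of the gene sequence shown above.
first_blocks = "ATGAGCCAGA GTGCGCCCCG ACCGCTGGAG"
print(translate(first_blocks))  # MSQSAPRPLE, the first ten residues of the protein
```

Applying `gc_content` to the full gene sequence would give the value to compare against the "GC content" field; only a short prefix is used here to keep the example compact.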