Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2748 |
Symbol | |
ID | 4270217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 3117911 |
End bp | 3120733 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638127510 |
Product | excinuclease ABC subunit A |
Protein accession | YP_743578 |
Protein GI | 114321895 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0880957 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCATA TCCGCATCGA AGGCGCCCGC ACCCACAACC TGCAGGATGT GCACCTGTCG CTGCCGCGCG AGCGGCTGGT GGTGATCACC GGCCTGTCCG GCTCCGGCAA GTCCTCGCTG GCCTTCGACA CGCTTTATGC CGAGGGCCAG CGTCGCTACG TAGAGTCGCT CTCCGCCTAC GCCCGGCAGT TCCTCTCGAT GATGGAAAAG CCGGACGTGG ACCACATCGA GGGCCTGTCG CCGGCCATCT CCATCGAGCA GAAATCCACC TCCCACAACC CCCGCTCCAC GGTGGGCACC GTCACCGAGA TCCACGACTA CCTGCGCCTG CTCTACGCCC GCGCTGGCCG CCCCCACTGC CCCGAGCACG ACATCGAGCT GAATGCCCAG ACGGTCTCGC AGATCGTCGA CCGGATCCTG GCCCTGCCCG AGGGCGAGAA GATCATGCTC CTGGCGCCGG TGGTGGACGG ACGCAAGGGC GAACACCGGG AGGTGATCGA CACCCTGCGC ACCCAGGGCT ATGTGCGCGC CCGGGTGGAC GGCGAGGTCC ACGACATCGA CAGTGTGCCG GCCCTGGACC CCAAACGGCG CCACACCATC GAGGCGGTGG TGGACCGCTT CCGGGTGCGG CCGGACCTCG CCACCCGGCT GGCCGACTCG GTGGAGACGG CGCTGGCGCT GTCGGACGGT CTGGTCCGGG CCGCCTGGAT GGACGACCCG GCGCGGGCGC CGCTGGTCTT CTCCTCGCGC TACGCCTGCC CCGAGTGCGG CTACGCCATC AGCGAGTTGG AGCCGCGCAT CTTCTCCTTC AACAACCCGC GCGGGGCCTG CCCCGACTGC GACGGCCTGG GCGTGCGGAC CTTCTTCGAC CCGGCGCGGG TGGTGGTGCA CCCGGAATTG CCCCTAACCG CCGGCGCGGT GCGCGGCTGG GACCGGCGCA ACGCCTGGTA CCACGCCATG ATCCAGTCAC TGGCGAAGCA CCACGACTTC GACCCGGAGA CCCCCTGGCA GGACCTGCCC GAGGGCATCC GCGATACCGT GCTGTTCGGC TCCGGGGACG AGGAGATCGA CTTCCGCTAC CCCGGCCCGC GGGGCGAAAC CCGTCGCCGC CACGCCTTCG AGGGCATCAT CCCCAACATG GCCCGGCGCT ACCGGGAGAC GGACTCGGCG GCGGTGCGCG AGGAGCTGGC CCGTTACCAG GCGGTGCAGA CCTGCCCGAG CTGCGACGGC ACCCGGCTCA ACCAGGCCGC CCGTCACGTC TTCGTCGGCC CGCTCACCCT GCCGGAGGTG GCACGCATGC CGGTAGGCCG CACCCTGGGT CATTTCGAGG CGCTGAAGCT GGAGGGGGCG CGCGGCGAGA TCGCCGGACG CATCCTGCGG GAGATCACCA CCCGGCTCAG CTTTCTGGTC AACGTGGGAC TGGATTACCT GAGCCTGGAT CGGGCCGCCG ACACCCTGTC CGGCGGCGAG GCGCAGCGCA TCCGCCTGGC CAGTCAGATC GGCGCCGGCC TCACCGGGGT GATGTACGTG CTGGACGAGC CCTCCATCGG CCTGCACCAG CGCGACAACC AGCGGCTGCT CAACACCCTG ATGCGGCTGC GCGACCTGGG CAACACGGTG ATTGTGGTCG AGCACGACGA GGACGCCATC CGCGCCGCCG ACCACGTGGT GGACATGGGC CCGGGGGCCG GGGTGCACGG CGGTTGGGTG GTCGCGCAGG GCACCCCGAC CGAGATCGCA GCGCACCCCG ACTCGCTCAC CGGCGACTAC CTGGCCGGCC GGCGCGAGAT CCCGGTGCCC GCCGAACGCC GGCCGGTGCA GCCCGACCGG GTCGCCTGGC TGCGCGGCGC CACCGGGCAC AACCTCAAGA ACGTGGACGC GGGCATCCCC GCCGGGCTGT TCGTGGCCAT CACCGGGGTG TCCGGCTCGG GCAAATCCAC GCTCATCAAC GACACCCTGT TCCGCTATGC GGCCCGCGAG CTGAACGGCG CCAGCGCCGC CCCGGCCCCC TGTCGCGGCA TGGACCATCT GGCGCTGTTT GATAAGGTCA TCGACATCGA TCAGGCCCCC ATCGGCCGCA CCCCGCGCTC CAACCCCGCC ACCTACACCG GGCTGTTCAC GCCGCTGCGT GAGCTGTTCG CCGGCACCCC GGAGGCGCGC TCACGCGGCT ACGGCCCCGG GCGTTTCTCC TTCAACGTGA AGGGCGGGCG CTGCGAGGCC TGCCAGGGGG ACGGGGTGAT CCGGGTGGAG ATGCACTTTC TGCCCGACCT GCACGTGCCC TGCGACGTCT GCAAGGGCCG GCGCTACAAC CGCGAGACCC TGGAGATCCG CTACAAGGGG CTGGACATTG CCCAGGTGCT GGAGCTGACC GTGGAGGAGG CCCTGGCCGT CTTCGACGCC ATCCCCGGCA TCCGCCGGCG GCTGCAGACG CTGATGGACG TGGGCCTGGG CTACATCCAC CTGGGCCAGA GCGCCACCAC CCTGTCCGGC GGCGAGGCCC AACGGGTGAA ACTGGCCCGC GAGTTGGCCC GCCGGGACAC CGGCCGGACC CTCTACATCC TGGATGAACC CACTACCGGC CTGCACTTCC ACGACATCGC GCAACTGCTC TCCGTGCTCC AGCGGCTGGT GGATCACGGC AACACCGTGG TGGTCATTGA GCACAACCTG GATGTGATCA AGACCGCCGA TCACGTGATC GACCTGGGCC CGGAAGGGGG CGACGGCGGC GGCCGCATCG TGGCCACCGG CACGCCGGAG GCCATCACCC GGGTGGAGGC CTCCCATACG GGCCGCTTCC TGCGACCGCT GCTGCGTCCC TAA
|
Protein sequence | MDHIRIEGAR THNLQDVHLS LPRERLVVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSMMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT VTEIHDYLRL LYARAGRPHC PEHDIELNAQ TVSQIVDRIL ALPEGEKIML LAPVVDGRKG EHREVIDTLR TQGYVRARVD GEVHDIDSVP ALDPKRRHTI EAVVDRFRVR PDLATRLADS VETALALSDG LVRAAWMDDP ARAPLVFSSR YACPECGYAI SELEPRIFSF NNPRGACPDC DGLGVRTFFD PARVVVHPEL PLTAGAVRGW DRRNAWYHAM IQSLAKHHDF DPETPWQDLP EGIRDTVLFG SGDEEIDFRY PGPRGETRRR HAFEGIIPNM ARRYRETDSA AVREELARYQ AVQTCPSCDG TRLNQAARHV FVGPLTLPEV ARMPVGRTLG HFEALKLEGA RGEIAGRILR EITTRLSFLV NVGLDYLSLD RAADTLSGGE AQRIRLASQI GAGLTGVMYV LDEPSIGLHQ RDNQRLLNTL MRLRDLGNTV IVVEHDEDAI RAADHVVDMG PGAGVHGGWV VAQGTPTEIA AHPDSLTGDY LAGRREIPVP AERRPVQPDR VAWLRGATGH NLKNVDAGIP AGLFVAITGV SGSGKSTLIN DTLFRYAARE LNGASAAPAP CRGMDHLALF DKVIDIDQAP IGRTPRSNPA TYTGLFTPLR ELFAGTPEAR SRGYGPGRFS FNVKGGRCEA CQGDGVIRVE MHFLPDLHVP CDVCKGRRYN RETLEIRYKG LDIAQVLELT VEEALAVFDA IPGIRRRLQT LMDVGLGYIH LGQSATTLSG GEAQRVKLAR ELARRDTGRT LYILDEPTTG LHFHDIAQLL SVLQRLVDHG NTVVVIEHNL DVIKTADHVI DLGPEGGDGG GRIVATGTPE AITRVEASHT GRFLRPLLRP
|
| |