Gene Mlg_2748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2748 
Symbol 
ID4270217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3117911 
End bp3120733 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content70% 
IMG OID638127510 
Productexcinuclease ABC subunit A 
Protein accessionYP_743578 
Protein GI114321895 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0880957 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCATA TCCGCATCGA AGGCGCCCGC ACCCACAACC TGCAGGATGT GCACCTGTCG 
CTGCCGCGCG AGCGGCTGGT GGTGATCACC GGCCTGTCCG GCTCCGGCAA GTCCTCGCTG
GCCTTCGACA CGCTTTATGC CGAGGGCCAG CGTCGCTACG TAGAGTCGCT CTCCGCCTAC
GCCCGGCAGT TCCTCTCGAT GATGGAAAAG CCGGACGTGG ACCACATCGA GGGCCTGTCG
CCGGCCATCT CCATCGAGCA GAAATCCACC TCCCACAACC CCCGCTCCAC GGTGGGCACC
GTCACCGAGA TCCACGACTA CCTGCGCCTG CTCTACGCCC GCGCTGGCCG CCCCCACTGC
CCCGAGCACG ACATCGAGCT GAATGCCCAG ACGGTCTCGC AGATCGTCGA CCGGATCCTG
GCCCTGCCCG AGGGCGAGAA GATCATGCTC CTGGCGCCGG TGGTGGACGG ACGCAAGGGC
GAACACCGGG AGGTGATCGA CACCCTGCGC ACCCAGGGCT ATGTGCGCGC CCGGGTGGAC
GGCGAGGTCC ACGACATCGA CAGTGTGCCG GCCCTGGACC CCAAACGGCG CCACACCATC
GAGGCGGTGG TGGACCGCTT CCGGGTGCGG CCGGACCTCG CCACCCGGCT GGCCGACTCG
GTGGAGACGG CGCTGGCGCT GTCGGACGGT CTGGTCCGGG CCGCCTGGAT GGACGACCCG
GCGCGGGCGC CGCTGGTCTT CTCCTCGCGC TACGCCTGCC CCGAGTGCGG CTACGCCATC
AGCGAGTTGG AGCCGCGCAT CTTCTCCTTC AACAACCCGC GCGGGGCCTG CCCCGACTGC
GACGGCCTGG GCGTGCGGAC CTTCTTCGAC CCGGCGCGGG TGGTGGTGCA CCCGGAATTG
CCCCTAACCG CCGGCGCGGT GCGCGGCTGG GACCGGCGCA ACGCCTGGTA CCACGCCATG
ATCCAGTCAC TGGCGAAGCA CCACGACTTC GACCCGGAGA CCCCCTGGCA GGACCTGCCC
GAGGGCATCC GCGATACCGT GCTGTTCGGC TCCGGGGACG AGGAGATCGA CTTCCGCTAC
CCCGGCCCGC GGGGCGAAAC CCGTCGCCGC CACGCCTTCG AGGGCATCAT CCCCAACATG
GCCCGGCGCT ACCGGGAGAC GGACTCGGCG GCGGTGCGCG AGGAGCTGGC CCGTTACCAG
GCGGTGCAGA CCTGCCCGAG CTGCGACGGC ACCCGGCTCA ACCAGGCCGC CCGTCACGTC
TTCGTCGGCC CGCTCACCCT GCCGGAGGTG GCACGCATGC CGGTAGGCCG CACCCTGGGT
CATTTCGAGG CGCTGAAGCT GGAGGGGGCG CGCGGCGAGA TCGCCGGACG CATCCTGCGG
GAGATCACCA CCCGGCTCAG CTTTCTGGTC AACGTGGGAC TGGATTACCT GAGCCTGGAT
CGGGCCGCCG ACACCCTGTC CGGCGGCGAG GCGCAGCGCA TCCGCCTGGC CAGTCAGATC
GGCGCCGGCC TCACCGGGGT GATGTACGTG CTGGACGAGC CCTCCATCGG CCTGCACCAG
CGCGACAACC AGCGGCTGCT CAACACCCTG ATGCGGCTGC GCGACCTGGG CAACACGGTG
ATTGTGGTCG AGCACGACGA GGACGCCATC CGCGCCGCCG ACCACGTGGT GGACATGGGC
CCGGGGGCCG GGGTGCACGG CGGTTGGGTG GTCGCGCAGG GCACCCCGAC CGAGATCGCA
GCGCACCCCG ACTCGCTCAC CGGCGACTAC CTGGCCGGCC GGCGCGAGAT CCCGGTGCCC
GCCGAACGCC GGCCGGTGCA GCCCGACCGG GTCGCCTGGC TGCGCGGCGC CACCGGGCAC
AACCTCAAGA ACGTGGACGC GGGCATCCCC GCCGGGCTGT TCGTGGCCAT CACCGGGGTG
TCCGGCTCGG GCAAATCCAC GCTCATCAAC GACACCCTGT TCCGCTATGC GGCCCGCGAG
CTGAACGGCG CCAGCGCCGC CCCGGCCCCC TGTCGCGGCA TGGACCATCT GGCGCTGTTT
GATAAGGTCA TCGACATCGA TCAGGCCCCC ATCGGCCGCA CCCCGCGCTC CAACCCCGCC
ACCTACACCG GGCTGTTCAC GCCGCTGCGT GAGCTGTTCG CCGGCACCCC GGAGGCGCGC
TCACGCGGCT ACGGCCCCGG GCGTTTCTCC TTCAACGTGA AGGGCGGGCG CTGCGAGGCC
TGCCAGGGGG ACGGGGTGAT CCGGGTGGAG ATGCACTTTC TGCCCGACCT GCACGTGCCC
TGCGACGTCT GCAAGGGCCG GCGCTACAAC CGCGAGACCC TGGAGATCCG CTACAAGGGG
CTGGACATTG CCCAGGTGCT GGAGCTGACC GTGGAGGAGG CCCTGGCCGT CTTCGACGCC
ATCCCCGGCA TCCGCCGGCG GCTGCAGACG CTGATGGACG TGGGCCTGGG CTACATCCAC
CTGGGCCAGA GCGCCACCAC CCTGTCCGGC GGCGAGGCCC AACGGGTGAA ACTGGCCCGC
GAGTTGGCCC GCCGGGACAC CGGCCGGACC CTCTACATCC TGGATGAACC CACTACCGGC
CTGCACTTCC ACGACATCGC GCAACTGCTC TCCGTGCTCC AGCGGCTGGT GGATCACGGC
AACACCGTGG TGGTCATTGA GCACAACCTG GATGTGATCA AGACCGCCGA TCACGTGATC
GACCTGGGCC CGGAAGGGGG CGACGGCGGC GGCCGCATCG TGGCCACCGG CACGCCGGAG
GCCATCACCC GGGTGGAGGC CTCCCATACG GGCCGCTTCC TGCGACCGCT GCTGCGTCCC
TAA
 
Protein sequence
MDHIRIEGAR THNLQDVHLS LPRERLVVIT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSMMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT VTEIHDYLRL LYARAGRPHC
PEHDIELNAQ TVSQIVDRIL ALPEGEKIML LAPVVDGRKG EHREVIDTLR TQGYVRARVD
GEVHDIDSVP ALDPKRRHTI EAVVDRFRVR PDLATRLADS VETALALSDG LVRAAWMDDP
ARAPLVFSSR YACPECGYAI SELEPRIFSF NNPRGACPDC DGLGVRTFFD PARVVVHPEL
PLTAGAVRGW DRRNAWYHAM IQSLAKHHDF DPETPWQDLP EGIRDTVLFG SGDEEIDFRY
PGPRGETRRR HAFEGIIPNM ARRYRETDSA AVREELARYQ AVQTCPSCDG TRLNQAARHV
FVGPLTLPEV ARMPVGRTLG HFEALKLEGA RGEIAGRILR EITTRLSFLV NVGLDYLSLD
RAADTLSGGE AQRIRLASQI GAGLTGVMYV LDEPSIGLHQ RDNQRLLNTL MRLRDLGNTV
IVVEHDEDAI RAADHVVDMG PGAGVHGGWV VAQGTPTEIA AHPDSLTGDY LAGRREIPVP
AERRPVQPDR VAWLRGATGH NLKNVDAGIP AGLFVAITGV SGSGKSTLIN DTLFRYAARE
LNGASAAPAP CRGMDHLALF DKVIDIDQAP IGRTPRSNPA TYTGLFTPLR ELFAGTPEAR
SRGYGPGRFS FNVKGGRCEA CQGDGVIRVE MHFLPDLHVP CDVCKGRRYN RETLEIRYKG
LDIAQVLELT VEEALAVFDA IPGIRRRLQT LMDVGLGYIH LGQSATTLSG GEAQRVKLAR
ELARRDTGRT LYILDEPTTG LHFHDIAQLL SVLQRLVDHG NTVVVIEHNL DVIKTADHVI
DLGPEGGDGG GRIVATGTPE AITRVEASHT GRFLRPLLRP