Gene Mlg_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1904 
Symbol 
ID4270104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2171472 
End bp2173166 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content72% 
IMG OID638126660 
ProductDNA repair protein RecN 
Protein accessionYP_742738 
Protein GI114321055 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAGCC ACATCGATAT CCGCGACTTC GCCATCGTCG ACCAGCTCGA ACTGGACTTC 
GGCGCCGGGA TGAACGTGCT CACCGGCGAG ACCGGGGCGG GCAAATCCAT CCTGCTCGAC
GCTCTGGGGC TCTGCCTGGG CGACCGCGCC GACAGCGGCA CCGTCCGCCC CGGGGCCAAG
CGGGCGGACC TCAGCGTCAG CTTCCGGCTC GCCCCCGACA GCCCCGTGCA CGACTGGCTG
GCCGAGCACG ACCTGGATGA GGACGGCGAC TGCATCCTCC GTCGCACCAT CCAGGAGAGC
GGCCGCACCC GCGGCTACAT CAACGGCCGC CCCGCCCCGC TCAACCTGCT CAAGGCCCTG
GGCGAGCAAC TGGTCGACAT CCACGGCCAG CACGCCCACC AGCTGCTGCT GCGCCGCCAC
GTCCAGCGCC GGATCCTGGA CGAGCACGCC GACGAGGGCG GCGCCCTGGA ACGGGTCCGC
TCGCTCCACC AGCAGCTGCG TGCGGTGGAC GAGGAGCTGC GCGCCTTGGA GGGCGACCGG
GAGAGCCACG AGGACCGCCT GGCGCTGCTG CGCTACCAAG TGGACGAGCT GGCCGCACTG
GAGCTGACGG TGGAGGGCAT CGAGGCGCTG GAGCAGGAAC AGAAGCGCCT GGCCAATGCC
GGCGCCCTGA TTCAGATGGC ACAGCAGATC CTCGACCCGC TCTACGACGA CGAGCAGTCC
GCGCAGGCCG CCCTGGGCCG CGCCAGCCGC GAACTGGACG GCCACGCCGG GCTGGACCCG
GCCCTGGACG AGGCCCGGGA GCTGTTCGGC AACGCCCTGG TGCAACTTGA GGAGGGCTGC
GATGCCCTGC GCCGGTTCGC CGACAACCTG GAGCTGGACC CGGAGCGCCT GGCCTGGGCC
GAGGAGCGAC TGGGCCAACT GAGCGACCTG GCGCGCAAGC ACCGCTGCCG TCCGGAGGCC
CTCCCCGAGC GGCTCGAGGC CCTGCAGGCG GAGCTCGCAG AGCTGGAGGG GGCCGGGGAG
CGGGTCCAGG CCCTGCGCGA GCAGCGCGCG GCCCTGCATC GCGACTACCG GGAGGCCGCC
GCCACGCTCA GTGAGCAACG CCAGGCCCAC GCCCGGGCCC TGGAGCAGCG GGTGGCCGGG
CTGCTGGAGG AGCTGAGCAT GGGCGGGGCC GAGCTCCAGA TCCAGGTGGC CTTCGACGCC
GAGGCCGAGC CCACCCCGCA CGGGCTGGAT CAGGTGGAGT TTCTGGTCCG CACCAACCCT
GGCCAAGCCT TCGGGCCGCT GGCCAAGGTG GCCTCCGGCG GCGAGCTGTC ACGGTTGGGG
CTGGCCCTGC AGGTCGCCAG CACCAAGGGC ACCGGCGCCC CCACCCTGAC CCTGGTCTTC
GACGAGGCGG ACAGCGGGAT CGGCGGTGCC GTGGCCGAGG TGGTCGGGCG CCTGCTGGCC
TCGCTGGGCC AACGCTACCA GGTGCTGTGC ATCACCCACC TGCCCCAGGT GGCCGCCCAG
GCCGGGTGCC ACTTTCAGGT CAGCAAGCAC AGCGAACGGG ACCGGACCCG CACCCGGGTC
ACCCCGCTCA CCGGCGAGCA GCGGATTCAG GAAGTGGCCC GAATGCTGGG CGGCGTGGAG
ATCAGTGATA ACACCCTGGC CTCGGCCCGG GAGATGCTGG AACGCGGCGC CGGCAGGCGC
CGGGAGACCG CCTGA
 
Protein sequence
MLSHIDIRDF AIVDQLELDF GAGMNVLTGE TGAGKSILLD ALGLCLGDRA DSGTVRPGAK 
RADLSVSFRL APDSPVHDWL AEHDLDEDGD CILRRTIQES GRTRGYINGR PAPLNLLKAL
GEQLVDIHGQ HAHQLLLRRH VQRRILDEHA DEGGALERVR SLHQQLRAVD EELRALEGDR
ESHEDRLALL RYQVDELAAL ELTVEGIEAL EQEQKRLANA GALIQMAQQI LDPLYDDEQS
AQAALGRASR ELDGHAGLDP ALDEARELFG NALVQLEEGC DALRRFADNL ELDPERLAWA
EERLGQLSDL ARKHRCRPEA LPERLEALQA ELAELEGAGE RVQALREQRA ALHRDYREAA
ATLSEQRQAH ARALEQRVAG LLEELSMGGA ELQIQVAFDA EAEPTPHGLD QVEFLVRTNP
GQAFGPLAKV ASGGELSRLG LALQVASTKG TGAPTLTLVF DEADSGIGGA VAEVVGRLLA
SLGQRYQVLC ITHLPQVAAQ AGCHFQVSKH SERDRTRTRV TPLTGEQRIQ EVARMLGGVE
ISDNTLASAR EMLERGAGRR RETA