Gene Msil_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3497 
Symbol 
ID7092521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3839440 
End bp3841101 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content69% 
IMG OID643466788 
ProductDNA repair protein RecN 
Protein accessionYP_002363748 
Protein GI217979601 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.185224 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGTTC AGCTCTCCAT CCGCAACATT GTCCTGATCG ACCGGCTCGA CATGAGTTTT 
GCCGGCGGCC TCAGCGTGCT GACCGGCGAG ACCGGCGCCG GCAAATCGAT TTTGCTCGAT
TCCTTTTCCC TCGCCCTTGG CGCGCGCGGC GACGGTTCCC TTGTCCGCGA AGGCGAGGCG
CAGGGCCAGG TCACCGCGGC CTTCGACCTT GCGCCCGACC ATCCGGCCCT GAAGGCGGCC
AACGCGCAGG ATATCGAGAC CGACGGCTCT CTGATCCTGC GCCGCGTGCA ACTCGCCGAC
GGGCGAACGC GGGCCTTCGT CAACGACCAG CGCGTCACCT CGCAGGCGCT GCGCCTCGTC
ACCCGCGAAC TTGTCGAGAT CCACGGCCAG CACGACGACC GCGCGATGGT CGATTCGACG
ACGCACCGCG CCCTGATCGA CGCCTATGGC GGGCTTCAGC CGCAATTATT CGCAACCCGC
GCCGCGCATG GCAGATGGCG CGATTTGTCG GCGCAAAAAG CCCGCGAGGA GGTCCGCATC
GCCAAGGCGC GCGCGGACGC CGATTACCTG CGCCACGCCT TTGCCGAACT CGACAAACTC
AGCCCCGAAC CCGGCGAGGA GGAGGCGCTC GCGGCGCGCC GCACTTTGAT GATGCAATCG
GAAAAGGTCG CTGCCGATCT CCGCGACGCC TATGAGTCCG TTGCAGGCGA TCATTCGCCG
ATGTCGGCTT TGTCGGCGGC GCTGCGCCGG CTGCAGCGGC GAGAGGCGCA GGCGCCGCAG
CTGATCGAGC CGCCGGCGCA GGCGCTGGAC GCGGTCCTGA CCGCGCTCGA CCTCGCCGGC
GAGGCGCTGG CCCAGGCGCT GCGGGAGGCC GACTATGACC CGCGCGAACT GGAGCAGGTC
GAGGAGCGTC TGTTCGCCCT TCGCGCCGCG AGCCGTAAAT ATTCCGCGCC CGTCGATAGC
CTGCCGCAGC TTGCGGAAAA TTTTGCGGCC GCGCTTGAGG AGCTCGACGC CGGCGAGGCG
CGGCTCGAGG CTTTGACGAA GGATGTGGCG CTGGCCGAGG CCGATTATGG GAAGGCTGCA
AAAGAGCTGT CGGCCGCGCG GAAAACAGCT GCAATCGCGC TCGATCGCGC CGTCAACGCC
GAACTCGCGC CGCTGAAACT CGAAGGAGCG CGTTTTTCGA CCGCGATTGG GCCAGAGGCG
GCCGGCCCCG AAGGGATCGA CGCGATCGAA TTCTGGGTTC AGACCAACCC CGGCACGCGG
CCCGGGCCGC TCATGAAGAT CGCCTCCGGC GGAGAACTCG CGCGCTTCAT GCTGGCCCTG
AAAGTCGTGC TCGCTGAAAG GGGGTCCGCG CCGACCCTCG TCTTCGACGA GATCGACACC
GGCGTCGGCG GCGCCGTCGC CGACGCCATC GGCCAAAGGC TTGAGCGGCT CGGCCGCCGC
GTGCAGGTGC TCGCCGTCAC CCATGCGCCG CAGGTCGCGG CCAAGGCGGA AAGCCATTTC
CGCATCGCCA AGGACGCCGC AGAGCCCGGC CGCGTGGCGA CTCGCGTCAT GGCGCTGGCG
GCGGACGCTA GGCGCGAGGA AGTCGCGCGC ATGCTCGCCG GCGCAACCAT CACCAATGAG
GCTCGTGCCG CCGCGGCGCG GCTGATGCAG CGCGCGAAGT GA
 
Protein sequence
MLVQLSIRNI VLIDRLDMSF AGGLSVLTGE TGAGKSILLD SFSLALGARG DGSLVREGEA 
QGQVTAAFDL APDHPALKAA NAQDIETDGS LILRRVQLAD GRTRAFVNDQ RVTSQALRLV
TRELVEIHGQ HDDRAMVDST THRALIDAYG GLQPQLFATR AAHGRWRDLS AQKAREEVRI
AKARADADYL RHAFAELDKL SPEPGEEEAL AARRTLMMQS EKVAADLRDA YESVAGDHSP
MSALSAALRR LQRREAQAPQ LIEPPAQALD AVLTALDLAG EALAQALREA DYDPRELEQV
EERLFALRAA SRKYSAPVDS LPQLAENFAA ALEELDAGEA RLEALTKDVA LAEADYGKAA
KELSAARKTA AIALDRAVNA ELAPLKLEGA RFSTAIGPEA AGPEGIDAIE FWVQTNPGTR
PGPLMKIASG GELARFMLAL KVVLAERGSA PTLVFDEIDT GVGGAVADAI GQRLERLGRR
VQVLAVTHAP QVAAKAESHF RIAKDAAEPG RVATRVMALA ADARREEVAR MLAGATITNE
ARAAAARLMQ RAK