Gene Smed_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2071 
Symbol 
ID5322930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2124395 
End bp2126068 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content64% 
IMG OID640791008 
ProductDNA repair protein RecN 
Protein accessionYP_001327739 
Protein GI150397272 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.240609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00158855 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCGCCC AGCTCGCGAT CCGAGATATC GTCCTGATCG AACGGCTTGA CCTCAGCTTC 
GATGTCGGGC TGTCGGTGCT GACCGGCGAA ACCGGTGCGG GCAAATCCAT TCTCCTCGAC
AGTCTGTCGC TTGCGCTGGG CGGGCGCGGC GACGGTTCGC TCGTGCGCCA TGGCGAGGAC
AGGGGCCAGG TGAGCGCCGT GTTCGATGTC CCGGCCGGTC ATTCGGCGCG GCTCCTTTTG
CGGGAAAACG GCATTGATGA CGATGGCGAT CTGATCTTCC GTCGCGTGCA GTCGGCGGAC
GGCCGCACCA AGGCCTTTAT CAACGATCAG CCGGTCAGCG TGCAGCTGAT GCGCCAGGTC
GGCCAGACGC TTGTTGAAAT CCATGGCCAG CATGACGATC GAGCGCTCGT CGATACCGAC
GCGCACCGCA CGCTCGTCGA TGCTTTTGGC GGCACCACCG ATGCGGCGGA AGCGGTCGCG
AACCTTTACC GCGCCTGGAA GGATGCCGAG CGGGGGCTGA AGAAACATCG GGAGAAGGTG
GAGGCCGCAT CCCGGGAGGC GGACTATCTC CGCTCCTCCG TCGAGGAGCT CGAGACACTA
TCGCCGCGCG ATGGCGAAGA GGAGGAGCTG GCGGAGAGCC GCGCCCGCAT GATGAAGGTC
GAACGCATCG CCGGCGATAT CAGCGAGGCA GCCGAGTTTC TGAACGGCAA TGCATCGCCT
GTTCCGCTCA TCGCATCGCT CGTCCGGCGG CTGGAGCGCA AGAGCCATGA GGCTCCCGGC
CTCCTTGAAG AGACCGTCGA ACTTCTGGAC GGTGCACTGA ACCAGCTTGC GGATGCCCAG
ATGGCGGTCG AGCGCGCGTT GCGCAACACC GAGTTCGATC CCAAGGAGCT CGAACGCGTC
GAAGAGCGGC TTTTCGCATT GCGCGCAGCG GGCCGGAAAT ACTCCGTCCC CGTCACCGAA
TTGCCCGCCC TTGCCGTGCG GATGATCGCC GATCTCGCCG ATCTCGATGC AGGCGAGGAG
AAGCTGCAGC AACTCGAGGT ACGGGTCGGC GAATGCAAAG CGGCGTTTGA CGCTGCATCG
CAGTCGCTGT CGGAAAAGCG GCACAATACG GCCGTTGCAC TTTCGGCGGC CGTTATGGAA
GAACTGCCGG CGCTGAAGCT GGAACGCGCC CGCTTCATGG TGGAGGTGAC GAGCGATCCG
GAATCGCCTA CGGCCGACGG GATCGACTCA GTCGAGTTCC ACGTACAGAC CAATCCCGGC
ACCAGGCCGG GACCGATCAT GAAGGTGGCT TCGGGCGGCG AGCTTTCGCG GTTCCTGCTC
GCGCTGAAAG TGGCGCTTGC CGACCGGGGT TCGGCGCCGA CTCTCGTCTT CGACGAGATC
GACACGGGTG TTGGTGGTGC CGTGGCAGAT GCGATCGGTC AGCGCCTGAA ACGCCTTTCG
AAGACCGTCC AGGTGCTTTC GGTCACCCAT GCGCCGCAGG TTGCCGCGCG TGCGGCCACG
CATCTCCTGA TCTCGAAGGG TCCCTCGGCG GAAAAAGCCG AGATGATCGC GACTCGCGTC
GCTCGCATGG ACGATGCGGC ACGCACCGAA GAGATAGCCC GTATGCTGGC AGGGGCCTCG
ATCACCGAAG AGGCGAGGGC CGCGGCTGCG CGATTGCTCG CCGGCAATGC CTGA
 
Protein sequence
MLAQLAIRDI VLIERLDLSF DVGLSVLTGE TGAGKSILLD SLSLALGGRG DGSLVRHGED 
RGQVSAVFDV PAGHSARLLL RENGIDDDGD LIFRRVQSAD GRTKAFINDQ PVSVQLMRQV
GQTLVEIHGQ HDDRALVDTD AHRTLVDAFG GTTDAAEAVA NLYRAWKDAE RGLKKHREKV
EAASREADYL RSSVEELETL SPRDGEEEEL AESRARMMKV ERIAGDISEA AEFLNGNASP
VPLIASLVRR LERKSHEAPG LLEETVELLD GALNQLADAQ MAVERALRNT EFDPKELERV
EERLFALRAA GRKYSVPVTE LPALAVRMIA DLADLDAGEE KLQQLEVRVG ECKAAFDAAS
QSLSEKRHNT AVALSAAVME ELPALKLERA RFMVEVTSDP ESPTADGIDS VEFHVQTNPG
TRPGPIMKVA SGGELSRFLL ALKVALADRG SAPTLVFDEI DTGVGGAVAD AIGQRLKRLS
KTVQVLSVTH APQVAARAAT HLLISKGPSA EKAEMIATRV ARMDDAARTE EIARMLAGAS
ITEEARAAAA RLLAGNA