Gene Smed_4410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4410 
Symbol 
ID5318123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp904990 
End bp906108 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content61% 
IMG OID640776214 
Productputative DNA topoisomerase I protein 
Protein accessionYP_001313147 
Protein GI150376551 
COG category[L] Replication, recombination and repair 
COG ID[COG3569] Topoisomerase IB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.017671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGAG CAATGTCGGT CGACGTGCTG GAATGTCGAC CTGCCGGGCT CGCCGATCTC 
CCGCAGCGGC TCGCGGCGAT CGGGTCTGTG CCCGAGGAAA CTGGGCTCGT CTACGTCAGC
GACAGCGAGC CGGGTATTCG GCGCCAAAGA CGTGGCAAAG GGTTCGCGTA TCGCATGCCG
GACGGATCGA TCGTTACCGA TCCGTCTGTT AAAAGCCGCA TAGCGGCACT GGGATTACCG
CCTGCCTATG AGAACGTCTG GATCTGCCTT GACGAACGAG GCCACCTGCA GGCCACCGGC
TATGATGCGC GCGGCCGAAA GCAGTATCGT TATCACAGCG AATGGCAGGC CCTCAGGAGC
GCCGACAAAT TTGCGCAACT GACGGAATTC GGCAAAGCTT TGCCTAAGAT ACGCCGCACC
ATACGGCGCC ACATGCAGGG CGGCGTGGAA AACATGCAGA CCGTGCTTGC GGCGCTCGTG
GCCCTGCTGG ACGAAGCGCA TCTGCGCACC GGCAACCAGG CTTATGTGCA GGCCAACGGA
AGTTATGGCG CAACCACTTT GCTGAAGCGG CATCTCAGAC TTGGCGACGG CTTTATAGAA
TTGAAATTCA CCGGAAAGGG TGGCAAGCGC GTTCAGCGGG TGCTCCGTCG CCCGAAGTTG
CAGCGACTGC TCGAAGAAAT AGCCGATCTC CCGGGCAGGC AGCTTTTCGT CTGGAAGGAT
GAAAACGACG CGCTTCGACC GGTCGATTCC GGCCGGCTCA ACCGGTATCT CACGGACATG
GCCGGCACAG CGATCTCGGC AAAGACATTC CGAACATGGG GCGGCACTCT CGCCGCTTTC
ACGGTCGCGC GGACCTCGAT CGAGCGGGGT GAGTGGCCGA CGATCAAACA GATGAGCGAG
GCGGCTGCAT CCGTGCTTCA CAACACGCCC GCAATCAGCC GGAGCAGTTA CATTCATCCG
GATGTGCTCG CCCTTGCCGA CAAGTCGGCC CCGGTCTCCG CGCGGCAGCT TCAGGCGCGT
GGGCGTTCAG GAAGTGAATT GCGTGTGGAG GAACAGCGTT TGCTAGGCTT TCTTCAGCGC
AGCGCGGGGA CGAAAAAGCG CCTGCCCTTG CCCCAGTGA
 
Protein sequence
MARAMSVDVL ECRPAGLADL PQRLAAIGSV PEETGLVYVS DSEPGIRRQR RGKGFAYRMP 
DGSIVTDPSV KSRIAALGLP PAYENVWICL DERGHLQATG YDARGRKQYR YHSEWQALRS
ADKFAQLTEF GKALPKIRRT IRRHMQGGVE NMQTVLAALV ALLDEAHLRT GNQAYVQANG
SYGATTLLKR HLRLGDGFIE LKFTGKGGKR VQRVLRRPKL QRLLEEIADL PGRQLFVWKD
ENDALRPVDS GRLNRYLTDM AGTAISAKTF RTWGGTLAAF TVARTSIERG EWPTIKQMSE
AAASVLHNTP AISRSSYIHP DVLALADKSA PVSARQLQAR GRSGSELRVE EQRLLGFLQR
SAGTKKRLPL PQ