Gene Smed_0452 details

Gene Information

Locus tag: Smed_0452
Symbol:
ID: 5321286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 488506
End bp: 489663
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 61%
IMG OID: 640789387
Product: radical SAM domain-containing protein
Protein accession: YP_001326144
Protein GI: 150395677
COG category: [L] Replication, recombination and repair
COG ID: [COG1533] DNA repair photolyase
TIGRFAM ID:
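
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 488506
end_bp = 489663

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1158 385
```

Both results match the Gene Length (1158 bp) and Protein Length (385 aa) fields.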


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.290609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGATC TGTCTCAACT CAAGCAGGGC ATCTATGCCC CCGGCAACAG TGTGGATGTC 
GCGGAAGCCC TTGTTGCCGG AACCGGCATC AGGATCGACG TAGACCGTCG GCGCGGCCGT
GGTGCGGCCC TCAACACATC GGGACGCTTC GAGCCGAAAA CGCACGAAGT CTTCGATGAC
GGCTGGCAAA CGATCGAGGC GCTTCCACCG TTCAAGACAG AAGTCCAGAT CGAAAAGCCG
AAGACAGCGA TCACGCGCAA CGATTCTCCC GATATATCCT TCGACCGATC GATCAATCCT
TACCGCGGCT GCGAACATGG CTGCATCTAT TGCTTCGCGC GGCCTACGCA CGCCTATATG
GGGCTTTCGG CGGGGCTCGA TTTCGAAGCC AAGCTCTTCG CCAAGCCGGA CGCACCACGC
CTCCTGGAAC GCGAACTGGC GCGGCCGGAC TACAAGCTCC GTCCGATCGC AATCGGCACC
AATACCGATC CCTATCAGCC GATCGAGAAG GAATGGCGGA TCATGCGACA GATCCTGGAA
GTACTGAAGG AGGCCAATCA TCCGGTGATG ATCGTCACGA AATCGGCGAT GGTGACGCGC
GACATCGATC TGCTGGCGCC GATGGCAGAA AAAGGTCTCG CACGGGTCGG ACTCTCCGTG
ACCACGCTCG ACGGGAAGCT GGCGCGCAAC ATGGAGCCGA GGGCATCGAC GCCGGCCAAG
CGGCTGGAGG CGATACGCGC AATTTCCGAA GCCGGCATTC CCGCTGGTGT TCTGGTCGCG
CCGATCATTC CGGCGCTGAA CGACCACGAG ATAGAGCGGG TGCTCGATTC GGCGAAAGTA
GCGGGTGCTT CGGATGCGAG CTATGTGCTC CTTCGGCTTC CATTGGAAGT AAGCCCCCTC
TTCCGCGACT GGCTTCTCAG GAACTATCCG GACCGGTACC GGCACGTCAT GTCCCTCGTC
CGTTCCATGC GCGGCGGCAA GGATTACGAC GCCGAGTTCG GCAAGCGGAT GAAGGGAAGC
GGACCTTACG CCTGGCAGAT CGGCCGCCGC TTCGAGCTTG CCGCCAAGCG GCTCGGCCTC
AATCTGACGC GCCGGCAATT GCGCAGCGAC CTCTTCGTGC CGCCGCTCGG GATGGGCGTT
CAGCTGTCGT TGCTCTGA
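
The gene sequence above is displayed in blocks of ten bases, so any calculation on it (for example, the GC content field) first needs the spaces and line breaks removed. The following is a minimal sketch under that assumption. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Sample input: the first two ten-base blocks of the gene sequence.
first_blocks = clean_sequence("ATGAATGATC TGTCTCAACT")
print(len(first_blocks), gc_percent(first_blocks))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` would give the value for the whole gene.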
 
Protein sequence
MNDLSQLKQG IYAPGNSVDV AEALVAGTGI RIDVDRRRGR GAALNTSGRF EPKTHEVFDD 
GWQTIEALPP FKTEVQIEKP KTAITRNDSP DISFDRSINP YRGCEHGCIY CFARPTHAYM
GLSAGLDFEA KLFAKPDAPR LLERELARPD YKLRPIAIGT NTDPYQPIEK EWRIMRQILE
VLKEANHPVM IVTKSAMVTR DIDLLAPMAE KGLARVGLSV TTLDGKLARN MEPRASTPAK
RLEAIRAISE AGIPAGVLVA PIIPALNDHE IERVLDSAKV AGASDASYVL LRLPLEVSPL
FRDWLLRNYP DRYRHVMSLV RSMRGGKDYD AEFGKRMKGS GPYAWQIGRR FELAAKRLGL
NLTRRQLRSD LFVPPLGMGV QLSLL
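
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method. It builds the codon table by hand, relying on the fact that translation table 11 uses the same amino-acid assignments as the standard genetic code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGAATGATC TGTCTCAACT CAAGCAGGGC"))  # MNDLSQLKQG
print(translate("CAGCTGTCGT TGCTCTGA"))               # QLSLL
```

The two fragments reproduce the first ten and last five residues of the protein sequence shown above.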