Gene Smed_6034 details


Gene Information

Locus tag: Smed_6034
Symbol:
ID: 5320336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 984966
End bp: 986168
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 59%
IMG OID: 640777697
Product: transposase mutator type
Protein accession: YP_001314629
Protein GI: 150378034
COG category: [L] Replication, recombination and repair
COG ID: [COG3328] Transposase and inactivated derivatives
TIGRFAM ID:
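
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 984966
end_bp = 986168
gene_length_bp = 1203
protein_length_aa = 400

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
# Assumption: the final codon is a stop codon and contributes no residue.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1203 0 400
```

Under those assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.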


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0363265
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.408755
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATCG AGAAAGAGCT TCTGGACCAG CTCCTGGCTG GACGTGATCC ATCCGAGGTT 
TTCGGCAAGG ACGGTTTGCT GGACGATCTG AAGAAGGCGC TTTCAGAGCG CATCCTCAAT
GCGGAGCTTG ACGACCATCT CGACGTCGGG CGCCTGGAGG GCGGCCCCGC CAACAGGCGC
AACGGTTCCT CCAAGAAGAC GGTTTTGACT GGCACATCGA AGATGACGCT GACCATCCCG
CGCGATCGGG CGGGTACCTT CGACCCAAAG CTGATCGCCA GATATCAGCG CCGGTTTCCC
GATTTCGACG ATAAGATCAT TTCGATGTAC GCCCGTGGTA TGACAGTGCG CGAGATCCAG
GGGCATCTTG AAGAGCTCTA CGGCATCGAT GTGTCGCCGG ATCTGATCTC GGCGGTGACC
GATACGGTTC TGGAGGCCGT TGGAGAGTGG CAAAACCGGC CGCTCGAGCT TTGCTACCCC
CTCGTGTTTT TCGACGCCAT CCGGGTCAAG ATCAGAGACG AGGGCTTCGT ACGCAACAAA
GCCGTCTATG TCGCCCTGGC CGTGCTCGCT GACGGCAGCA AGGAGATCCT CGGGCTCTGG
ATCGAGCAGA CGGAAGGGGC AAAGTTCTGG CTGCGGGTCA TGAACGAGCT GAAGAACCGC
GGTTGCCAGG ATATCCTAAT CGCCGTGGTC GACGGCTTGA AGGGCTTCCC CGAGGCCATC
ACCGCCGTCT TTCCCCAAAC AATCGTCCAG ACCTGCATCG TCCACCTGAT CCGGCACTCG
TTGGAGTTCG TATCCTACAA GGATAGAAGG ACCGTTGTGC CGGCTTTGAG AGCCATCTAC
CGCGCCCGAG ATGCCGAGGC GGGCCTGAAG GCGCTGGAGG CCTTCGAGGA AGGGTACTGG
GGCCAGAAAT ATCCCGCTAT CGCTCAAAGC TGGCGGCGCA ACTGGGAACA CGTCGTTCCC
TTCTTCGCCT TCCCCGAAGG GGTCCGCCGC ATCATCTACA CGACGAACGC AATAGAGGCC
CTCAACTCGA AGCTTCGGCG AGCTGTGCGT TCCCGCGGGC ATTTCCCTGG TGACGAAGCC
GCGATGAAGC TGTTATATCT CGTTCTTAAC AACGCGGCCG AGCAATGGAA ACGGGCGCCG
CGGGAATGGG TCGAGGCAAA GACACAGTTC GCTGTCATCT TTGGCGAGCG GTTCTTCAAC
TGA
 
Protein sequence
MAIEKELLDQ LLAGRDPSEV FGKDGLLDDL KKALSERILN AELDDHLDVG RLEGGPANRR 
NGSSKKTVLT GTSKMTLTIP RDRAGTFDPK LIARYQRRFP DFDDKIISMY ARGMTVREIQ
GHLEELYGID VSPDLISAVT DTVLEAVGEW QNRPLELCYP LVFFDAIRVK IRDEGFVRNK
AVYVALAVLA DGSKEILGLW IEQTEGAKFW LRVMNELKNR GCQDILIAVV DGLKGFPEAI
TAVFPQTIVQ TCIVHLIRHS LEFVSYKDRR TVVPALRAIY RARDAEAGLK ALEAFEEGYW
GQKYPAIAQS WRRNWEHVVP FFAFPEGVRR IIYTTNAIEA LNSKLRRAVR SRGHFPGDEA
AMKLLYLVLN NAAEQWKRAP REWVEAKTQF AVIFGERFFN
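
The protein sequence can be derived from the gene sequence by reading it codon by codon. The sketch below translates only the first row of the gene sequence and is not the database's own pipeline. It assumes that translation table 11 uses the same codon-to-residue assignments as the standard genetic code; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration.

```python
# Codon-to-residue mapping in the conventional TCAG ordering.
# Assumption: table 11 assigns sense and stop codons as the standard code does.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: RESIDUES[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring whitespace; stop codons become '*'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row of the gene sequence in this record (60 bases, 20 codons).
first_row = "ATGGCTATCG AGAAAGAGCT TCTGGACCAG CTCCTGGCTG GACGTGATCC ATCCGAGGTT"
print(translate(first_row))  # MAIEKELLDQLLAGRDPSEV
```

The result matches the first 20 residues of the protein sequence listed above. Applied to the full gene sequence, the same function would end with `*` for the terminal TGA stop codon.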