Gene Smed_3691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3691 
Symbol 
ID5318809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp132330 
End bp133301 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID640775504 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001312437 
Protein GI150375841 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.687432 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACATC CTGATGTCAT CGGATGCGAT ATTGCAAAAG CGCATCTAGA TTTTTTCGAC 
AGCGGCCTTG AGCGCCATTT CCGTATCGAC AACACTCCGG CCGCAATTTC CGCGTGGCTC
GACGGCCTTG ATGGCAGAGG CGTTCATATC GTCTTTGAGG CGACCGGGCG TTACGATCGG
CAGTTGCGCA TAGCCCTGGA GACCCGGGAG TTGCCCTATT CCCGCGTCAA TCCTGCCCGC
GCCCGCGACT TTGCCAAGGC GATCGGCCTT CTTGCCAAGA CGGATGCGAT CGATGCACGT
CTGCTTGCCC GGATGGGTCA AAGCCTGCCA CTCTCAACTC AGGCGCCTGA CGATCCCGCC
CGCCACGTGC TCGCCCGCCT TCACACGCGG CGTGACCAGC TCGTGGCCAT GCGCCAGCAA
GAGCGGACAC GCCTTCATGA GACCGAGGGG ATCGAGCGTG ACAGTGCTGA AAGCCATATG
GCTTGGCTCG ACGCGGAGGT TGCGCGCATC GAAATGGCAT GCCGTGATGT TCTGAAGGCC
GAGAAGACCT TGCAAGAACA AGAGGCAAGG CTGCGTTCCA TTCCCGGCAT CGGCCCCGTG
GCCGCATTGA CCCTGATCGC GCATATGCCA GAACTCGGCA ATCGTTCGGC CAAGGCGATT
GCAGCCCTTG CCGGTCTTGC GCCCTTCAAT GTCGACAGCG GCACGTCACG GGGAAAGCGG
CATATACGCG GCGGTCGCAA GCGGATACGT GACGCGCTCT ACATGGCGGC GCTCACAGCC
AGCCGTATGC CCCGTGCTTT TAAGTCCCAT GCTGACCAAA TGAAGGAGGC AGGCAAGCCC
TTCAAGGTCC GTCATCATTG CGCTTGCCCG CAAATTGCTC GCCATCGCAA ACGCCATCAT
CAGGGACAAA ACAACCTTCC GACGAACCAC CTGACAAACA CAGTTGCCAG CCAGACCAAG
TCCTTGGGCT GA
 
Protein sequence
MIHPDVIGCD IAKAHLDFFD SGLERHFRID NTPAAISAWL DGLDGRGVHI VFEATGRYDR 
QLRIALETRE LPYSRVNPAR ARDFAKAIGL LAKTDAIDAR LLARMGQSLP LSTQAPDDPA
RHVLARLHTR RDQLVAMRQQ ERTRLHETEG IERDSAESHM AWLDAEVARI EMACRDVLKA
EKTLQEQEAR LRSIPGIGPV AALTLIAHMP ELGNRSAKAI AALAGLAPFN VDSGTSRGKR
HIRGGRKRIR DALYMAALTA SRMPRAFKSH ADQMKEAGKP FKVRHHCACP QIARHRKRHH
QGQNNLPTNH LTNTVASQTK SLG