Gene Smed_0131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0131 
Symbol 
ID5320960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp145040 
End bp146011 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID640789064 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001325826 
Protein GI150395359 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.482202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACATC CTGATGTGAT CGGATGCGAT ATTGCAAAAG CGCATCTAGA TTTTTTCGAC 
AGCGGCCTTG AGCGCCATTT CCGTATCGAC AACACTCCGG CCGCAATTTC CGCGTGGCTC
GACGGCCTTG ATGGCAGAGG CGTTCATATC GTCTTTGAGG CGACCGGGCG TTACGATCGG
CAGTTGCGCA TAGCCCTGGA GACCCGGGAG TTGCCCTATT CCCGCGTCAA TCCTGCCCGC
GCCCGCGACT TTGCCAAGGC GATCGGCCTT CTTGCCAAGA CGGATGCGAT CGATGCACGT
CTGCTTGCCC GGATGGGTCA AAGCCTGCCA CTCTCAACTC AGGCGCCTGA CGATCCCGCC
CGCCACGTGC TCGCCCGCCT TCACACGCGG CGTGACCAGC TCGTGGCCAT GCGCCAGCAA
GAGCGGACAC GCCTTCATGA GACCGAGGGG ATCGAGCGTG ACAGTGCTGA AAGCCATATG
GCTTGGCTCG ACGCGGAGGT TGCGCGCATC GAAATGGCAT GCCGTGATGT TCTGAAGGCC
GAGAAGACCT TGCAAGAACA AGAGGCAAGG CTGCGTTCCA TTCCCGGCAT CGGCCCCGTG
GCCGCATTGA CCCTGATCGC GCATATGCCA GAACTCGGCA ATCGTTCTGC CAAGGCGATT
GCAGCCCTTG CCGGTCTTGC GCCCTTCAAT GTCGACAGCG GCACGTCACG GGGAAAGCGG
CATATACGCG GCGGTCGCAA GCGGATACGT GACGCGCTCT ACATGGCGGC GCTCACAGCC
AGCCGTATGC CCCGTGCTTT TAAGTCCCAT GCTGACCAAA ATGAAGGAGG CAGGCAAGCC
CTTCAAGGTC GTCATCATTG CGCTTGCCCG CAAATTGCTC GCCATCGCAA ACGCCATCAT
CAGGGACAAA ACAACCTTCC GACGAACCAC CTGACAAACA CAGTTGCCAG TCAGTCCAAG
TCCCAGCGTT GA
 
Protein sequence
MIHPDVIGCD IAKAHLDFFD SGLERHFRID NTPAAISAWL DGLDGRGVHI VFEATGRYDR 
QLRIALETRE LPYSRVNPAR ARDFAKAIGL LAKTDAIDAR LLARMGQSLP LSTQAPDDPA
RHVLARLHTR RDQLVAMRQQ ERTRLHETEG IERDSAESHM AWLDAEVARI EMACRDVLKA
EKTLQEQEAR LRSIPGIGPV AALTLIAHMP ELGNRSAKAI AALAGLAPFN VDSGTSRGKR
HIRGGRKRIR DALYMAALTA SRMPRAFKSH ADQNEGGRQA LQGRHHCACP QIARHRKRHH
QGQNNLPTNH LTNTVASQSK SQR