Gene Smed_6214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6214 
Symbol 
ID5320516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1135226 
End bp1136371 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content62% 
IMG OID640777822 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001314754 
Protein GI150378159 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAA CGGCCTTGGA GGGTCTCGGC GAGACCCTCT CAGCAATAGA CGAAGTCGTG 
ATAGAGGCGA CGGGCAACTG CATGGCGGTG TCTCGCGTGC TGTCGCCTTT CGTGCGCAGA
GTAGTGATTG CCAATCCGCT TCAGGTCAAG GCGACCGCAC AGGCGCATGT GAAGACCGAT
AAGATCGATG CAGGCACCTT GGCCAACCTG TATGCCGCGA GCTATCTGCC GGAGATCTGG
ACGCCGGACG CCGCCACGGA GCGCATGCGG CGGTTGGAGG CGCGCCGCTA CCAGGTTGTG
CGGCATCGGA CGCGGATCAA GAACGAGGTC CATTCCATCC TGCACGCCCA CCTGATTCCG
AAGTGCCCGC ATGCCGATCT GTTCAACGGT AGAGGGCGGG ACTGGCTTAA ACGCCAGCCT
GTGCCTGAGG ATGAGCAGAT TGCCATCGAG CGGTATGTAC GCGAACTGGA CCGGCTCGGT
GAAGATCTTG CGGTCCTCGA TCGGCAAATC GCCGAAAGCA CGATGGATGA TCCCGCGGTC
AGGCGGCTGC TCACGATCAC GGGCGTCAAC TCCCCCGTGG CGACCGGGTT GCTGGCAGCC
GTCGGCGATA TCCGTCGCTT CAAGAGCCCC CAAAAGCTGG TCAGCTACGT CGGCCTCAAT
CCGCGAGTTC GCCAGTCCGG ACTTGGCGCT GCTCATCACG GACGTATCAG CAAGATTGGC
CGTAGCCATG CCCGCGCCAT GCTCGTGGAG GCGGCATGGG CGGCGGCGAA AGCGCCGGGG
CCGCTGCACG CCTTCTCCGT CCGGGTCATG GCCAGACGCG GCCATCAGGT CGCAGCCGTC
GCCATAGCTC GCAAACTAAC GGTTCTGGTC TGGCACATGT TGACCAAGGA GGCTGATTAC
CTCTGGGCTT GCCCAAGTCT CGTCGCACAC AAAATGCGCG CGATGGAATT GCAGGCCGGG
AAACCTCAGA AAAAGGGGAA CAAGCCCGGC CCTGCCTACA CCTACAACAT CAAGAAACTG
CGTGACCAGG AGATGCATCT CGCCGAGCAA GCACAAAGAA GCTACGAACG TTTCGTCGAG
ACCTGGCGTC CGCGCCCGCC AAATCAAAAG GTGCGCGGAC GCCTCAATCC GACAAGGCTC
GAATGA
 
Protein sequence
MTRTALEGLG ETLSAIDEVV IEATGNCMAV SRVLSPFVRR VVIANPLQVK ATAQAHVKTD 
KIDAGTLANL YAASYLPEIW TPDAATERMR RLEARRYQVV RHRTRIKNEV HSILHAHLIP
KCPHADLFNG RGRDWLKRQP VPEDEQIAIE RYVRELDRLG EDLAVLDRQI AESTMDDPAV
RRLLTITGVN SPVATGLLAA VGDIRRFKSP QKLVSYVGLN PRVRQSGLGA AHHGRISKIG
RSHARAMLVE AAWAAAKAPG PLHAFSVRVM ARRGHQVAAV AIARKLTVLV WHMLTKEADY
LWACPSLVAH KMRAMELQAG KPQKKGNKPG PAYTYNIKKL RDQEMHLAEQ AQRSYERFVE
TWRPRPPNQK VRGRLNPTRL E