Gene Smed_5451 details

Gene Information

Locus tag: Smed_5451
Symbol:
ID: 5319753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 416697
End bp: 418307
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 62%
IMG OID: 640777213
Product: transposase IS204/IS1001/IS1096/IS1165 family protein
Protein accession: YP_001314145
Protein GI: 150377550
COG category: [L] Replication, recombination and repair
COG ID: [COG3464] Transposase and inactivated derivatives
TIGRFAM ID:
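
The coordinates and lengths listed above agree with each other, and this can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(416697, 418307)
print(length)                  # 1611
print(protein_length(length))  # 536
```

Both values match the Gene Length and Protein Length fields of this record.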


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.268297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACGA AAACGAAATG GTCGCCCGGC CCGGGGGCAA AAGTTCTGGG TATCTCGCTC 
GATGATGATG ACGGTTGGGT TGTTTCCGCC GCCGGACCAG TTTTCGGCAT TTGCCCTGAC
TGCGGACGGC GGACCCGACA TCGGCATGGC TGGTCCAACC GTAGCCTCCA AGATCTGCCG
GTCCAGGGCA AAACCGTAAC GGTGAAGCTT CGGTTGAGCC GCTGGCGGTG CGCGCATCAG
AAATGTGAAC GACAAACGTT CACCGACCGA CTGCCGACGA TTGCTTCCCC TTATGCGCGC
CGGACAAGGA GGGTCTCCGA GATTGTCGGT CTGCTCGGCC ATAGCGCAGG CGGCCGGCCC
GGCGAGCGCT TGATCCGACG GCTCGGCATG CCGGTCAGCG ACGACACGAT CCTGCGGCAG
CTGAAGCGGG ATGCCGCGCT CGTTTCCTGC GATGCCACGA TCCGGGTCGT CGGCATAGAT
GATTGGAGCT GGCGGCGATC GTGGCGCTAC GGCACGATGA TCGTCGACCT GGAGCGCCGT
TCGGTTGTCG ATATTCTTGA GGACCGGAGC GTAGCGAGTG TCGCGCGATG GCTGGAGCAG
CATCCTTCTG TCGAGATCGT GAGCCGAGAC CGAGGCGGGC TGTACGCACA AGCTGCTCGC
GAGGGTGCGC CGCAAGCTCG TCAAGTCGCT GATCGGTTCC ATCTCATGCA GAATCTGCGA
GTCGCAATCG AGGAGCAGAT GAGCCTTGGC GGCCGCGCCA CCGGACGAGC ATTGCTGCCG
GATAAATGGA TCGGGAGCGC GCAAATCGAT CTGCTTCAGG ATGATCCGCA CGTTGACGCA
AGGCACCGGC GCCGGGGGCG TCACGCTCAT CGAGAATCAC GGCAGGCGGT GTTCGATACA
GTGCACGCTT TGAATGAGGA AGGTTTGTCC TGTTCGGAGA TCGCACGTCG CACCGGCTAC
GGGCGGCGCA GCATCGCGAA ATGGCTGACT TTCGAAACGC CACCCGACCG ACAGAAGGCG
GCGTTGAAGC CGACATCGCC CCTGTACTTT GAGGCGTTTC TCGCCGTGTG CTGGAAAGAT
GGCAATCGCT GCGGACGGCA TCTGTTCTAC GATATCAAAC AGCGCGGCTA CACGGGCAGT
TTCTCGAATC TCGAGCGGCT TCTCGCAAGC TGGCGCCGCT CGGAGAGATC GGTCGAGGGC
AGCGCGTCGT CGGCTCCGAT CATCTCGCAT CAACGGGGCC GCGGTGTTGT CCCCATACGC
GATCCGGAGA CCGGCCATGT GATCTCACCG GTGGTCGCAG CTGCCCTCTG CATCAAGCCG
CGAAGCATGC TGACGATCAC TCAGGCGAGA AAGGTCGATG CCTTGAAACA GGGCGCGCCT
GAGTTCGCTT TGATGCGCAG CCTTGGCATG CGCTTTCGCG GAATCTTTCG CAGCGGCGAT
CCGGGCAAGC TTAACAGCTG GATTGACGAT GCCGTTAATT CCGGTTTGGT CGCAATTGAG
CGGTTCGCAC GCGTCCTGCA CCGTGACATC GGCGCCGTCC GCAACGCCGT GGAACTCCCC
TGGAGCAACG GCCAGGCGGA AGGCCAGATC AACCGTCTGA AGACAATTTA G
 
Protein sequence
MQTKTKWSPG PGAKVLGISL DDDDGWVVSA AGPVFGICPD CGRRTRHRHG WSNRSLQDLP 
VQGKTVTVKL RLSRWRCAHQ KCERQTFTDR LPTIASPYAR RTRRVSEIVG LLGHSAGGRP
GERLIRRLGM PVSDDTILRQ LKRDAALVSC DATIRVVGID DWSWRRSWRY GTMIVDLERR
SVVDILEDRS VASVARWLEQ HPSVEIVSRD RGGLYAQAAR EGAPQARQVA DRFHLMQNLR
VAIEEQMSLG GRATGRALLP DKWIGSAQID LLQDDPHVDA RHRRRGRHAH RESRQAVFDT
VHALNEEGLS CSEIARRTGY GRRSIAKWLT FETPPDRQKA ALKPTSPLYF EAFLAVCWKD
GNRCGRHLFY DIKQRGYTGS FSNLERLLAS WRRSERSVEG SASSAPIISH QRGRGVVPIR
DPETGHVISP VVAAALCIKP RSMLTITQAR KVDALKQGAP EFALMRSLGM RFRGIFRSGD
PGKLNSWIDD AVNSGLVAIE RFARVLHRDI GAVRNAVELP WSNGQAEGQI NRLKTI
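
The relationship between the two sequences can be reproduced with a short script. This is a minimal sketch in plain Python, not the tool used to produce the record. It builds the codon-to-amino-acid mapping that NCBI translation tables 1 and 11 share, strips the spaces used to group the sequence into blocks of ten, and translates up to the first stop codon. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGCAAACGA AAACGAAATG GTCGCCCGGC"))  # MQTKTKWSPG
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and the GC content field reported in this record.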