Gene Smed_6340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6340 
Symbol 
ID5320643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009622 
Strand
Start bp7027 
End bp8619 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content61% 
IMG OID640777934 
Producttransposase IS66 
Protein accessionYP_001314866 
Protein GI150378272 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.360304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCGAA CCGGCGAACC GAGCGTTGCA GAGCTGATGG CGCAGTTGGC GGCCAATGCT 
GCCGAAATCG CCGCGCTGAA AGCCGAGAAG GAAACGCTCT CGCAGCGGGT CGTCAAGCTG
GAGGAAGAGC TGGCGCTTGC ACGGCTGCAT CGCTTTGCGC CGCGCAGCGA AAAGCACGTT
GATCGCATCT TCAACGAAGC CGAAGAGGCT GCGGACGAGG ATGGAGCGGA CAGCCACGAG
AGCGAGGTCA TCGACATTCC GGACACAGGT CTGCCGCCTG TCGAAAGCAC GACGGGTCAG
AAGCGCGGCC GCAGACCACT ACCGCAGAAC CTGCCGCGCG AGCGCGTCGA GTATGACCTT
CCCGACGGTC AGAAGGCTTG TCCTTGCTGC CGTGGGCAGA TGCACCGCAT GGGCGAGGCC
GTTACCGAGC AACTTCATAT CGAAGTGAAG GCGAAGGTCT TACAGAATGT GCGGTTCAAA
TATGCATGCC GCCATTGCGA TCGCACCGGG ATCAACACGC CTGTCGTGAC CGCGCCGATG
CCGGTGCAGC CCTTGCCGGG CAGCATCGCC ACGGCCTCGA CACTGGCCTT CGCACTCGTT
CACAAGTACG TCGACGGCAC ACCGCTCTAC CGTCTGGCGC AGACCTTCGA ACGCGCCGGT
GTTCCTGTCA GCCGTGGTGC TCTCGGTCAC TGGGTGATCG GTTCGAGCGA GAAGCATCTG
CATCGCATCT ATGACGCGCT GAAACTGCGG CTCAGGTCGC AATCCCTCAT CCATGGCGAC
GAGACGACCG TTCAGGTCCT GAAGGAAAAG GGCAAAGAGG CCACCAGCAC ATCGTATATG
TGGGCCTATC GGAGCGGCGA CGACAGTGAC GAGCCGATCG TGCTTCTCGA TTATCAGCCG
GGCCGCGGCC AGATTCACCC CCAGACCTTC CTCGGCGATT ATCGCGGCAT ATTGATGAGC
GATGGCTACA CCGCCTGGCG CACGCTGGAT GGCGCGATCC ATATCGGATG CATGGCCCAT
TCGAGGCGAC GCTTCGTCGA TGCCCTCAAA GCCAGAAAGA AAGGCGGCGG TCCGCCCGAG
CAGGCGCTCC GGTTCTTCGA ACAACTCTAT CGGGTTGAAA GGCAAGCACG AGACAAGAAG
CCTGATGCCG GCGAAACGAA GGCCGACAGC GTTCGCCGAT TCCGACAGCA ACACAGCATA
CCCGTCCTGA CCGCTCTCAA GGTATGGCTC GACGAAATCG CGCCGAAGGT TATGCCGGAC
ACCAAGCTGG GCGATGCTGT CTCCTACACC CTGAACCAGT GGGATTATCT GACACGCTAC
AAAGACGATG GCAGGATGCC GATCGACAAC AACATCTTGG AGCGCGACAT CAGAGTTTTT
GCGACCGGCA GAAAGAGTTG GCTGTTCAGC GATAGCACCG ACGGAGCCAA GGCCAGCGCC
GTCATCTACA GCCTCATGCT GACCTGCCGT GCCTGCGGCG TCGAGCCGTT GACGTGGTTG
CGCCACGTCC TTACCGAATT GCCTCAACGC GCGGGCGATG CTGATATCGA CGACCTGTTG
CCCTTCGACC TCACACAGAC TGCCACCGCC TGA
 
Protein sequence
MNRTGEPSVA ELMAQLAANA AEIAALKAEK ETLSQRVVKL EEELALARLH RFAPRSEKHV 
DRIFNEAEEA ADEDGADSHE SEVIDIPDTG LPPVESTTGQ KRGRRPLPQN LPRERVEYDL
PDGQKACPCC RGQMHRMGEA VTEQLHIEVK AKVLQNVRFK YACRHCDRTG INTPVVTAPM
PVQPLPGSIA TASTLAFALV HKYVDGTPLY RLAQTFERAG VPVSRGALGH WVIGSSEKHL
HRIYDALKLR LRSQSLIHGD ETTVQVLKEK GKEATSTSYM WAYRSGDDSD EPIVLLDYQP
GRGQIHPQTF LGDYRGILMS DGYTAWRTLD GAIHIGCMAH SRRRFVDALK ARKKGGGPPE
QALRFFEQLY RVERQARDKK PDAGETKADS VRRFRQQHSI PVLTALKVWL DEIAPKVMPD
TKLGDAVSYT LNQWDYLTRY KDDGRMPIDN NILERDIRVF ATGRKSWLFS DSTDGAKASA
VIYSLMLTCR ACGVEPLTWL RHVLTELPQR AGDADIDDLL PFDLTQTATA