Gene Smed_6199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6199 
Symbol 
ID5320501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1122133 
End bp1123782 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content60% 
IMG OID640777815 
Producttransposase IS66 
Protein accessionYP_001314747 
Protein GI150378152 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.07824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCGAGG CTGAGATCGC GCGGTTGGAG GCGGTTGAGA AGAGCGCCAA CGAGCGGATT 
GCCAACCTGA CCTTGATCAT GAAGGTTTTG CAGCGCACGC AAAATGGCAA GCGCTCTGAG
CGGCTCCGCC TCGAGGTCAA TGACGAGCAG GTGTCCTTTG CCTTCGAAGA GGTCGAAACC
GGCCTATCGG CAATCCGCAG CGAACTCGAT CGCGCAGCCA AGGACAAGCC GAAGCGAGCG
CCGCGTCCGC GCAAGGGTTT TGCCGCCCAT CTCGAGCGCA TCGAAGAGGT CATCGAGCCG
GAAATCCCGG CCGGGTGCGA GGGGCTGGCA AAGGTTCTGA TCGGAGAGGA CCGCTCCGAG
CGGCTGGACG TCGTGCCGCC GAAGTTCAGG GTTATCGTGA CGCGTCGCCC CAAATACGCT
TTCCGGGGCA GCGACGGCGT GGTCCAGGCC CTGGCGCCGG CACACATCAT CGAAGGCGGC
CTGCCGACGG AACGGCTGCT CGCCTATATC GCCGTTTCCA AATACGCCGA TGGCCTTCCT
CTCTATCGGC AGGAAGCGAT CTATTTGCGT GATGGCGTCG AGATCAGCCG ATCGCTGATG
GCCCAATGGA TGGGGCATCT GCGCTTCGAA CTGCAGATGC TGGCCGATTA TATTCTGGAG
AGGGTCAAGG AGGGCGAAAG GATCTTTGCC GACGAGACGA CCCTACCCAC TCTTGCGCCC
GGTTCGGGCA AAACCACCAA GGCCTGGCTT TGGGCTTACG CACGCGACGA CCGCCCCTAT
GGCGGAACCA GTCCGCCGAT GGTGGCCTAT CGATTTGAAA ACAGCAGAGG TGCGGATTGC
GTGACGCGTC ATCTCTCCGG ATTCACCGGC ATCCTGCAAG TGGATGGCTA CTCGGCCTAT
ACTAATCTCG CCAAGACGCG GGCCAAAACC GGCAGCAACG AAACGGTCCA GCTTGCAGGA
TGTTGGGCAC ATCTACGGCG CAAGTTTTAT GACCTGCACA TCAGTGGAGT CTCGCAGGCC
GCCACAGACA CTGTCCTGGC AATGACCGAG CTCTGGCGCA TCGAGGATGA AGTTCGCGGT
AAGGATGCCG ACAGCCGCGC GGCCCGGCGC CAGGAGAAAT CCTCGACCAC CGTCGCCAGC
CTCTTCGAGC TCTGGGAAAA GGAACTGGGC AAAGTCTCGG GAAAATCCAA AACCGCCGAG
GCGATCCGCT ACGCGCTCAC CCGGCGCGAG GCGCTGGAGC GCTTTCTGAC GGACGGTCGC
ATCGAAATCG ACTCCAACAT CGTCGAACGG GCGATCAGGC CCCAAACGAT TACGAGAAAG
AATAGCCTAT TCGCCGGCAG CGAGGGCGGT GGACGAACTT GGGCGACGGT GGCCACCTTG
TTGCAGACGG CATTATGCCG CGCGCGGCAT AAGGCGGTTT ATGCCGACCG GCGAACTATG
CCGAGCACCT GCTCCAACAT ACCATTTGAA ACGCGCTTTT CTCGCCGTTT CCGCTGGAGA
TTTGTTGTAG GCACTCGGCA TAGTTCTCAT GCTCAGAGGG CATTCAGGAA TTCCAGAAGC
CGGTCGCTTG GTCGAAATCG CTCGGCCTTG GCGCGTTCGT ACGGCTTCAG CTTTGCGAGC
GCGGATTCCT TCAGTTCAAG ATGTGCATGA
 
Protein sequence
MAEAEIARLE AVEKSANERI ANLTLIMKVL QRTQNGKRSE RLRLEVNDEQ VSFAFEEVET 
GLSAIRSELD RAAKDKPKRA PRPRKGFAAH LERIEEVIEP EIPAGCEGLA KVLIGEDRSE
RLDVVPPKFR VIVTRRPKYA FRGSDGVVQA LAPAHIIEGG LPTERLLAYI AVSKYADGLP
LYRQEAIYLR DGVEISRSLM AQWMGHLRFE LQMLADYILE RVKEGERIFA DETTLPTLAP
GSGKTTKAWL WAYARDDRPY GGTSPPMVAY RFENSRGADC VTRHLSGFTG ILQVDGYSAY
TNLAKTRAKT GSNETVQLAG CWAHLRRKFY DLHISGVSQA ATDTVLAMTE LWRIEDEVRG
KDADSRAARR QEKSSTTVAS LFELWEKELG KVSGKSKTAE AIRYALTRRE ALERFLTDGR
IEIDSNIVER AIRPQTITRK NSLFAGSEGG GRTWATVATL LQTALCRARH KAVYADRRTM
PSTCSNIPFE TRFSRRFRWR FVVGTRHSSH AQRAFRNSRS RSLGRNRSAL ARSYGFSFAS
ADSFSSRCA