Gene Rleg_4905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4905 
Symbol 
ID8007386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp283792 
End bp285447 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content63% 
IMG OID644821825 
Producttransposase IS66 
Protein accessionYP_002973085 
Protein GI241113250 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CGACTGATGA GCTTCCGGAC GACCTTGCCA GTGCGCTTGC ACTGCTGGCC 
GAGGAGCGTG CCCGGCGTAT CACTGCCGAG GCAGAAGCTG CGATCGCCAA GGCGCAAGCC
GCCAGCGCAA AGGCGCTCGT GTCGCATTCC GAAGCGCTGA TCGCGCGGTT GAAGCTGGAG
ATCGAGAAGG TTCGCCGCGA ACTTTACGGC AGCCGGTCAG AACGCAAGGC GCGACTCCTC
GAACAGATGG AACTGCAGCT CGAGGAGCTG GAAGCTGACG CTGGCGAAGA CGAACTGGCG
GCAGAGGTTG CGGCCAAAGC CTCGACGGTC AGGGCTTTCG AGCGCAAGCG TCCATCACGG
AAACCATTCC CTGAACATCT GCCGCGCGAG CGTGTCGTTA TCGCGGCCCC GGCGAGCTGC
CCTTGTTGCG GTTCGGGCAA GCTGTCGAAG CTGGGCGAAG ACATCACCGA GACCCTGGAG
GTCATCCCGC GTCAGTGGAA GGTGATCCAA ACGGTGCGGG AGAAGTTCAC CTGCCGCGAA
TGCGAGAAGA TCACCCAGCC ACCAGCTCCT TTCCATGTGA CGCCGCGGGG CTTTGCCGGG
CCCAGCCTTC TGGCGATGAT ACTGTTTGAG AAGTTCGCGC AGCATCAACC GCTGAACCGC
CAGAGCGAGC GCTATGGCCG TGAGGGTATC GACCTCAGCC TGTCGACGCT GGCAGATCAG
GTCGGCGCTT GCGCCGCGGC GCTGAAGCCA CTCCATGCGT TGATCGAAGC GCATGTCCTG
GCTGCCGAGC GGCTGCATGG TGACGACACC ACAGTGCCGA TCCTGGCGAA GGGAAAGACC
GATACGGGTC GCATCTGGAC CTATGTCCGA GATGACCGGC CGTTCGGCGG GCAATCGCCG
CCGGCGGCTC TCTACTATGC TTCGCGAGAT CGACGACAAG AGCATCCCGA GCGCCACTTG
AAGACCTTCA CCGGCATTCT GCAGGCTGAT GCCTATGGCG GCTACAATCC GCTGTTCAAG
GTAGACCGCG ATCCGGGGCC GCTGACGCAG GCGCTCTGCT GGTCGCACGC GAGGCGCAAG
TTCTTCGTGC TGGCCGACAT CGCCACGAAT GCCAAACGCG GCAGCCGCGC CGCGCCGATC
TCGCCTATGG CGCTGGAAGC CGTCAAACGG ATCGATGCGC TGTTCGACAT CGAGCGTGAG
ATCAACGGAC TTGCCGCCGA TCAACGCCTG GAGCACCGTC GCAAGGGCAG CCTGCCGCTT
GTCGGCGAAC TGCACCGCTG GCTTCAAACC GAGCGGGCAA AACTGTCGCG CAGTTCTCCC
GTCGCCGAGC CGATCGACTA CATGCTGAAG CGCTGGAACG GCTTCGAGTC TTTCCTCGAC
GACGGCCGGA TTTGTCTCAC GAACAATGCC GCCGAGCGAG CGCTCAGGGG TTTTGCACTT
GGAAGGAAGT CGTGGCTCTT CGCCGGATCG GATCGCGGCG CTGATCGTGC CGCCTTCATG
GTCACGCTGA TCATGAGTGC CAAGCTAAAC GACATCGATC CGCAGGCCTG GCTTGCTGAC
GTCCTGGCCC GCATCGCCGA CACGCCAATC AGTAAGCTGG AGCAATTGCT TCCGTGGAAT
TGGCAGCCGC ACGGACTGAA CGCTCAAGCA GCCTAA
 
Protein sequence
MSDATDELPD DLASALALLA EERARRITAE AEAAIAKAQA ASAKALVSHS EALIARLKLE 
IEKVRRELYG SRSERKARLL EQMELQLEEL EADAGEDELA AEVAAKASTV RAFERKRPSR
KPFPEHLPRE RVVIAAPASC PCCGSGKLSK LGEDITETLE VIPRQWKVIQ TVREKFTCRE
CEKITQPPAP FHVTPRGFAG PSLLAMILFE KFAQHQPLNR QSERYGREGI DLSLSTLADQ
VGACAAALKP LHALIEAHVL AAERLHGDDT TVPILAKGKT DTGRIWTYVR DDRPFGGQSP
PAALYYASRD RRQEHPERHL KTFTGILQAD AYGGYNPLFK VDRDPGPLTQ ALCWSHARRK
FFVLADIATN AKRGSRAAPI SPMALEAVKR IDALFDIERE INGLAADQRL EHRRKGSLPL
VGELHRWLQT ERAKLSRSSP VAEPIDYMLK RWNGFESFLD DGRICLTNNA AERALRGFAL
GRKSWLFAGS DRGADRAAFM VTLIMSAKLN DIDPQAWLAD VLARIADTPI SKLEQLLPWN
WQPHGLNAQA A