Gene Rleg2_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2900 
Symbol 
ID6981644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp2953140 
End bp2954285 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content64% 
IMG OID643397610 
Producttransposase IS4 family protein 
Protein accessionYP_002282394 
Protein GI209550477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00460334 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.33461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCATG ACAATAGCGT CTTTCATGAT GTGCTGAAGC GGATTCCGTG GGCGGTCTTC 
GAAAGACTTG TGGATGAGCA TCAGGCCGAC AAGCATGTTC GGCGGCTGTC GACGAAGAGC
CAGTTGATCG CCCTGCTTTA CGGCCAGCTT GCCGGTGCCG TCAGCCTGCG TGAGATCGTC
GGCTCCCTGG AAAGCCATAG CGCCCGCCTT TACCATCTCG GCGCTCGTCC GGTGTCGCGC
TCGACGTTCG CCGATGCTAA CGGCCTGCGT CCGAGCACGG TTTTTGCCGA GTTGTTCGCG
CAGATGGTGG CCCGCGCCGG GCGCGGCCTC AAGCGGGCCA TCGGCGAGGC GGTCTATCTG
ATCGACGGCA GCAGTCTGAG CCTTGCCGGG GCGGGATCGC AGTGGGCCCG CTTTTCCGAT
CAGGCCTGTG GTGCCAAGAT GCACGTCGTC TACGATGCCA ATGCCGAGCG ACCGATCTAT
GCGGCCGTCA CCCCGGCCAA TGTCAACGAC ATCACCGCCG CCAAGGAGAT GCCGATCGAG
GCGGGCGCCA CCTATGTCTT CGATCTCGGC TATTACGACT TCGGCTGGTG GGCGAAGCTC
AATGCCGCCG GCTGCCGCAT CGTCAGCCGC CTCAAATCCC ACACGAAACT GACGGTGAGC
GCCGAGCAGG CGGCAAATGC GGATGCCGGC ATCCTGTTCG ACCGCATCGG CCTGTTGCCG
CAGCGCCAGG CCAAGAGCCG CCGCAACCCG ATGAATCGGC CGGTGCGCGA GATCGGCGTG
CGGATCGAAA CCGGCAAGGT GCTGCGCATC TTCTCCAACG ATCTTACCGC CCCGGCCGAG
GAGATCGCCG CGCTTTACAA GCGCCGCTGG GCGATCGAGC TGTTCTTCCG CTGGGTCAAA
CAGACGCTGA AGATCCGCCA TTTCCTCGGC AATAGCGAAA ATGCCGTGCG CATCCAGGTC
GCTGTCGCCC TGATCGCCTA TTTGCTGCTG CAGATGGCAA AGGCTGACCA GGCCACCGTC
ACGAGCCCGC TGGCCTTCGC CCGCCTGGTG CGCACCAACC TGATGCACCG CAAAAGGATC
GACCGCCTCC TAAAACCACG CCACAGCCCT CCCGGAAATC CCGGCCAGAT GAGCCTCCAA
TGGTGA
 
Protein sequence
MRHDNSVFHD VLKRIPWAVF ERLVDEHQAD KHVRRLSTKS QLIALLYGQL AGAVSLREIV 
GSLESHSARL YHLGARPVSR STFADANGLR PSTVFAELFA QMVARAGRGL KRAIGEAVYL
IDGSSLSLAG AGSQWARFSD QACGAKMHVV YDANAERPIY AAVTPANVND ITAAKEMPIE
AGATYVFDLG YYDFGWWAKL NAAGCRIVSR LKSHTKLTVS AEQAANADAG ILFDRIGLLP
QRQAKSRRNP MNRPVREIGV RIETGKVLRI FSNDLTAPAE EIAALYKRRW AIELFFRWVK
QTLKIRHFLG NSENAVRIQV AVALIAYLLL QMAKADQATV TSPLAFARLV RTNLMHRKRI
DRLLKPRHSP PGNPGQMSLQ W