Gene Rleg2_1982 details


Gene Information

Locus tag: Rleg2_1982
Symbol:
ID: 6980721
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2034370
End bp: 2035740
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 60%
IMG OID: 643396705
Product: Mammalian cell entry related domain protein
Protein accession: YP_002281493
Protein GI: 209549576
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
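
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2034370, 2035740)
print(length_bp)                  # 1371
print(protein_length(length_bp))  # 456
```

Both values agree with the Gene Length and Protein Length fields of the record.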


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0362661
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.670067
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAACCA AAGCCAATTA CACGATTGTC GGTTTTTTCA CGGTGCTGGT GATCGCGGCG 
GCCTTCGGCT TCGTCTACTG GATGGCCGAA TATGGCCGCG GCGGCCCGAT GACCGAGCTG
ATCGTGCGTA TTCCGGGTTC GGCCAACGGC CTCAGCGTCG GCTCGCCGGT GCGCTTCAAC
GGCATTCAGA TCGGCTCGGT GCAGACCCTG TCGATCGATG CCGACGATCC GCAATATTCA
CTGGCGTTCA CCCAGGTGCG CACCGATGCG CCGATCTATC CGTCGACCAA GGCCGCTCTT
GAAATCCAGG GCCTCACGGG GGCCGCCTAT ATCGAACTGT CGGGCGGCCG CAAGGGCGAG
GAAAGCATCC TCCAGCATGC GATCGACAAC GGCAAACGCG CCGTCATCGT CGCCGATCAA
TCGAGCGTCA CCAATCTTCT GGCGACAGCC GACAAGATCC TCGATCGCGC CAACGACGCG
GTCGGTGAAC TTCAGGGTTT CATCGAGGAT TCACGCGCGC CCCTGACCCA GACGTTCAAG
AATGCCGAGA CGTTCTCGGA TGCGCTTGCC AAGAATTCCG GCAATATCGA CGCCTTCCTG
CAGAGCGTGG GTGAGCTTTC CAATACGGTG AAGGCGGTCT CCGGTCGTGT CGATTCGACG
CTTCAAGCTG TGGAATCGCT GGTCAAAGCG GTTGACGCGA AGAAGATCGA CAACATCGTT
TCCAACGCCG ACAAGATCAC CGCCAATGTC GCCGATGCCT CGGGCGACCT TAAGGGGGCG
ATCCAGAAGT TCGACCAGAC GGCCACCACC TTCAACGATT TCGGCAAACA GGCACAGGCG
ACGCTCGACC GCGTCGACAC GCTTGTCGCC CAGATCGACC CCGCCAAGGT GAAGGGCTCG
GTCGACGACA TCGCGCAGGC GACCAAGGAT GCGCGTGCCG CCGTTGCCTC GATCCGCGAC
GTCGCCAATA CGGTTTCGGG GCGTCAGAAA GATATCGACC AGACCATCCA GGACGTCTCG
CAGCTTGCCA ACAAGCTGAA TTCGGCCTCG ACCCGGATCG ACGGCATTCT CATCAAAGTC
GATGCCTTGC TGGGAACCGA CAACACGCAA TCGCTGTTTA CCGAGGCGCG CGATACGCTG
GAATCCTTCA AGAAGGTCGC CGACAACCTG AATGCGCGCA TCGGGCCGAT CGCCGACAAT
CTGCAGAAAT TCTCAAGCGG CGGTTTGCGC GACGTGCAGA CGCTCGTCAA CGACATGCGG
GGAACCGTCA ACAATCTGAA CGACACGATC ACCAACTTCG ACCGCAATCC GCAACGCCTG
ATCTTCGGCG GGGACACGGT CAAGCAATAT GACGGCCGGA CGCGGCGTTA A
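
The gene sequence is printed in space-separated groups of 10 bases, 60 bases per line, so it has to be cleaned before any computation. The following is a minimal sketch of that cleaning step plus a GC-fraction helper. The function names are hypothetical, and only the first line of the sequence is used as input; the GC content of the full gene reported in the record (60%) is not recomputed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGGAAACCA AAGCCAATTA CACGATTGTC GGTTTTTTCA CGGTGCTGGT GATCGCGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(gc_fraction(seq))
```

Passing the whole sequence block through `clean_sequence` in the same way would give a string whose length can be compared with the Gene Length field.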
 
Protein sequence
METKANYTIV GFFTVLVIAA AFGFVYWMAE YGRGGPMTEL IVRIPGSANG LSVGSPVRFN 
GIQIGSVQTL SIDADDPQYS LAFTQVRTDA PIYPSTKAAL EIQGLTGAAY IELSGGRKGE
ESILQHAIDN GKRAVIVADQ SSVTNLLATA DKILDRANDA VGELQGFIED SRAPLTQTFK
NAETFSDALA KNSGNIDAFL QSVGELSNTV KAVSGRVDST LQAVESLVKA VDAKKIDNIV
SNADKITANV ADASGDLKGA IQKFDQTATT FNDFGKQAQA TLDRVDTLVA QIDPAKVKGS
VDDIAQATKD ARAAVASIRD VANTVSGRQK DIDQTIQDVS QLANKLNSAS TRIDGILIKV
DALLGTDNTQ SLFTEARDTL ESFKKVADNL NARIGPIADN LQKFSSGGLR DVQTLVNDMR
GTVNNLNDTI TNFDRNPQRL IFGGDTVKQY DGRTRR
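
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate_cds` are hypothetical, and only the first 15 and last 12 bases of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA fragment, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()  # the stop codon is not part of the protein
    return "".join(protein)


print(translate_cds("ATGGAAACCA AAGCC"))  # METKA
print(translate_cds("ACGCGGCGTT AA"))     # TRR
```

The two outputs match the first five and last three residues of the protein sequence shown above.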