Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2193 |
Symbol | |
ID | 8015654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2192104 |
End bp | 2193474 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644824779 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002976009 |
Protein GI | 241204913 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.141725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.182946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCA AAGCCAATTA CACGATTGTC GGTTTTTTCA CGGTGCTGGT GATCGCGGCT GCCTTCGGCT TCGTCTACTG GATGGCCGAA TATGGCCGCG GCGGCCCGAT GACGGAACTG ATCGTGCGTA TTCCGGGCTC GGCCAACGGC CTCAGCGTCG GCTCGCCGGT GCGCTTCAAC GGCATCCAGA TCGGCTCGGT GCAAACGCTG TCGATCGATG CCGACGATCC GCAATATTCG CTGGCCTTCA CCCAGGTGCG CACCGATGCG CCGATCTACC CCTCGACCAA GGCCGCCCTT GAAATCCAGG GTTTGACCGG GGCTGCCTAT ATCGAACTCT CGGGCGGCCG CAAGGGCGAG GAAAGCATCC TCCAGCATGC GATCGATAAC GGCAAACGCG CCGTCATCGT TGCCGACCAG TCGAGCGTCA CCAATCTTCT GGCGACCGCC GACAAGATCC TCGATCGCGC CAACGACGCG GTCGGCGAGC TTCAGGGATT CATCGAAGAT TCGCGCGGGC CGTTGACCGA GACTTTCAAG AATGCCGAGA CGTTTTCGGA TGCGCTTGCC AAGAATTCCG GCAATATTGA CGCCTTCCTG CAGAGCGTGG GTGAGCTCTC CAATACGGTG AAGGCGGTCT CCAGCCGTGT CGATTCGACG CTTCAAGCCG TCGAGTCGCT GGTCAAGGCA GTCGATGCGC AGAAGATCGA TAACATCGTC TCCAATGCCG AGAAGATTAC CGCCAATGTC GCCGATGCCT CAGGCGACCT CAAGGGCGCG ATCCAGAAGT TCGACCAGAC GGCCACCACC TTCAATGATT TCGGCAAGCA GGCGCAGGCG ACACTCGACC GCGTCGATAC GCTCGTTGCC CAGATCGACC CGGCGAAGGT GAAGGGCTCT GTCGACGATA TCTCGCAGGC GACCAAGGAT GCGCGCGCCG CCGTCGCCTC GATCCGCGAG GTCGCCAACA CGGTTTCGGC GCGTCAGAAA GATATCGACC AGACGATCCA GGACGTGTCG CAGCTTTCCA ACAAGCTGAA TTCGGCCTCG ACCCGCATCG ACGGCATTCT CATCAAGGTC GATGCGCTGC TCGGCACCGA CAATACGCAA TCGCTGTTCA CTGAAGCGCG CGACACGCTG GAATCCTTCA AGAAGGTCGC CGACAATCTC AATTCGCGGA TCGGGCCGAT CGCCGACAAT CTGCAGAAAT TCTCGAGCGG CGGCTTGCGC GACGTGCAGA CTCTCGTCAA CGACATGCGC GGGACCGTGA GCAATCTGAA CGATACGATC ACCAACTTCG ACCGCAATCC GCAACGCCTG ATCTTCGGCG GGGACACGGT GAAGCAATAT GACGGCCGGA CGCGGCGTTA A
|
Protein sequence | METKANYTIV GFFTVLVIAA AFGFVYWMAE YGRGGPMTEL IVRIPGSANG LSVGSPVRFN GIQIGSVQTL SIDADDPQYS LAFTQVRTDA PIYPSTKAAL EIQGLTGAAY IELSGGRKGE ESILQHAIDN GKRAVIVADQ SSVTNLLATA DKILDRANDA VGELQGFIED SRGPLTETFK NAETFSDALA KNSGNIDAFL QSVGELSNTV KAVSSRVDST LQAVESLVKA VDAQKIDNIV SNAEKITANV ADASGDLKGA IQKFDQTATT FNDFGKQAQA TLDRVDTLVA QIDPAKVKGS VDDISQATKD ARAAVASIRE VANTVSARQK DIDQTIQDVS QLSNKLNSAS TRIDGILIKV DALLGTDNTQ SLFTEARDTL ESFKKVADNL NSRIGPIADN LQKFSSGGLR DVQTLVNDMR GTVSNLNDTI TNFDRNPQRL IFGGDTVKQY DGRTRR
|
| |