Gene Information |
Locus tag | Rleg2_1982 |
Symbol | |
ID | 6980721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2034370 |
End bp | 2035740 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643396705 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002281493 |
Protein GI | 209549576 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
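The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(2034370, 2035740)
print(length, protein_length(length))  # prints: 1371 456
```

Under these assumptions, the computed values agree with the Gene Length (1371 bp) and Protein Length (456 aa) fields.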
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0362661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.670067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACCA AAGCCAATTA CACGATTGTC GGTTTTTTCA CGGTGCTGGT GATCGCGGCG GCCTTCGGCT TCGTCTACTG GATGGCCGAA TATGGCCGCG GCGGCCCGAT GACCGAGCTG ATCGTGCGTA TTCCGGGTTC GGCCAACGGC CTCAGCGTCG GCTCGCCGGT GCGCTTCAAC GGCATTCAGA TCGGCTCGGT GCAGACCCTG TCGATCGATG CCGACGATCC GCAATATTCA CTGGCGTTCA CCCAGGTGCG CACCGATGCG CCGATCTATC CGTCGACCAA GGCCGCTCTT GAAATCCAGG GCCTCACGGG GGCCGCCTAT ATCGAACTGT CGGGCGGCCG CAAGGGCGAG GAAAGCATCC TCCAGCATGC GATCGACAAC GGCAAACGCG CCGTCATCGT CGCCGATCAA TCGAGCGTCA CCAATCTTCT GGCGACAGCC GACAAGATCC TCGATCGCGC CAACGACGCG GTCGGTGAAC TTCAGGGTTT CATCGAGGAT TCACGCGCGC CCCTGACCCA GACGTTCAAG AATGCCGAGA CGTTCTCGGA TGCGCTTGCC AAGAATTCCG GCAATATCGA CGCCTTCCTG CAGAGCGTGG GTGAGCTTTC CAATACGGTG AAGGCGGTCT CCGGTCGTGT CGATTCGACG CTTCAAGCTG TGGAATCGCT GGTCAAAGCG GTTGACGCGA AGAAGATCGA CAACATCGTT TCCAACGCCG ACAAGATCAC CGCCAATGTC GCCGATGCCT CGGGCGACCT TAAGGGGGCG ATCCAGAAGT TCGACCAGAC GGCCACCACC TTCAACGATT TCGGCAAACA GGCACAGGCG ACGCTCGACC GCGTCGACAC GCTTGTCGCC CAGATCGACC CCGCCAAGGT GAAGGGCTCG GTCGACGACA TCGCGCAGGC GACCAAGGAT GCGCGTGCCG CCGTTGCCTC GATCCGCGAC GTCGCCAATA CGGTTTCGGG GCGTCAGAAA GATATCGACC AGACCATCCA GGACGTCTCG CAGCTTGCCA ACAAGCTGAA TTCGGCCTCG ACCCGGATCG ACGGCATTCT CATCAAAGTC GATGCCTTGC TGGGAACCGA CAACACGCAA TCGCTGTTTA CCGAGGCGCG CGATACGCTG GAATCCTTCA AGAAGGTCGC CGACAACCTG AATGCGCGCA TCGGGCCGAT CGCCGACAAT CTGCAGAAAT TCTCAAGCGG CGGTTTGCGC GACGTGCAGA CGCTCGTCAA CGACATGCGG GGAACCGTCA ACAATCTGAA CGACACGATC ACCAACTTCG ACCGCAATCC GCAACGCCTG ATCTTCGGCG GGGACACGGT CAAGCAATAT GACGGCCGGA CGCGGCGTTA A
|
Protein sequence | METKANYTIV GFFTVLVIAA AFGFVYWMAE YGRGGPMTEL IVRIPGSANG LSVGSPVRFN GIQIGSVQTL SIDADDPQYS LAFTQVRTDA PIYPSTKAAL EIQGLTGAAY IELSGGRKGE ESILQHAIDN GKRAVIVADQ SSVTNLLATA DKILDRANDA VGELQGFIED SRAPLTQTFK NAETFSDALA KNSGNIDAFL QSVGELSNTV KAVSGRVDST LQAVESLVKA VDAKKIDNIV SNADKITANV ADASGDLKGA IQKFDQTATT FNDFGKQAQA TLDRVDTLVA QIDPAKVKGS VDDIAQATKD ARAAVASIRD VANTVSGRQK DIDQTIQDVS QLANKLNSAS TRIDGILIKV DALLGTDNTQ SLFTEARDTL ESFKKVADNL NARIGPIADN LQKFSSGGLR DVQTLVNDMR GTVNNLNDTI TNFDRNPQRL IFGGDTVKQY DGRTRR
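The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for joining the blocks, estimating GC content, and splitting a coding sequence into codons follows. The helper names are hypothetical, and the short input is just the first two blocks of the gene sequence, used for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def codons(seq: str) -> list[str]:
    """Split a sequence into complete codons, dropping any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGGAAACCA AAGCCAATTA")
print(fragment)
print(gc_fraction(fragment))
print(codons(fragment))
```

The first six codons of the fragment (ATG GAA ACC AAA GCC AAT) correspond to the first six residues of the protein sequence (METKAN) under the standard bacterial code, translation table 11. This suggests the gene sequence is shown in coding orientation even though the feature lies on the minus strand.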