Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6217 |
Symbol | |
ID | 8016229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 271211 |
End bp | 272521 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644827522 |
Product | polysaccharide export protein |
Protein accession | YP_002978722 |
Protein GI | 241258838 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.453567 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.166454 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTAT CGTATCGCGT TCTCAACGTG TTTTTTCCGG CGACACTTGT CATCAGTACA CTGGTCTTTC TTGCCGGAAC TGTACCGCCG GCTCTCGCCA ATGACGCTGC TTTTGCTCCG CAAACGAAAA TCCGTCTGAC GATTGTCCAA TGGATGCAGT CCAAAGGCCA ATATGAAAGA TGGGATGCGC TTGGCGGCGA ATACACCGTC TCGGATGAAG GCGCTGTCTT TCTGCCCTTT CTTGGATCCG TTTCCGTCGG CAATCTCGAT AACACCAGCC TCACAACCGA GATTGCCAAG CGTTTGCAAG AAAAAATCGG ATTGGTTCAG CCACCCGCCG TCACCATCGA AATTCTTGAA TACCCGCCAA TCTATGTCGT CGGAGACGTA ACCACGCCTG GGGAGTACAA ATTCCGTTCC GGACTTACCG TCCTGCAATC CCTGGCAATG AGCGGTGGCC CATTCCGCGC CACAAGTCAG CAGCAATCGC AGACGATCAA ACTCGCAGGC GAATTGCGGG AAATCGACCA CTCGCTGCTG CGCAGCACTG CCAAATTAGC ACGGCTCCAA GCAGAGATGA TCGGGGCAAA GGAGATCTCG TTCGATCAGA CGGTCGGCGT CGATCAGCGA TACGCTGCGG GAATATACAA CGAGGAACGG GTTATTTTTC AGGCTCGTGC AAATGCGCTG GACAGGCAGT CGAAGGCGCT CACGGAATTG CGTGATCTTT TGAACAGTGA AGTCGGCATG CTGGGCGAGA AGGTGCAGGG CTCGGAGGAC AATATCAAAT CTATCGAGGA GCAGCTGACC AGCGTCAAAA CGCTGGTTTC GAAAGGCCTC ACACTCTCGT CGCGTCAATT GGATCTGGAA CGGTTGCTCA CCACCTACCG CTCCGATCGG CTCGACCTTG TCACCGCCAT CATGCGGGGC CGTCAGGCGA TCAGTGAGAC AACGCGAAAT CTTGAGGGGC TTTATGACAC GCGGCGAAGC GAAGTCGCTT CCGAATTGCA GTCGGAACAG GCAAGCCTCG ATCAGTTCAA ATTGAAGCGC GAGATGACGC AAAAACTGCT TCTCGATGAC CTTGCGGCAG GAGGCTCGAC TACCACCGAC GAGGCGCTCC CGCTGACGTT TACGGTGAGC CGGCGAAGCG AAGGGCAAAT CAGGCAGTTC CAGGCCTCCG AAACAACGGC GCTGATCCCG GGTGATGTTG TGAGGGTCGT TCGAACCCCA ATTGCCGATC CGGTGTCTCA GGCCGCGCCC GCCGACCTGT CGCGTGAAAC TGAGACACAC GCCAGTCAGG CAAGCCAGTG A
|
Protein sequence | MKLSYRVLNV FFPATLVIST LVFLAGTVPP ALANDAAFAP QTKIRLTIVQ WMQSKGQYER WDALGGEYTV SDEGAVFLPF LGSVSVGNLD NTSLTTEIAK RLQEKIGLVQ PPAVTIEILE YPPIYVVGDV TTPGEYKFRS GLTVLQSLAM SGGPFRATSQ QQSQTIKLAG ELREIDHSLL RSTAKLARLQ AEMIGAKEIS FDQTVGVDQR YAAGIYNEER VIFQARANAL DRQSKALTEL RDLLNSEVGM LGEKVQGSED NIKSIEEQLT SVKTLVSKGL TLSSRQLDLE RLLTTYRSDR LDLVTAIMRG RQAISETTRN LEGLYDTRRS EVASELQSEQ ASLDQFKLKR EMTQKLLLDD LAAGGSTTTD EALPLTFTVS RRSEGQIRQF QASETTALIP GDVVRVVRTP IADPVSQAAP ADLSRETETH ASQASQ
|
| |