Gene Information |
Locus tag | Rleg_2197 |
Symbol | |
ID | 8013208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2196535 |
End bp | 2197755 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824783 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002976013 |
Protein GI | 241204917 |
COG category | |
COG ID | |
TIGRFAM ID | |
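
The length fields in the record above can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on replicon NC_012850.
START_BP = 2196535
END_BP = 2197755


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.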
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0740426 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCCG CTCAAAATTC CCCGACGGGT GAGGGAGCGC CGCCGCGCTT CCGGCTTCTC AGTGCGCTTT CCTATTGTGC GCCGCTGCTC GTCAACGGCA TCGTCCTGCC GTTCTTTCCC GTCTGGCTCG CAACCCATAG CTTTAGCGAT CATGAGATCG GCATCATCCT TGCCATACCC ATGGTGGTTC GCGTGCTGGT GGCGCCTGTC GTCGCCATGA TCGCCGATCG ATTGAAGGAG CGCGCCGACG TGCTGCTCTG GTCGGGCGGC CTTTCGCTGC TGACGGCGGT CGCGCTGTTC TGGACGACGA CTTTCTGGCC GGTGACGATC GTCTATGCGC TGCAGGGCGC CACCTTCGCA CCCTATGTGC CCGTCGTCGA ATCGATCGTC ATCTCGGGCG TGCGCCGCTG GGGGCTCGAT TACGGGTCGA TGCGCGTGTG GGGCTCCATC GCCTTTATCG TCTCGACGCT GGTCGGTGGC CAGATGATCA GCCGGTGGGG CGGCGGAATG GTGCTCGATG TCATGGTGTT CGGCTTTGTC ATGACCGTTG TCATGGCGAT CTTCTGTCCG CGCATCGGGC CAACGCGACG GCGGGGCCAG CCGATCAACA TCCCAGCCGC TACCGGCAGT GGCCTGCGCG AGCCGCACCT GCTGCTGCTT TTGATCGGCG TTGCCATCCA GCAGTCGAGC CATGCGGTGC TGAACGCTTT TTCCTCGATC TACTGGCATC AGCTCGGCTT CTCCGGCACT GAGGTCGGCC TGCTCTGGAG CGCCGGCGTC GCCTCGGAAG TGACGGTGTT CTTCCTGTCG AAGCGTCTCA ACCGTCGCTT CGATGCCTGG ACGCTGATCC GCTTCGGCTG CGCCATCAGC GTCTGCCGCT GGATCCTGTT TCCGATGAAT ACCGGTTTTG CCGGTTTCTT CCTGCTGCAA TGTTTCCACG GCTTCACCTA TGCCTTCGTG CATACCGGCG TGCAGCGACG GATCATGGCG ACGGTGCAGG AGACGCAGGA ATCTTCGGCA CAGGGCGCCT ATTTCTTCTA TGTCGGCATG GCGATGGCGC TGATGACCCT GGCGTCGGGT TATCTCTACG CCTGGCTCGG CGTCGTCAGC TATTACGTCA TGGCGCTGGT CGCGTTTTCC GGCCTCGGCC TCGTCATCTT CGCCTATTAC CTTCAGCCCC AAAGGGTGCT TTCCGGCGGA AAGACCAGCG AAGCGGCGTA G
|
Protein sequence | MIPAQNSPTG EGAPPRFRLL SALSYCAPLL VNGIVLPFFP VWLATHSFSD HEIGIILAIP MVVRVLVAPV VAMIADRLKE RADVLLWSGG LSLLTAVALF WTTTFWPVTI VYALQGATFA PYVPVVESIV ISGVRRWGLD YGSMRVWGSI AFIVSTLVGG QMISRWGGGM VLDVMVFGFV MTVVMAIFCP RIGPTRRRGQ PINIPAATGS GLREPHLLLL LIGVAIQQSS HAVLNAFSSI YWHQLGFSGT EVGLLWSAGV ASEVTVFFLS KRLNRRFDAW TLIRFGCAIS VCRWILFPMN TGFAGFFLLQ CFHGFTYAFV HTGVQRRIMA TVQETQESSA QGAYFFYVGM AMALMTLASG YLYAWLGVVS YYVMALVAFS GLGLVIFAYY LQPQRVLSGG KTSEAA
|
| |
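
The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal sketch, not the pipeline used to produce the record. It assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand), and it uses the codon assignments of translation table 11, which for internal codons match the standard code. Table 11 also permits alternative start codons; that case is not handled here because this gene starts with ATG. `CODON_TABLE` and `translate` are names chosen for this illustration.

```python
from itertools import product

# Codon-to-residue assignments in the usual T, C, A, G ordering
# (first base varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATTCCCG CTCAAAATTC CCCGACGGGT"))  # MIPAQNSPTG
```

Passing the full Gene sequence string to `translate` should give the listed Protein sequence once the spaces in the latter are removed.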