Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3922 |
Symbol | |
ID | 8015870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3995587 |
End bp | 3997245 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826491 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002977702 |
Protein GI | 241206606 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.92025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCAG CGGAAACACG TACAGACGGG GCGTTGGCGG GGGTAGGCCT GCTCGATCGT GAAAGGATCG TCGCAAGGCC CGGGTTCAAT CGGTGGCTGG TGCCGCCGGC AGCGCTCGCC ATCCATCTCT GCATCGGCAT GGCCTATGGT TTCAGCGTGT TCTGGCTGCC GCTTTCCAAG TCGCTCGGGA TCACGGCCTC GACGGCCTGC CCTGATCTGA CACTCGCCTC GGCGCTGTTC ACCACGACCT GCGACTGGCG CGTTGCCGAC CTCGGCTGGA TCTATACGTT GTTCTTCGTT CTACTTGGCT CATCGGCCGC CATCTGGGGC GGCTGGCTTG AGCGCGCCGG TCCGCGCAAG GCGGGCGTCG TTTCCGCTTG CTGCTGGTGC GGCGGCATCA TTCTTGCGGC GATCGGCGTC ATTACTCATC AGCTCTGGAT GATGTGGCTC GGTGCCGGCG TCATCGGCGG CATCGGTCTC GGCCTCGGTT ACATCTCACC GGTCTCGACA CTGATCAAAT GGTTTCCCGA CCGGCGCGGC ATGGCGACCG GTATGGCGAT CATGGGCTTC GGCGGCGGTG CGATGATCGG CGCCCCATTG GCCAACCTGC TGATGAATGC CTTCAAGACC GACAGTTCGG TCGGCGTCTG GCAGACCTTC ATCGTCATGG CGGTGATCTA TTTCGTCTTC ATGATGGGCG GCGCCTTCGG CTATCGCATT CCGCCGGCCG GCTGGCGTCC CGAGGGTTGG ACGCCGCCTG CAGCCAAGAG CACGATGATC ACCACCAAAC ATGTGCACCT CAGCAATGCC CACAGGACAA AACAGTTCTG GCTGATCTGG GCGGTGCTTT GCCTCAACGT TTCGGCCGGT ATCGGCGTTA TCGGCATGGC CTCGCCGATG CTGCAGGAAA TCTTCGCCGG TTCGCTGATC GGCCTGCCGG ATGTCGGCTT CGCCCAGCTC GACGCCGGGC AGAAGGCATC GATCGCCACG ATCGCCGCCG GTTTCGCCGG CTTGCTGTCG CTCTTCAACA TCGGCGGACG GTTCTTCTGG GCGTCGCTGT CCGACAAGAT CGGCCGCAAG AATACCTATT ATTGCTTCTT CGTCCTCGGC ATCGTGCTCT ATGCACTGGC GCCCACTTTC GCAGGCATGG GCAACAAGGC GCTGTTCGTT CTCTCCTTCG GCATCATCCT GTCGATGTAC GGCGGCGGCT TTGCGACGAT CCCGGCCTAT CTTGCCGATA TTTTCGGTAC GCAGTTCGTC GGCGCCATCC ACGGCCGGCT GCTGACGGCC TGGGCAACGG CCGGCATCGT CGGGCCCGTT GTCGTCAACT ATATCCGCGA AGCGCAGATC GCCGCCGGCG TTGCGCCCGG GCCAACCCTC TATACCGGCA CCATGTATAT CCTCGCCGGC ATGCTGGCGC TCGGGCTCAT CGCCAATGCG CTGATCAGGC CGCTTTCGGA CAAATGGTTC ATGTCCGATG ACGAGGTCGC CGCGTTGCAG GCGAAGTCGG CCGCCGTCAA TGCCGGGCCG ACGGGTTCCT TCGGCATCGG CACAGGCGGG CTGGACGCCA AGGCGATGCT TGCCTGGGCC GTCGTCGGCA TTCCGCTGCT GTGGGGTGTC TGGGTGACGC TGAGGGCAAC CTTCGCGCTG TTCGGCTAA
|
Protein sequence | MTAAETRTDG ALAGVGLLDR ERIVARPGFN RWLVPPAALA IHLCIGMAYG FSVFWLPLSK SLGITASTAC PDLTLASALF TTTCDWRVAD LGWIYTLFFV LLGSSAAIWG GWLERAGPRK AGVVSACCWC GGIILAAIGV ITHQLWMMWL GAGVIGGIGL GLGYISPVST LIKWFPDRRG MATGMAIMGF GGGAMIGAPL ANLLMNAFKT DSSVGVWQTF IVMAVIYFVF MMGGAFGYRI PPAGWRPEGW TPPAAKSTMI TTKHVHLSNA HRTKQFWLIW AVLCLNVSAG IGVIGMASPM LQEIFAGSLI GLPDVGFAQL DAGQKASIAT IAAGFAGLLS LFNIGGRFFW ASLSDKIGRK NTYYCFFVLG IVLYALAPTF AGMGNKALFV LSFGIILSMY GGGFATIPAY LADIFGTQFV GAIHGRLLTA WATAGIVGPV VVNYIREAQI AAGVAPGPTL YTGTMYILAG MLALGLIANA LIRPLSDKWF MSDDEVAALQ AKSAAVNAGP TGSFGIGTGG LDAKAMLAWA VVGIPLLWGV WVTLRATFAL FG
|
| |