Gene Information |
Locus tag | Rleg_2943 |
Symbol | |
ID | 8013869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2934758 |
End bp | 2935978 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644825513 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002976741 |
Protein GI | 241205645 |
COG category | |
COG ID | |
TIGRFAM ID | |
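The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. Both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2934758, 2935978)
print(length)                  # 1221
print(protein_length(length))  # 406
```

The strand does not affect these lengths. A gene on the minus strand, as here, still spans the same interval on the replicon.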
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.747804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATAT CCCAAGGACC GGGGAGCGTT CTTCGCCACC CCGGCTATCT GAACTTCGCT GCCTCCCGCG TTTTTTCCTC GCTCTCCTTC CAGTCCATTG GCATCGCCAT GGGGTGGATG ATCTACGATC AGACACACAG CGCCTTTGCG CTCGGCCTAG TCGGCTTGTG CCAATTCCTG CCGATGGCGG TGCTGACCTT CGTCGTCGGC CATGTCGCCG ACCGGTTCGA CCGGCGGCGC ATCGGGCTCG TCTGCCAGTT GATCGAAGCG GTGACGGCGC TGGTGCTGGC GGTTGCCACC TGGCAGCAAT GGCTGACGCC GGCCGGCATC CTTGCCGCCG TCACCGTGCT CGGCGCCGTC GTCGCCTTTG AACGGCCGAC CATGGCGGCG CTGTTGCCGA ACATCGTGCC GGCCTCGATG TTGCAGAAGG CGGTCGCAAC CTCGACCTCG TTGATGCAGA CGGCGATGAT CATCGGCCCC TCGCTCGGAG GCCTGCTTTA TGGCCTCCAC CCCGTCGCGC CTTTTGCTAT ATCGGCGCTG CTCTTTGCCG TCGCAAGCTT CAACGTCATC TCGATCCGCA TGCAATGGAG CCCTGCCAAG CGTGAGCCGG TGACGCTCGC CTCGGTCTTT GCCGGCGTCT CCTTCATCCG CAGCCGGCCG GTGATGCTCG GCACGATCTC GCTCGATCTC TTCGCGGTGC TGCTCGGCGG CGCGACTGCG CTGCTGCCGA TGTTCGCCAG CGATATCCTG CATGCCGGTC CCTGGGGCCT CGGTTTTCTG CGCGCGGCCC CGGCGGTCGG CGCGCTTGCC ATGTCGATCA TGCTCGCTCG CCGGCCACTG AGCAGCAATG TCGGTCGCAA GATGCTCGCC GCCGTCGCCG TGTTCGGCGT CGCCACCATC GTCTTCTCGC TGTCGACCAA TATCGCGCTT TCCGTCGTCG CGCTGCTCGT CATCGGCGCG TCCGATACGG TGAGCGTCGT CGTGCGCAGC TCGCTGGTGC AGCTTTTGAC GCCGGACGAG ATGCGTGGTC GTGTCAGTGC CGTCAACTCG CTGTTCATCG GCACCTCCAA CCAGCTCGGC GAATTCGAAT CCGGCATGAT GGCGGCGGCG CTTGGGCCGG TCGCCACCGG CATCGTCGGC GGCTTCGGCA CGATCGTCGT CGTGCTCCTG TGGATGCGGC TCTTCCCCGA TCTTACCAAG GTCAAGACGC TGCAGGGTTA G
|
Protein sequence | MDISQGPGSV LRHPGYLNFA ASRVFSSLSF QSIGIAMGWM IYDQTHSAFA LGLVGLCQFL PMAVLTFVVG HVADRFDRRR IGLVCQLIEA VTALVLAVAT WQQWLTPAGI LAAVTVLGAV VAFERPTMAA LLPNIVPASM LQKAVATSTS LMQTAMIIGP SLGGLLYGLH PVAPFAISAL LFAVASFNVI SIRMQWSPAK REPVTLASVF AGVSFIRSRP VMLGTISLDL FAVLLGGATA LLPMFASDIL HAGPWGLGFL RAAPAVGALA MSIMLARRPL SSNVGRKMLA AVAVFGVATI VFSLSTNIAL SVVALLVIGA SDTVSVVVRS SLVQLLTPDE MRGRVSAVNS LFIGTSNQLG EFESGMMAAA LGPVATGIVG GFGTIVVVLL WMRLFPDLTK VKTLQG
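The relationship between the gene sequence and the protein sequence above can be checked with a short sketch. It assumes the gene sequence is listed in coding orientation (5' to 3', already reverse-complemented for the minus strand), which matches the leading ATG and trailing TAG. The codon table is built from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The alternative start codons that table 11 allows are not handled, which does not matter here because this gene starts with ATG. The helper names are hypothetical.

```python
import itertools

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons from position 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGACATAT CCCAAGGACC"))  # MDISQG
print(translate("CTGCAGGGTT AG"))          # LQG
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison with the protein sequence and the GC content reported in this record.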