Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3141 |
Symbol | |
ID | 8014044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3140029 |
End bp | 3141267 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825707 |
Product | protein of unknown function DUF989 |
Protein accession | YP_002976935 |
Protein GI | 241205839 |
COG category | [S] Function unknown |
COG ID | [COG3748] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.182707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGAAT ATGCCATAGC ATGGGAATGG CTCGCCTTCG CGGCGCGTTG GTTCCATGTC ATTACCGCGA TCGCCTGGAT CGGATCGTCC TTCTATTTCA TCGCGCTCGA TCTCGGACTG GTGAAACGCC CACATCTGCC ACCCGGCGCC TATGGCGAGG AATGGCAGGT CCATGGCGGC GGCTTCTATC ATATCCAGAA ATATCTGGTG GCGCCCGCCC AGATGCCGGA GCACCTGACC TGGTTCAAAT ACGAGAGCTA TTTCACCTGG ATTTCCGGCT TCCTGATGCT GTGCATCGTC TATTACGGCG GCGCCGACCT CTTCCTGATC GATCGGCATG TGCTGGATAT CAGCCCGCCC GTCGCAATCC TGATCTCGCT AGGGTCGCTT GCCCTCGGCT GGGTCGTCTA CGATCTGCTC TGCAAGTCGC CGCTTGGGCG GAATACCTGG GGGCTGATGG CGGTGCTCTA TGTCGTGCTC GTCTTCATGG CCTGGGGCTA TACGCAGCTT TTCACCGGCC GCGCCGCATT CCTGCATCTC GGCGCCTTCA CCGCGACAAT CATGTCGGCC AACGTCTTCA TGATCATCAT TCCGAACCAG AAGATCGTCG TCGCCGACCT CATCGCCGGA CGGGTTCCCG ATCCCAAATA TGGGCAGGTC GCCAAGCAGC GTTCGCTGCA TAACAACTAC CTGACGCTGC CCGTCATCTT CTTCATGCTG TCGAACCATT ATCCGCTCTC CTTCGGTACG CAGTTCAACT GGGTGATCGC GGCTCTGGTC TTTCTGATGG GCGTCACCAT CCGCCACTGG TTCAACACGA CGCATGCCAG GAAAGGCCGG CCGACCTGGA CCTGGATCGT CACCGTCATT CTCTTCATCC TGATCATCTG GCTTTCGACC GTGCCGAAGC TGCTGACCGG CGAAACGGAT GCGGCAGCCG TCGCGCCCGC CTTCCAGCAA TTCGCCGGCG ATCCGCATTT CCCCGCCGTC AAGCAACTGG TCTCGACGCG CTGTTCCATG TGCCACGCGG CCGAGCCGGT CTATGAGGGT ATCGCGCGGC CGCCCAAGGG CGTGATGCTC GAAAACGACG CGGAAATCGC CGCCCATGCC CGCGAGATCT ATATACAGGC GGGCCGCAGC CATGCCATGC CGCCCGGCAA CATCACCGAT ATCACGCCGG ACGAGCGCAA GCTGCTGGTC GCCTGGTTCG AGAGCGCAGT CGAAGGCAAG CAACAATGA
|
Protein sequence | MYEYAIAWEW LAFAARWFHV ITAIAWIGSS FYFIALDLGL VKRPHLPPGA YGEEWQVHGG GFYHIQKYLV APAQMPEHLT WFKYESYFTW ISGFLMLCIV YYGGADLFLI DRHVLDISPP VAILISLGSL ALGWVVYDLL CKSPLGRNTW GLMAVLYVVL VFMAWGYTQL FTGRAAFLHL GAFTATIMSA NVFMIIIPNQ KIVVADLIAG RVPDPKYGQV AKQRSLHNNY LTLPVIFFML SNHYPLSFGT QFNWVIAALV FLMGVTIRHW FNTTHARKGR PTWTWIVTVI LFILIIWLST VPKLLTGETD AAAVAPAFQQ FAGDPHFPAV KQLVSTRCSM CHAAEPVYEG IARPPKGVML ENDAEIAAHA REIYIQAGRS HAMPPGNITD ITPDERKLLV AWFESAVEGK QQ
|
| |