Gene Information |
Locus tag | Rleg_3521 |
Symbol | |
ID | 8014387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3554806 |
End bp | 3557409 |
Gene Length | 2604 bp |
Protein Length | 867 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644826086 |
Product | protein of unknown function DUF470 |
Protein accession | YP_002977306 |
Protein GI | 241206210 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
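The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rleg_3521.
length = gene_length(3554806, 3557409)
print(length)                  # 2604
print(protein_length(length))  # 867
```

Under these assumptions the computed values agree with the Gene Length (2604 bp) and Protein Length (867 aa) fields listed in the record.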
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0528845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGGGGC ACGGCAATTT GGAAGAGATT GAAGAAGCGG ACGGGTTTTC TTTTGGCGGT CTATTCAGAC GGTACAGAAC GCCGCTGCTG GCAGCCGCAT CACTGGTGGT CTTCTGCCTC GTCGGTTACG CCATCATGCA GCTCACCAAC GAGGTGCGCT ATGACGACGT CGTCGGTGCG CTCGCCGCCA CCAGGCCGAG CTCGATCCTG CTTGCGTTGT TCTTCACGGC GCTCAGTTTC CTCGCACTGA TCTTCTACGA TCTCAACGCC ATCGAATATA TCGGCAAGAA GCTGCCCTTT CCGCATGTCG CGCTGACGGC GTTCAGTGCC TATGCCGTCG GCAATACCGC CGGTTTCGGC GCCCTGTCTG GGGGGGCGAT TCGCTACCGC GCCTATACGC GCCTCGGGCT CTCGCCGGAG GATATCGGAC GCATCATCGC CTTCGTCACG CTGTCCTTCG GATTGGGGCT TGCGGGCGTC GCGGCAATCG CCCTCATCGT CATCGCCGAC GAGATCGGCC CGCTCGTCGG CATCAGCCCC TTCCTTCTTC GGCTGATCGC CGGTTCGATC GTCGCCATTC TGGGCGCGGT GATGATCATC GGCCGTGACG GGCGCGTGCT CGATCTTGGC CCTGTGGCAA TCCGCCTGCC GGATTCGCGT ACCTGGTCGC GGCAGTTCCT CGTCACAGCC TTCGATATCG CCGCCTCGGC GTCGGTGCTC TATGTGCTGC TGCCGCAGAC AGCCATCGGC TGGCCGGTCT TCCTCGCCGT CTATGCGATC GCCGTCGGTC TCGGCGTGCT CAGCCATGTT CCGGCCGGGC TCGGCGTGTT CGAGACCGTG ATCATCGCCT CGCTCGGCAG CGCGGTGAAC ATCGATGCCG TGCTCGGATC GCTGGTGCTC TACCGGCTGG TCTACCATGT GCTGCCGCTG CTGATCGCCG TGCTCGCGGT CTCGGCGGCG GAGCTGCGTC GTTTCGTCGA CCATCCGGCG GCCTCCAGCA TGCGGCGCAT CGGCGGACGG CTGATGCCGC AGCTGCTGTC GGCTCTCGCG CTGCTGCTCG GCGTCATGCT GGTGTTTTCG AGCGTCACGC CGACGCCGGA CCAGAACCTC GAATTCCTCT CCAACTATCT GCCACTGCCG ATGGTCGAAG GGGCGCATTT CCTCTCCAGC CTGTTGGGGC TGGCACTCGT CGTCGCCGCG CGCGGCCTCG GCCAGCGGCT CGACGGCGCC TGGTGGGTCG CCGTGTTCTC GGCGCTTGCC GCGCTGACCT TGTCGCTGCT GAAGGCGATC GCGCTTGTCG AAGCGGCTTT TCTTGCCTTC CTCATCTTCG GGCTCTTCGT CAGCCGCCGG CTCTTTACCC GCCAGGCCTC GCTGCTCAAC CAGGCGCTGA CGGCGTCCTG GCTGATGGCG ATCGCCGTGA TCGTCGTCGG GGCTGTCGTC ATCCTGCTCT TCGTCTATCG CGACGTCGAA TACAGCAACC AGCTCTGGTG GCAGTTCGAA TTCACCGCCG AGGCGCCGCG CGGGCTGCGC GCCGTGCTCG GCATCACCAT CATTTCGTCG GCGATCGCCG TCTTCAGCCT GCTCCGACCG GCGAGCTTCC GGCCGGAGCA GGCGACGGAC GAGGCGCTGG CGCGTGCCGT CGAGATCGTC ATGAAGCAGG GCAATGCCGA TGCCAATCTT GTGCGCATGG GCGACAAGAG CATCATGTTC TCGGAAAACG GCGATGCCTT CATCATGTAT GGCCGGCAGG GCCGCTCCTG GATCGCGCTG TTCGATCCGG TCGGCGACCA TCGCGCCGTG 
CAGGAACTCG TCTGGCGTTT CGTCGAAGCG GCGCGCGCCG CCGGCTGCCG GGCCGTGTTC TATCAGATAT CGCCGGCGCT GCTGTCCCAT TGCGCCGATG CCGGCCTGCG CGCCTTCAAG CTCGGCGAAC TGGCGGTGGC CGATCTCAGG ACCTTCGAGA TGAAGGGCGG CAAATGGGCA AACCTTCGCC AGACGGCAAG CCGCGCCCAG CGCGACGGGC TGGAATTCGC CGTCGTCGAA CCGCAGGACT TGTCTTCGGT CATCGATGAT CTTGCCGCGG TTTCGACGGC CTGGCTCGAG CATCACAATG CCAAGGAAAA GGGCTTCTCG CTCGGCGCGT TCGATTTCGA TTATGTCTCC TCGCAGCCGG TCGGCATCCT GAAAAAGGAC GGCAGGATCG TCGCCTTCGC CAATATCCTC GTGACCGAAT CCAGACAGGA GGGCACGATC GATCTCATGC GCTTCTCGCC GGACGCGCCG AAGGGCTCGA TGGACTTTCT CTTCGTGCAG ATCATGGAAT ATCTGCGAGG GCAGGGTTTC ACCCACTTCA ATCTCGGCAT GGCGCCTCTC TCCGGCATGT CGAAACGCGA GGCGGCGCCC GTCTGGGACC GCATCGGCAG CACCGTCTTC GAACACGGCG AGCGCTTCTA TAACTTCAAA GGCCTTCGGG CATTCAAATC CAAGTTTCAT CCGCACTGGC AACCGCGCTA TCTTGCGGTC TCAGGAGGGG GCAATCCGAT GATCGCGTTG ATGGACGCGA CATTTCTGAT TGGGGGCGGA TTGAAAGGGG TAGTGAGAAA ATGA
Protein sequence | MSGHGNLEEI EEADGFSFGG LFRRYRTPLL AAASLVVFCL VGYAIMQLTN EVRYDDVVGA LAATRPSSIL LALFFTALSF LALIFYDLNA IEYIGKKLPF PHVALTAFSA YAVGNTAGFG ALSGGAIRYR AYTRLGLSPE DIGRIIAFVT LSFGLGLAGV AAIALIVIAD EIGPLVGISP FLLRLIAGSI VAILGAVMII GRDGRVLDLG PVAIRLPDSR TWSRQFLVTA FDIAASASVL YVLLPQTAIG WPVFLAVYAI AVGLGVLSHV PAGLGVFETV IIASLGSAVN IDAVLGSLVL YRLVYHVLPL LIAVLAVSAA ELRRFVDHPA ASSMRRIGGR LMPQLLSALA LLLGVMLVFS SVTPTPDQNL EFLSNYLPLP MVEGAHFLSS LLGLALVVAA RGLGQRLDGA WWVAVFSALA ALTLSLLKAI ALVEAAFLAF LIFGLFVSRR LFTRQASLLN QALTASWLMA IAVIVVGAVV ILLFVYRDVE YSNQLWWQFE FTAEAPRGLR AVLGITIISS AIAVFSLLRP ASFRPEQATD EALARAVEIV MKQGNADANL VRMGDKSIMF SENGDAFIMY GRQGRSWIAL FDPVGDHRAV QELVWRFVEA ARAAGCRAVF YQISPALLSH CADAGLRAFK LGELAVADLR TFEMKGGKWA NLRQTASRAQ RDGLEFAVVE PQDLSSVIDD LAAVSTAWLE HHNAKEKGFS LGAFDFDYVS SQPVGILKKD GRIVAFANIL VTESRQEGTI DLMRFSPDAP KGSMDFLFVQ IMEYLRGQGF THFNLGMAPL SGMSKREAAP VWDRIGSTVF EHGERFYNFK GLRAFKSKFH PHWQPRYLAV SGGGNPMIAL MDATFLIGGG LKGVVRK
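The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted nucleotide string might be cleaned and its GC content computed is shown below; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first three blocks of the gene sequence are used as input here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGTCGGGGC ACGGCAATTT GGAAGAGATT")
print(len(fragment))         # 30
print(gc_percent(fragment))  # 50.0
```

Applying the same functions to the full gene sequence would give a value to compare against the GC content field of the record; the short fragment used here is not expected to match the gene-wide figure.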