Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3226 |
Symbol | |
ID | 6981978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3312563 |
End bp | 3315166 |
Gene Length | 2604 bp |
Protein Length | 867 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643397943 |
Product | protein of unknown function DUF470 |
Protein accession | YP_002282719 |
Protein GI | 209550802 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.598284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGGTC ACGACAATTT GAAAGAGATC GAAGACGCGG AGGGATTTTC TTTTCGCGGT TTATTCAGAC GTTACAGAAC ACCGCTGACG GCAGCGGCGA CACTCGTGGT CTTCTGCCTC GTCGGCTATG CGATCATGCA GCTCACCAAC GAAGTGCGCT ATGACGATGT CGTCGATGCG CTGGCGGCGA CCAGGCCGAG CGCCATCCTG CTTGCCCTGT TCTTTACCGC ACTCAGTTTC CTCTCGCTGG TTTTCTACGA TCTCAACGCC ATCGAATATA TCGGCAAGAA GCTACCCTTT CCGCATGTGG CGCTGACGGC GTTCAGCGCC TATGCCGTCG GCAATACCGC CGGTTTCGGC GCGCTTTCGG GTGGCGCGAT CCGCTACCGC GCCTATACGC GCCTCGGGCT TTCGCCCGAA GATATCGGAC GCATCATCGC CTTCGTCACG CTTTCCTTCG GTCTCGGGCT TGCGGGAGTG GGAGCGATCG CCCTGATCAT CATCGCAGAC GAGATCGGCC CGCTGATCGG CGTCAGCCCC TTCCTGCTGC GGTTGATCGC CGGCTCGATC GTCGCCATTC TGGGCGCCGT GATGATCATC GGCCGGGATG GGCGCGTGCT CGATCTCGGC CCTGTCGCAA TCCGCCTGCC GGATTCGCGC ACCTGGTCGC GGCAGTTCCT CGTCACCGCC TTCGATATCG CCGCTTCGGC GTCCGTGCTT TACGTGCTGC TGCCGCAGAC GGCCATCGGC TGGCCGGTCT TCCTGGCCGT CTATGCGATC GCCGTCGGTC TCGGCGTACT CAGCCACGTT CCGGCCGGGC TCGGCGTGTT CGAAACCGTG ATCATCGCCT CGCTCGGCAG CGCGGTGAAC ATCGATGCGG TGCTCGGATC GCTGGTGCTC TACCGTCTGA TCTATCATGT GCTGCCGCTC TTGATCGCCG TGCTTGCGGT TTCGGCGACG GAACTGCGCC GTTTCGTCGA CCATCCGGCC GCCTCCAGCG TCCGGCGCAT CGGCGGAAGG CTGATGCCGC AGCTATTGTC GGCGCTTGCA CTGCTGCTCG GCGTCATGCT GATCTTTTCG AGCGTCACAC CGACACCGGA CCAGAACCTC GAATTCCTTT CCAACTATCT GCCGCTGCCG ATGGTGGAAG GGGCGCATTT CCTTTCCAGC CTGCTCGGGC TCGCGCTCGT CGTCGCTGCG CGCGGGCTCG GCCAGCGGCT CGACGGCGCT TGGTGGGTGG CGGTATTTTC GGCCCTTGCC GCACTGACTT TGTCGCTCTT GAAAGCGATC GCGCTCGTCG AAGCCGCCTT TCTCGCCTTC CTCATCTTCG GGCTCTTCGT CAGCCGGCGG CTCTTTACCC GCCATGCCTC GCTGCTCAAT CAGGCGCTGA CGGCGTCCTG GCTGATGGCG ATCGCGGTCA TCGTCGTCGG TGCCGTCGTC ATCCTGCTCT TCGTCTATCG CGATGTCGAA TACAGCAACC AGCTCTGGTG GCAGTTCGAG TTTACCGCCG AAGCGCCGCG TGGCTTACGC GCCGTGCTCG GCATCACCAT CATCTCGTCG GCGATCGCCA TCTTCAGCCT GCTCCGGCCG GCCACCTTCC GGCCGGAACC GGCGACGGAG GAGGCACTGG CGCGTGCCGT CGAGATCGTC GGCAAGCAGG GCAATGCCGA TGCCAATCTG GTGCGCATGG GCGACAAGAG CATCATGTTC TCGGAAAAGG GTGACGCCTT CATCATGTAC GGCCGGCAGG GCCGGTCCTG GATCGCGCTG TTCGACCCGG TCGGCGAACA TCGCGCCGGG CAGGAGCTCG TCTGGCGTTT CGTCGAAGCA GCGCGCGCCG CCGGCTGCCG CGCCGTGTTC TATCAGATCT CGCCGTCGCT GCTTTCCCAT TGCGCCGATG CCGGCCTTCG CGCCTTCAAG CTCGGCGAAC TGGCAGTGAC CGATCTTAGA ACCTTCGAGA TGAAGGGCGG CAAATGGGCG AACCTTCGCC AGACCGCGAG CCGCGCCCAG CGCGACGGGC TGGAATTTGC CGTTGTCGAA CCTGAGGACG TGCCCTCGGT CATCGATGAT CTCTCCGCTG TCTCCACGGC CTGGCTCGAG CATCACAATG CCAAGGAAAA GGGCTTTTCA CTCGGCGCCT TCGATCCCGA TTACGTCTCT TCCCAGCCGG TCGGTATCCT GAAAAAGGAC GGCAGGATCG TTGCCTTCGC CAATATCCTC GTCACCGAAT CCAGGCAGGA GGGCACGATC GATCTCATGC GCTTCTCGCC GGACGCGCCG AAGGGCTCGA TGGACTTTCT CTTCGTGCAG ATCATGGAAT ATCTGCGCGG CCAGGGTTTC ACCCACTTCA ATCTCGGCAT GGCGCCTCTC TCCGGCATGT CGAAACGCGA GGCGGCGCCC GTCTGGGACC GCATCGGCAG CACCGTCTTC GAACACGGCG AGCGCTTCTA CAACTTCAAA GGCCTGCGGG CATTCAAATC AAAATTTCAT CCGCACTGGC AACCGCGCTA TCTTGCGGTC TCCGGAGGGG GCAATCCGAT GATCGCGTTG ATGGACGCGA CATTTCTGAT CGGGGGCGGA TTGAAAGGGG TAGTGAGAAA ATGA
|
Protein sequence | MSGHDNLKEI EDAEGFSFRG LFRRYRTPLT AAATLVVFCL VGYAIMQLTN EVRYDDVVDA LAATRPSAIL LALFFTALSF LSLVFYDLNA IEYIGKKLPF PHVALTAFSA YAVGNTAGFG ALSGGAIRYR AYTRLGLSPE DIGRIIAFVT LSFGLGLAGV GAIALIIIAD EIGPLIGVSP FLLRLIAGSI VAILGAVMII GRDGRVLDLG PVAIRLPDSR TWSRQFLVTA FDIAASASVL YVLLPQTAIG WPVFLAVYAI AVGLGVLSHV PAGLGVFETV IIASLGSAVN IDAVLGSLVL YRLIYHVLPL LIAVLAVSAT ELRRFVDHPA ASSVRRIGGR LMPQLLSALA LLLGVMLIFS SVTPTPDQNL EFLSNYLPLP MVEGAHFLSS LLGLALVVAA RGLGQRLDGA WWVAVFSALA ALTLSLLKAI ALVEAAFLAF LIFGLFVSRR LFTRHASLLN QALTASWLMA IAVIVVGAVV ILLFVYRDVE YSNQLWWQFE FTAEAPRGLR AVLGITIISS AIAIFSLLRP ATFRPEPATE EALARAVEIV GKQGNADANL VRMGDKSIMF SEKGDAFIMY GRQGRSWIAL FDPVGEHRAG QELVWRFVEA ARAAGCRAVF YQISPSLLSH CADAGLRAFK LGELAVTDLR TFEMKGGKWA NLRQTASRAQ RDGLEFAVVE PEDVPSVIDD LSAVSTAWLE HHNAKEKGFS LGAFDPDYVS SQPVGILKKD GRIVAFANIL VTESRQEGTI DLMRFSPDAP KGSMDFLFVQ IMEYLRGQGF THFNLGMAPL SGMSKREAAP VWDRIGSTVF EHGERFYNFK GLRAFKSKFH PHWQPRYLAV SGGGNPMIAL MDATFLIGGG LKGVVRK
|
| |