Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3315 |
Symbol | |
ID | 6982068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 3408266 |
End bp | 3409477 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643398032 |
Product | protein of unknown function DUF459 |
Protein accession | YP_002282808 |
Protein GI | 209550891 |
COG category | [S] Function unknown |
COG ID | [COG2845] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAGA AAACTGACCG GACCCCAATC CGTTGGCTCG TGCTCGCCCT GGCGGCGATT TCGCTATGCC TTGGCGCGCT TGCGCCGGTG CATATGGCGG AAGCCCAGGA GCAGCGCTAC CAGCGCCGTT CGATCCTGGA TTTCTTCCTC GGCCGGCGCT ACCTCGACGA TGGGCCGCAG GCGCCTGACG TCCCGCAGCC GCGGCGCCAG CAGCGCAAGC GCCCGCCGCA GCAGAAAGCC ATCGTCAACA CCCGCACCGC GCCGCCGATC CGGGCGCCCG TGCAGGAGGA GCCGGTCGTG CAAAAGCTCG GCGACGCCAA GAAGATCCTG ATCGTCGGCG ATTTCCTGGC CAGCGGCCTC GGCGACGGCC TGACTGCGGC CTTCGAGACT TCGCCGGGGG TCGTCGTCGA AGCCCGCGGC AACGTCTCAT CCGGTCTTGT CCGCGACGAC TATTACGACT GGCCGGAACA GCTGCCGAAG ATGATCGACG AGCTGAAGCC GGCCATGGTC GTCGTCATGA TCGGCGCCAA CGACCGCCAG CAGATGGTGA CAGATACCGC CAAGGAGAAG TTCCGCACCG ACGGCTGGTT TAGCGAATAC CGCCGCCGCG TCCTTTCCTT CGGCAAAGAA GTTACCGACC GCAAGATCCC GCTGCTCTGG GTCGGTCTTC CCGCCTTCGA ATCCGATCAG ATGACGGCCG ATGCCGTCCA GATGAACCAG CTTTACCGCA ACCAGGTCGA AAGCATCGGC GGCGAATTCG TCGATATCTG GGATGGTTTC GTCGATGAAA ACAGCAACTT CATCGTGACC GGCTCGGATG TGAACGGCCA GCAGGTGCGC CTGCGCACCT CCGATGGCAT CAACCTCACG CAGGCCGGCC GGCGCAAGCT CGCCTTCTAT GTTGAAAAGC CCGCCCGCCG CATTCTCGGC ACCCAGGCAA GCCCGGATTT GGTCCGCCTG GACGAAAGCA ATCTGCCGGG CCTCGGCCTT CCCACCAATC CGGTCGAACA CACCGTGCCG ATCAGCCTCT CCGATCCCAA TCTCGACGGC GGCGCCGAGC TTCTTGGCGC CAGGCCCCCG CCAATGACTT TGACGAGGTC GCCCCGCGAC CTCCTGGTCG AGCAGGGCGA AATGACGCCC GCGCCGCCCG GCCGCGTCGA CGATTACCGC TTGCCTACGG CGAAGACGCC GGCCGAAGTC TCGGTGAAAT GA
|
Protein sequence | MTKKTDRTPI RWLVLALAAI SLCLGALAPV HMAEAQEQRY QRRSILDFFL GRRYLDDGPQ APDVPQPRRQ QRKRPPQQKA IVNTRTAPPI RAPVQEEPVV QKLGDAKKIL IVGDFLASGL GDGLTAAFET SPGVVVEARG NVSSGLVRDD YYDWPEQLPK MIDELKPAMV VVMIGANDRQ QMVTDTAKEK FRTDGWFSEY RRRVLSFGKE VTDRKIPLLW VGLPAFESDQ MTADAVQMNQ LYRNQVESIG GEFVDIWDGF VDENSNFIVT GSDVNGQQVR LRTSDGINLT QAGRRKLAFY VEKPARRILG TQASPDLVRL DESNLPGLGL PTNPVEHTVP ISLSDPNLDG GAELLGARPP PMTLTRSPRD LLVEQGEMTP APPGRVDDYR LPTAKTPAEV SVK
|
| |