Gene Information |
Locus tag | Rleg_5389 |
Symbol | |
ID | 8007347 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 801429 |
End bp | 802697 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644822293 |
Product | hypothetical protein |
Protein accession | YP_002973553 |
Protein GI | 241113718 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
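The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene's final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 801429
end_bp = 802697


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1269 422
```

Under those assumptions the result agrees with the Gene Length (1269 bp) and Protein Length (422 aa) rows.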
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGC ATACCGACCA GGCGAAGCGC ATCCTCGCCG CCAACGATCG CGGCGGCTAT ACCGTGCCGA CCGACCGGCT CTACCCGTTC CAGTGGAACT GGGATTCGGC CTTCGTTGCC ATGGGCTTTG CGCTCTACGA TACCGACCGC GCCTATCGCG AGCTGGAGCG GCTGGTCGAG GGCCAGTGGG CTGATGGAAT GATCCCGCAT ATCGTCTTCC ATGCGCCAAG CGATACTTAC TTCCCGGGAC CGAACGTCTG GCGCACGAGA CACGCTATCC CGACCTCCGG CATAACCCAG CCGCCGGTCT TTGCGATTGC GCTGCGCAAG CTGCATGAAG CCGCCGGCAA GGATGGCGAA GCGCGCACCC TGCCCCTCTA CGTGGCGGCG CTGAAATGGC ATCGCTGGTG GTATTCGGCG CGCGACCCTG AAGGCACGGG GCTCATAGCA CTCCTGCATC CCTGGGAAAG CGGCAGCGAC AATTCTCCCG CCTGGGACAT CGCGCTCGCC CGAGTGCCGA CCAATACCGA TACGCCTGTG GTGCGCAAGG ATACCGGTCA TGTCGATGCC GATATGCGCC CGCGCGACGA GGATTACCGC CGCTTCATCC ATCTCGTCGA TACCTATGCC GCCTGCGGCT GGGATCCGGC GCGGCAATGG GAAAAGGCGG CGTTCAAGGT CGCCGAGATC CAGACGACCG CAATCCTGCT CAAGGCGGGC GAAGATCTGG AACACCTTGC CCGCCTGTTT GGGCGGACCG ATGACGCGAT CGAGATCGCT GCCTTCAACG ACCGCAGCCG CAAGGCGATA ATGGCCCAGT GGCGGCCGGA GCTTGTCCGC TTCGTCTCGC GCGACCTGAT CTCCGGCGAA GATGTCGAAG CCGCCACGCA AGCCGGCTTC ATCCCCCTCC TCTCGCTGGA CCTCGACAAG CAGGTTGCGG ACGCCCTGGT CTCCGAAATG AAGGCCTGGT CCAAGGATCT CAAGGTTGCC TTCCCCACGA CCAAACCCGG CATCGCCAGT TGGGAGCCGA AGCGCTACTG GCGCGGCCCC GCCTGGGCGA TCATCAATTG GCTGCTGATC GACGGCCTTA AGCGCAATCG CTACGCGGAT GTCGCCGAAG AGCTGCGGCA ATCCACCATC GCAGCGATCG AAACGGAAGG TTTCGCCGAA TATTTCGACC CGGTCACCGG CCAGGGCTGC GGTGGCCTCG GCTTTTCCTG GACGGCTGCC GCCTATCTAT GGCTTGAGCG AGGCGTCGTC CTCGCCTGA
|
Protein sequence | MNMHTDQAKR ILAANDRGGY TVPTDRLYPF QWNWDSAFVA MGFALYDTDR AYRELERLVE GQWADGMIPH IVFHAPSDTY FPGPNVWRTR HAIPTSGITQ PPVFAIALRK LHEAAGKDGE ARTLPLYVAA LKWHRWWYSA RDPEGTGLIA LLHPWESGSD NSPAWDIALA RVPTNTDTPV VRKDTGHVDA DMRPRDEDYR RFIHLVDTYA ACGWDPARQW EKAAFKVAEI QTTAILLKAG EDLEHLARLF GRTDDAIEIA AFNDRSRKAI MAQWRPELVR FVSRDLISGE DVEAATQAGF IPLLSLDLDK QVADALVSEM KAWSKDLKVA FPTTKPGIAS WEPKRYWRGP AWAIINWLLI DGLKRNRYAD VAEELRQSTI AAIETEGFAE YFDPVTGQGC GGLGFSWTAA AYLWLERGVV LA
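The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds a codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so this sketch does not distinguish them. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short excerpts of the gene sequence are used here rather than the full record.

```python
# Codon table in the conventional TCAG order; amino-acid assignments are
# the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10 and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = "ATGAACATGC ATACCGACCA GGCGAAGCGC"  # first three blocks of the gene
last_9 = "CTCGCCTGA"                           # final block of the gene
print(translate(first_30))  # MNMHTDQAKR
print(translate(last_9))    # LA*
```

The first ten residues and the final two match the start and end of the protein sequence row, and the trailing `*` corresponds to the TGA stop codon. Applying `gc_fraction` to the full gene sequence would be one way to check the GC content row; that value is not recomputed here.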