Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4156 |
Symbol | |
ID | 6982928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4332866 |
End bp | 4334206 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643398886 |
Product | hypothetical protein |
Protein accession | YP_002283644 |
Protein GI | 209551727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.256613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAATC CGAACCCTTA TACCCCGAGC TATTCGTTTT CCGGCTGGCA GACGTCCAAT CCTGCGAAAC CGTTGCCTGC GCCGCAGGTC GACAATGAGC TTGCGAACAT CTCGACGTCG CTCAACGCCG CGATATCAGG GCTGCTGGAC ATCCGGCGCT CCGACGGCAA GCTAAAGAAC GGGATCGTCA CGTTCGAAAG CCTGAATAAC GACCTAAAGG CCGGGTACTC AGGCGGCGCT GTTTCGGCAT GGGCGCCAGT CGTTGATTAC GCGGCCGGGA TTGTCGCCAC GTCGATCGCG CCAGCTACGG TCGTCGTCTA CCAGGGGGAA AGCTACGTTT GCACCACCGA CCACGTCACA ACGGCGCTGT TCGACGTTAC CAAGTGGCAC AAGATCGCCG CCCGCGGGGC AAATGGCACG GGCTCCGGCG ATATGCTGGC GTCGCTCAAT CTGTCGGATC TGACGAACAA GCCGCAGGCC CGGATCAATC TTGGGCTCGG TAACGTCGAC AACACGAACG ATGCCGGCAA GCCGATTTCG ACCGCCACGC AGGCCGCGTT CGATACGGTG AACGCGAGCC TGGCCGCTCT GACGGCCGCG CAGTTCGATT TTTTCACAGA CTGTGTTCCG TCTTACGTCA GCGTCACCAG TATCAGTTTC TCCTCAGGCG TCGGGCTATT CGGCAATAAG AAGCACATTC TGCCGGCCTA CACGAAGCTG ATGAGTGCTA CGTTCGCGGC CGGCGCCGGT GTCGGAATGC TCGACACCGG CACCATCGGC GCCAGCAAGA CCTATTTCCT GTTCGCGATC CGGAACACAT CGACGGGTGA TTGCGATTAT CTGGCTTCGT TGAGCCTGAC GCCGCTTGTT CCGGCCGGAT GGGAGCTGAA CTCCGGCAGC CGCATCGGGA TCATCTTAAC GAACGGCTCC AGCCAGATCA GGAACTTCGT CCAAACGGGC AACCAAGTTA CCATCATAGG AACGGCACAA CAGGTCTTTA CGACTTCGAC CTCCATTGCA GCGGCGCTGA TCGCACTTCC CAACTGCCCC GTTGGTATCT CGGTAGGTGC CATGCTGGCC CTTGATGTCT CGGCGTCCAC GAACGGTGAC GTCTCGGCGT ACCTCTCCGA CTACAGCGCT CCGGACGCTC AACGGGTCAG GGCCAGAACT TTCTGCGCGG CACAGCCTTC CGCTACGGTT GCTCAAGCCA ATTACGCGCC GGTGCGGACA AACACCCTGG CGCAAGTCTA CCGGTCCGTG GGCGTCGTGA CAGGCCCCGC AACGGCGACC GGCTACATCA ACGGGTGGGT AGACCACCAA TGCAAAAGGC TTTTCCCATG A
|
Protein sequence | MANPNPYTPS YSFSGWQTSN PAKPLPAPQV DNELANISTS LNAAISGLLD IRRSDGKLKN GIVTFESLNN DLKAGYSGGA VSAWAPVVDY AAGIVATSIA PATVVVYQGE SYVCTTDHVT TALFDVTKWH KIAARGANGT GSGDMLASLN LSDLTNKPQA RINLGLGNVD NTNDAGKPIS TATQAAFDTV NASLAALTAA QFDFFTDCVP SYVSVTSISF SSGVGLFGNK KHILPAYTKL MSATFAAGAG VGMLDTGTIG ASKTYFLFAI RNTSTGDCDY LASLSLTPLV PAGWELNSGS RIGIILTNGS SQIRNFVQTG NQVTIIGTAQ QVFTTSTSIA AALIALPNCP VGISVGAMLA LDVSASTNGD VSAYLSDYSA PDAQRVRART FCAAQPSATV AQANYAPVRT NTLAQVYRSV GVVTGPATAT GYINGWVDHQ CKRLFP
|
| |