Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2341 |
Symbol | |
ID | 6981080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2402165 |
End bp | 2403385 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643397054 |
Product | hypothetical protein |
Protein accession | YP_002281842 |
Protein GI | 209549925 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.159022 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCGC TGATCACAGC CGCCGCGAGG GCGCTGGCAT CGGGTGATGC GCTTGGCGCG CTGAAGCGGG TGGCGCTGCG CGACGACGCA CCGGCGCTGG CGTTGCGCGG CATTGCGATG GCTCAACTCG GTGACCTCGT CCGCGCCAAG GCGCTATTGA AGAGTGCGGC GCGCGCCTTC GGACCGCGGG AGGCGGTGGC GCGGGCGCGA TGCGTGGTCG CCGAGGCCGA GATCGCGCTT GTCTCACGCG ATCTCATCTG GCCGCCGAAG GCGTTGGAAT CGGCGCGCAG GGTGCTGGAA GCGCATGGCG ACAGGGTCAA TGCCGCCCAT GCGGGCAATG TCGCCATTCG CAGGCTGGTG CTGATCGGCC GCCTCGACGA GGCAGAGCGT GCGCTCGCAG CACTCGATCC GGCGCCGCTG CCGCCGGCGC TGCGGACTGC CCACGAACTG GTGGCGGCCG GCATCGCCGT TCGGCGCCTG CAGACGCAGG CAGCTCGCGC CGCCCTCGAC CGGGCAAGGC TTTCGGCGCG CGCGGCCGCG ATCCCCGCTT TGACGGCGGA GGTCGAAACC GCCGCGCTGG TGCTCGACAC CCCGGCGGCG CGGCTGATCT CGCACGGACG GGAGCACCCG CTGTTGCTGA CCGAGGTGGA AGCGCTGCTT TCCTCCAGCA CCCTCGTCGT CGATGCCTGC CGCCATGTGG TGTGGAATGC GGGCACTGCC GTCTCGCTCG CGACTCGGCC GGTGCTGTTT GCACTTGCCC GCACACTCGC CGAAGCCTGG CCGGGGGATG TGGCAAGGGG TACGCTGATT GCCCGCGCCT TCCGCGGCAA ACATGCCGAT GAATCGCATC GCGCCCGGCT GCGGGTCGAG ATCGGCCGGC TGCGCGCTGA ACTGCGGGGA CTGGCGGAGG TTTCGGCGAC AAAGCGCGGT TTTGCGCTTT CCCCGCGCGG TGCCCGGGAG ATTGCCGTGC TGGCGCCCCT CGTCGAAGGG GCGCATGGTG CGGTGCTCGC GTTTCTGGCC GATGGGGAGG CGTGGTCGAG TTCGGCGCTG GCGATTGCGC TCGCGGCCAG CCCCCGCACC GTGCAGCGGG CGCTGGATTC GCTGGCCGGT GAAGGCAAGG TCCAATCGTT GGGGCGCGGG CGGGCACGGC GCTGGATGAG CCCGCCGCTG CCTGGTTTCC CGACGATCTT GTTACTCCCC GGGCCGCTGC CGAGCGATTA G
|
Protein sequence | MDSLITAAAR ALASGDALGA LKRVALRDDA PALALRGIAM AQLGDLVRAK ALLKSAARAF GPREAVARAR CVVAEAEIAL VSRDLIWPPK ALESARRVLE AHGDRVNAAH AGNVAIRRLV LIGRLDEAER ALAALDPAPL PPALRTAHEL VAAGIAVRRL QTQAARAALD RARLSARAAA IPALTAEVET AALVLDTPAA RLISHGREHP LLLTEVEALL SSSTLVVDAC RHVVWNAGTA VSLATRPVLF ALARTLAEAW PGDVARGTLI ARAFRGKHAD ESHRARLRVE IGRLRAELRG LAEVSATKRG FALSPRGARE IAVLAPLVEG AHGAVLAFLA DGEAWSSSAL AIALAASPRT VQRALDSLAG EGKVQSLGRG RARRWMSPPL PGFPTILLLP GPLPSD
|
| |