Gene Information |
Locus tag | Rleg2_3147 |
Symbol | |
ID | 6981892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3220848 |
End bp | 3221858 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397857 |
Product | protein of unknown function DUF808 |
Protein accession | YP_002282640 |
Protein GI | 209550723 |
COG category | [S] Function unknown |
COG ID | [COG2354] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
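The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3220848
end_bp = 3221858
reported_gene_length = 1011   # bp
reported_protein_length = 336  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the span works out to 1011 bp, or 337 codons, of which 336 encode amino acids, which agrees with the reported gene and protein lengths.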
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTG GCTTGATTGC CCTTCTTGAT GATATCGCCG CATTGGCCAA GGTGGCTGCG GCCTCGCTTG ACGATATTGC CGGCCAGGCA GCCAAGGCAG GCGCGAAAGC GGCAGGTGTC GTCATCGATG ACGCGGCGGT CACACCCCGC TACGTCACTG GATTTTCGGC CGCACGCGAA TTGCCGATCA TTGGCAAGAT TGCGGTGGGT TCGCTGAAGA ACAAGCTTCT GATCCTGCTT CCGGCGGCGC TTATCCTCAG CCTCGTCGCA CCGCAGGCGA TCACACCGCT GCTTATGATC GGCGGACTTT TTCTCTGCTA TGAGGGCGTG GAAAAAGTCT ACGGACTGGT GCTGCCGCAT GCGGCCCACG CCCATGAATC GGCACTCGAG GCGACAAGCC TCGATGCGCA ATCGCTCGAA GACGAGAAGG TCGCCGGTGC GATCAAGACC GATTTCATCC TGTCGGCGGA AATCATGGCG ATTACGCTTG CCGCAGTTCC GGCAGGCAGC ATCTTCGCGC AGGCCTTCAT CCTTGCCGTC GTTGGATTGG GCATAACAGT CATGGTCTAT GGCGGCGTGG CGCTGATCGT GAAGGCCGAT GATCTGGGGC TGATGATGGC GCGTGCACAG ACGGCGCCTA TGCTGCGCGC GATCGGACGA GGGCTGGTCA CCGGCATGCC TTATTTTCTG AAAGCGCTCG GCATTGTTGG AACGGCTGCG ATGATCTGGG TCGGTGGCGG CATCATCGTT CACGGCCTCG AAGCCTATGG AGTGGCCGGC CTTGCGCATC TTATCCATGA TGCCGGCGAG GTCGCCGTCC ATGCCGTTCC CCTTCTGGCC TCTGTGCTGC GCTGGACCGT CGAAGCGGCA GGTGCCGGCA TCGTCGGTAT TGTCGCCGGC TTGATCACAA TTCCGGTCGC AGCTTACGTT ATCTCGCCGA TGTGGCGGTA CCTCAAATCA CTCCTGCCGC GTCGCCGGGG GAAAGAGGCG CTGGCGGACG GGAAAAAATG A
|
Protein sequence | MSVGLIALLD DIAALAKVAA ASLDDIAGQA AKAGAKAAGV VIDDAAVTPR YVTGFSAARE LPIIGKIAVG SLKNKLLILL PAALILSLVA PQAITPLLMI GGLFLCYEGV EKVYGLVLPH AAHAHESALE ATSLDAQSLE DEKVAGAIKT DFILSAEIMA ITLAAVPAGS IFAQAFILAV VGLGITVMVY GGVALIVKAD DLGLMMARAQ TAPMLRAIGR GLVTGMPYFL KALGIVGTAA MIWVGGGIIV HGLEAYGVAG LAHLIHDAGE VAVHAVPLLA SVLRWTVEAA GAGIVGIVAG LITIPVAAYV ISPMWRYLKS LLPRRRGKEA LADGKK
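The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The sketch below is an illustration rather than the method used to generate this record. It assumes the gene sequence is shown in the coding orientation (it begins with ATG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares with the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Only the first and last blocks of the gene sequence are used here; the full sequence can be passed in the same way.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from the first base; '*' marks a stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last three blocks of the gene sequence in this record.
first_blocks = "ATGAGCGTTG GCTTGATTGC CCTTCTTGAT"
last_blocks = "CTGGCGGACG GGAAAAAATG A"

print(translate(first_blocks))  # MSVGLIALLD
print(translate(last_blocks))   # LADGKK*
```

The two translated fragments match the start (MSVGLIALLD) and end (LADGKK) of the protein sequence, with the final TGA codon read as the stop. Calling `gc_fraction` on the full gene sequence gives a value to compare with the reported GC content.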