Gene Information |
Locus tag | Rleg2_2388 |
Symbol | |
ID | 6981127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2449256 |
End bp | 2450584 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643397101 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002281889 |
Protein GI | 209549972 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.945135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGAAT CCGCGGGCGG CGTCTTCGAC TATGTCGGGA TACTCGCCGT GCTTCTACTG GTGGCCGCCA ACGGCTTCTT CGTCGCTGCG GAATTCGCGC TGGTGTCCGT CAGGCGCAGC CGTGTCACCG AACTTGCCGC TGCAGGCCGC ATGAATGCCT CCGCCCTACA GCGCGCCGTC GACAATCTCG ATGCCAATCT CGCCGCCACC CAGCTCGGTA TCACCATCTC GTCGCTGGCA CTGGGCTGGG TCGGCGAGCC GGCGCTTGCC CACTTGATCG AGCCTCTGCT GTCCTGGCTG CCCGGGCAAT GGGCGGCAAC AGGCGCGCAT ACCGTCGCCA TCGTCATTGC CTTCGTCATC ATTACGGCAC TGCACATCGT GCTGGGCGAG CTGGCGCCGA AAAGCCTGGC GCTTCAGCGC AGTGAGGCCA CTTCGCTTGC CGTAGTGCGT CCGCTGGGGC TCTTCCTGGT GCTGTTCAAG CCGGCGATCT TCGTTCTGAA CGGCATGGGC AACCTTGTGC TGCGGGGCGT CGGCCTTCGC GCCGGAACCG GGGAATCGTC GTTCCATTCG CCGCAGGAAC TCAAGCTGCT GGTCGCCGAA AGTCAGGAAG CCGGTCTTCT CAATCAGGTG CAGCAGCAGC TCGTCGAGCG GGTGTTCAAC ATCGGCGACA GACCGATCTC CGACATCATG ACCCCGCGTC TCGATATCGA ATGGTTCGAT GCCGACGACA GCGAGGCCGA GATTCTGAAG ACCATCCGTG AATGCAGCCA CGAACAATTG CTGGTCGCCC GCGGCTCGAT CGACGAACCG ATCGGCATGG TGTTGAAGAA GGACCTTCTC GACCAGGTTC TCGACGGCGG CAAGGTCCGG CCGATGGAGG TGATCAAGCA GCCGCTGGTG CTGCACGAGG GCACCTCGGT CGTGCGTGTG CTCGACAGTT TCAAGGCCTC GCCTGTCCGG CTCGCCATCG TCATCGATGA ATATGGCAGC CTTGAGGGTA TCGTCACCCA GACCGACCTG CTCGAAGCGA TCGCCGGCGA CCTGCCGGGA TCCAATGAGG AGCCCGATAT CATCGTGCGG GAAGACGGAT CGCTCTTGAT CGATGCGATG ATGCCCGCCT TCGACGCCTT CGAGCGGCTC GGTCTGCGCG ATCGTCCGGA TGCCGATTTC CATACGCTTG CAGGCTTCGC GCTGCACCAG CTCCAGCACA TCCCGGAAGC CGGCGAAACC TTCGTTTTCG ATAGCTGGCG CTTCGAAGTT CTCGATATGG ACGGCATGCG CATCGACAAA ATGCTGGCAA CGCGCATCCC CGCGGACGGG GAAGGCTGA
|
Protein sequence | MSESAGGVFD YVGILAVLLL VAANGFFVAA EFALVSVRRS RVTELAAAGR MNASALQRAV DNLDANLAAT QLGITISSLA LGWVGEPALA HLIEPLLSWL PGQWAATGAH TVAIVIAFVI ITALHIVLGE LAPKSLALQR SEATSLAVVR PLGLFLVLFK PAIFVLNGMG NLVLRGVGLR AGTGESSFHS PQELKLLVAE SQEAGLLNQV QQQLVERVFN IGDRPISDIM TPRLDIEWFD ADDSEAEILK TIRECSHEQL LVARGSIDEP IGMVLKKDLL DQVLDGGKVR PMEVIKQPLV LHEGTSVVRV LDSFKASPVR LAIVIDEYGS LEGIVTQTDL LEAIAGDLPG SNEEPDIIVR EDGSLLIDAM MPAFDAFERL GLRDRPDADF HTLAGFALHQ LQHIPEAGET FVFDSWRFEV LDMDGMRIDK MLATRIPADG EG
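The coordinate and length fields in this record are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustrative sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names (`gene_length`, `protein_length`, `gc_fraction`) are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(grouped_seq: str) -> float:
    """GC fraction of a sequence written in whitespace-separated blocks."""
    seq = "".join(grouped_seq.split()).upper()  # drop the spaces between 10-base groups
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values taken from the Gene Information table above.
length_bp = gene_length(2449256, 2450584)
print(length_bp)                  # gene length in bp
print(protein_length(length_bp))  # protein length in aa
```

Passing the full "Gene sequence" string from the record to `gc_fraction` gives the GC fraction that the "GC content" field reports as a rounded percentage.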