Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2723 |
Symbol | |
ID | 8013671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2709319 |
End bp | 2710650 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644825295 |
Product | protein of unknown function DUF21 |
Protein accession | YP_002976525 |
Protein GI | 241205429 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATT CCGCCGGCGG GCTCTCCGAC TATATCGGGA TACTAGCCGT GTTTCTTCTC GTCGCCGCCA ATGGCTTCTT CGTTGCCGCC GAATTCGCCC TGGTGTCAGT CAGGCGTAGC CGCGTCGCCG AACTCGCTGC GGCAGGCCGC ATGAACGCCT CGGCACTTCA GCGCGCCGTC GACAATCTCG ACTCCAACCT TGCAGCCACC CAGCTTGGCA TCACCATCTC GTCGCTCGCC CTCGGCTGGG TTGGCGAACC GGCGCTTGCC CATCTGATCG AGCCGCTGCT GTCCTGGCTG CCCGGGCAAT GGGCGACGGC GGGCGCGCAT ACTGTTGCCG TCGTCATCGC CTTCGTCATC ATCACGGCCC TGCATATCGT GCTCGGCGAG CTCGCGCCGA AGAGCCTAGC GCTTCAACGC AGCGAGGCCA CTTCGCTTGC CGTGGTGCGC CCGCTCGGTC TGTTCCTGGT GCTGTTTAAG CCGGCGATCT TTGTCCTGAA CGGCATGGGC AACATGGTGC TGCGGGGCGT CGGTCTTCGC GCCGGAACCG GGGAATCGTC GTTCCATTCG CCGCAGGAGC TCAAGCTGCT GGTCGCTGAG AGCCAGGAGG CCGGCCTTCT CAACCAGGTG CAGCAGCAGC TCGTCGAGCG GGTGTTCAAC ATCGGCGACA GACCGATCTC CGACATCATG ACCCCGCGTC TCGACATCGA ATGGTTCGAC GCCGACGACA GCGAGGCCGA GATCCTGAAG ACCATCCGCG AATGCAGCCA CGAGCAATTA CTGGTCGCCC GCGGCTCGAT CGACGAGCCG ATCGGCATGG TGTTGAAGAA GGACCTTCTC GATCAGGTTC TCGACGGCGG CAAGGTCCGG CCGATGGAGG TGATCAAGCA GCCGTTGGTG CTGCATGAGG GCACCTCGGT CGTCCGCGTG CTCGACAGTT TCAAGGCCTC ACCCGTTCGC CTCGCCATCG TCATCGACGA ATATGGCAGC CTCGAAGGCA TCGTCACCCA GACCGACCTG CTCGAAGCCA TCGCCGGCGA CCTGCCGGGA TCCAACGAAG AGCCCGACAT CGTCGTCAGG GAAGACGGGT CGCTCTTGAT CGATGCGATG ATGCCGGCCT TCGACGCCTT CGAACGGCTG GGCCTGCGCG ATCGTCCGGA TGCCGATTTC CATACGCTGG CGGGCTTTGC GCTGCATCAG CTCCAGCACA TCCCCGAAGC CGGCGAAACC TTCGTCTTCG ACAACTGGCG CTTCGAAGTG CTCGACATGG ACGGCATGCG TATCGACAAG ATGCTCGCGA CGCGCATTCC CGCGGATGGG GCGGAGGCCT AG
|
Protein sequence | MSDSAGGLSD YIGILAVFLL VAANGFFVAA EFALVSVRRS RVAELAAAGR MNASALQRAV DNLDSNLAAT QLGITISSLA LGWVGEPALA HLIEPLLSWL PGQWATAGAH TVAVVIAFVI ITALHIVLGE LAPKSLALQR SEATSLAVVR PLGLFLVLFK PAIFVLNGMG NMVLRGVGLR AGTGESSFHS PQELKLLVAE SQEAGLLNQV QQQLVERVFN IGDRPISDIM TPRLDIEWFD ADDSEAEILK TIRECSHEQL LVARGSIDEP IGMVLKKDLL DQVLDGGKVR PMEVIKQPLV LHEGTSVVRV LDSFKASPVR LAIVIDEYGS LEGIVTQTDL LEAIAGDLPG SNEEPDIVVR EDGSLLIDAM MPAFDAFERL GLRDRPDADF HTLAGFALHQ LQHIPEAGET FVFDNWRFEV LDMDGMRIDK MLATRIPADG AEA
|
| |