Gene Information |
Locus tag | Rleg_2143 |
Symbol | |
ID | 8013161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2132754 |
End bp | 2134496 |
Gene Length | 1743 bp |
Protein Length | 580 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644824729 |
Product | protein of unknown function DUF521 |
Protein accession | YP_002975959 |
Protein GI | 241204863 |
COG category | [S] Function unknown |
COG ID | [COG1679] Uncharacterized conserved protein |
TIGRFAM ID | |
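
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch: it assumes the start and end positions are 1-based and inclusive (the record does not state its convention) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2132754, 2134496)
print(length)                     # 1743
print(protein_length_aa(length))  # 580
```

Both values match the Gene Length (1743 bp) and Protein Length (580 aa) fields in this record.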
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.180277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.183216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACA TACAACAACA GGCAGACGAA AAGGTGTTTG CCGGGCGCGC GCTTGTCGCT GGCTTCGCTA CGGGGTCGAT CGTATTCAGC GACACGGCGC TCAGCTTCTG GGGCGGCGTC GACTCCCAGA CGGGAGAAGT CATCGACCGT CACCATCCCC TGTCGACACA GGTCCTGACA GGAAAGATTC TCGCCATCCC CGGCGGACGC GGTTCGTGTA CGGGCAGCAG CGTGCTTATG GAGCTGATCA TGAACGGGCA CGCTCCCGCC GGCATCGTGG TCTCGCGCCA GGAGGAAATA CTGTCGCTCG GAGTGATCGT CGCTGACGAA GTGTTCGGCC GATCGATCCC CGTTGTCCAG CTCTCCGAAG ACGACTTCGC CGAGCTGCAC TCTATCCCTG AAGTCACCCT CATCGGCGAC AAGGTCATCG CGAGCTGGAT CGAAACCGCC CCCGGTTTCG ATGCCGCCGA TCGCACCTAT GGAAATTCGA TAGCGCTGAC ATCGAGGGAT CGGTCGGCTT TGAACGGGGA GATGGGCAAG GCTGTGCAGG TGGCAATGAG GGCCACGACG CGCATGGCTG AAATTCAGGG AGCGACGGAA CTTATCGACA TCTCCCAGGT CCACATCGAT GGCTGCATCT ACACCGGGCC GACCAGCCTG GAATTCGCGA AACGCATGCG GGACTGGGAG GGTAGGGTTG TCGTTCCAAC GACCCTGAAC TCGATATCGG TGGACCAGAT GCGTTGGCGG GAGCAGGGCG TGTCTCCCGG CGTCGCCGGA CCCGCATCCG AATTGGGTGA GGCTTACGCC TCCATGGGAG CACGAAAGAC GTTCACCTGC GCGCCCTATC AGCTGTCGTC CGCACCCAGG CAGGGTGAGC AGGTGGCCTG GGCGGAGTCG AACGCCGTCG TATTCGCGAA CAGTGTTTTG GGTGCTCGCA CTGCCAAGTA TCCTGACTAC CTGGATCTGT GCATCGCTCT CACCGGGCGC GCGCCGCTGA CCGGGCCGCA TATCGCCGAC AACAGGCGCG CCAGCCTCGT GGTCAATGTG TCGGGCTTCG TGTCTTGGGA TGACATGGTC TACCCCATTC TCGGCTACCA CATCGGCAAG CTTGTCGGAG ATGAAATTCC GGTCGTGATC GGCCTCGAGA CTTGGAAGCC GAATCTGGAC GACCTAAAGG CGTTCGGGGC GGGTTTTGCG ACGACATCAG GCTCTCCGAT GTTCCATATC GTCGGTGTCA CGCCGGAGGC CGACAGCCTT GAGAGCATTG TCGGAAGCAA CATCAAGGCG AGCTACGAAA TCTGTCCCAA GGACGTCGTT GCGGAGTGGA GGAAACTCAA CGGCGGGTCT GTGGATGCCA TCGAGTTCGT TGCGCTGGGC AATCCCCATT TCTCTTTCGA CGAATGCGAG CGGCTGGCCG CGCTCTGCGA AGGCTTGGCG AAACATCCCG ACGTCAAGGT TCTGGTTACC TGCAACCGTG CGACGTTCGA AAGGGCTTCC GCTGCGAATC TCGTCGGCAA ATTGTCGGAC TTCGGCGTAG AATTCGTAAC CGATGCGTGC TGGTGCACAC TTGCGGAGCC TGTCATCCCA AAATCGGTCG ATACAATTAT CACCAATTCC GCGAAGTTCG CTCACTACGG TCCCGGATTG ACGGGTAAGG CGCTTCGGTT TGGCAGCCTG GCGGACTGCG TCGCTGCTGC TTGTCAGGGC GAGTTGAAGC CTGTTCGTCC CACATGGGAC TGA
|
Protein sequence | MPDIQQQADE KVFAGRALVA GFATGSIVFS DTALSFWGGV DSQTGEVIDR HHPLSTQVLT GKILAIPGGR GSCTGSSVLM ELIMNGHAPA GIVVSRQEEI LSLGVIVADE VFGRSIPVVQ LSEDDFAELH SIPEVTLIGD KVIASWIETA PGFDAADRTY GNSIALTSRD RSALNGEMGK AVQVAMRATT RMAEIQGATE LIDISQVHID GCIYTGPTSL EFAKRMRDWE GRVVVPTTLN SISVDQMRWR EQGVSPGVAG PASELGEAYA SMGARKTFTC APYQLSSAPR QGEQVAWAES NAVVFANSVL GARTAKYPDY LDLCIALTGR APLTGPHIAD NRRASLVVNV SGFVSWDDMV YPILGYHIGK LVGDEIPVVI GLETWKPNLD DLKAFGAGFA TTSGSPMFHI VGVTPEADSL ESIVGSNIKA SYEICPKDVV AEWRKLNGGS VDAIEFVALG NPHFSFDECE RLAALCEGLA KHPDVKVLVT CNRATFERAS AANLVGKLSD FGVEFVTDAC WCTLAEPVIP KSVDTIITNS AKFAHYGPGL TGKALRFGSL ADCVAAACQG ELKPVRPTWD
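
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block could be joined into a contiguous string and given basic sanity checks. The helper names are hypothetical, and the checks assume the gene sequence is shown 5' to 3' on the coding strand, which appears to be the case here since it begins with ATG and ends with TGA even though the gene lies on the minus strand.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Opening and closing characters of the gene sequence above, for illustration only.
head = join_blocks("ATGCCGGACA TACAACAACA")
tail = join_blocks("CACATGGGAC TGA")
print(head)                  # ATGCCGGACATACAACAACA
print(tail.endswith("TGA"))  # True
```

Passing the full gene sequence through `join_blocks` and then `gc_fraction` would allow a comparison with the GC content field; the start codon is deliberately not checked because translation table 11 permits alternative start codons.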