Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4022 |
Symbol | |
ID | 8014828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4098741 |
End bp | 4099991 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644826591 |
Product | hypothetical protein |
Protein accession | YP_002977802 |
Protein GI | 241206706 |
COG category | [S] Function unknown |
COG ID | [COG4223] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.483809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.844233 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATCGG GAAACCCGCC ACGCCATTCG AAGAGCGCCG ACGAACCGGT CACGATCGAC CTCGATGCAC AGGAATTCGC CGCTGCGGCC GATACCGAAA AACCGGTGAA CAATGAAACT GCCGACGCCG ACAGCACCGC TGCCGCCGAT GTCGGCCTGC CGCCCGAAAC CGAGACTGCG TCGCATGCCG AATATGAAGA GAAGCCTGTG ATGGAGGCCC CGGAGGAGGA ACCGGCAGCC CCAGAACCGT CCTTTACCCC TCCTCCCGAA CAGCCTGAGC CAAAGAGCGC CGGCACCTCC GGTCTCATTG CTGCGGGCAT CTTCGGCGGC CTCGTGGCGT TGCTTGGCGC CGGCGCCATC CAGTATGCCG GTTACCTCCC AGGCTCCTCC GCACCGCAGA CGACCTCGCC GGAGACGGCC AATCTTGCCG GTGAGATCGA CGGCCTGAAG CAGTCCGTCG CCAACCTTGC CGCCAATCCG GCGAGCACAG ATAACGGCGA GCTTGCGAAA CGCGTCGCTG CGCTGGAAAC GGCTGCAAAA GCTCCCGCAG CCGGCGCACC GGCCGATTCG GCAAATGTCG AGGCACTCAA CCAGAAGATT GCGGAGCTGA CCGGTCAGGT CGACCAACTG CGCTCTACGC TTACCCAGTC ATCCGAGCAG CAGACGACGA ACGGCGCCGA TATCGCCAAG CGCCTCGAAG AGGCCGAAAA GAAGCTGAAC GAGCCGCGCG AGGACGTCGC CGTTGCCCGG GCTATCGCGG CTGCCGCCCT GAAGGCGGCG ATCGATCACG GTGGCCCGTT CCTGGCCGAA CTCGACACTT TCGCCGGTGT CGCACCCGAC GATCCAGCCG TCGCCGACCT TAGAGCCTTT GCCGAAACCG GCATTCCCTC ACGCACCGAG CTGGTGGGCG AGGTTCCCGA TGTCGCCACC GCGATCGTCG AAGCCGTCAA CCAGCCGGAT CCGAATCAAA GCTGGTCGGA CCGGCTGATG TCGAGTGCCA AGTCGCTGGT GAGCGTCCGT CCCGTCGGCA ATATCGAGGG TGAAAGCGTC GAAGCCATCG CCGCCCGCAT GGAGGAGAAG GTGAAGAACG GCGACCTGCC CGGCGCTTCC GCCGAATGGA ACAACCTGCC GGCTCTCGGC AAGCAGGCCT CCGCCGCCTT CAAGCAAACG CTCGAAGCGC GCATCCGCGT CGAGGAACTG GTCGGCGGGG CGCTGTCGAA AGCGGTCTCC GGCACCGGCA AGGAGGGATG A
|
Protein sequence | MVSGNPPRHS KSADEPVTID LDAQEFAAAA DTEKPVNNET ADADSTAAAD VGLPPETETA SHAEYEEKPV MEAPEEEPAA PEPSFTPPPE QPEPKSAGTS GLIAAGIFGG LVALLGAGAI QYAGYLPGSS APQTTSPETA NLAGEIDGLK QSVANLAANP ASTDNGELAK RVAALETAAK APAAGAPADS ANVEALNQKI AELTGQVDQL RSTLTQSSEQ QTTNGADIAK RLEEAEKKLN EPREDVAVAR AIAAAALKAA IDHGGPFLAE LDTFAGVAPD DPAVADLRAF AETGIPSRTE LVGEVPDVAT AIVEAVNQPD PNQSWSDRLM SSAKSLVSVR PVGNIEGESV EAIAARMEEK VKNGDLPGAS AEWNNLPALG KQASAAFKQT LEARIRVEEL VGGALSKAVS GTGKEG
|
| |