Gene Information |
Locus tag | Rleg_3529 |
Symbol | |
ID | 8014393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3564340 |
End bp | 3565827 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826094 |
Product | hypothetical protein |
Protein accession | YP_002977314 |
Protein GI | 241206218 |
COG category | [S] Function unknown |
COG ID | [COG4222] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0210661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATG TCAGACCTTA CACAGCGCGG CTTTTCGCCG CGGTTCTCGC GGCATCCGCC GCCGTGCCAG TCGTCGCAAT GGCCGAGAAT TCCGCCACCG TCGGCGGCCT CACCTTCGTC AACAAGGGCC TGGTGGGCAT CGGCCGCATT CCGGCCAACC AGCGTGACAA ATTCGGCGAG ACCTTCGGTT CCGGCTCGGG CATGGCGATC GATCCCGCCG CCTGGAGCCG CGACGGCGCC GGCTACAAGG GCACGCTCTA CCTGTTGCCC GACCGCGGTT ACAACGCCGT CGGCACCGTT GATTACCGGC CACGTCTGAA CACCATCTCG ATCGGCCTGA CGCCGACCGC TCCGGGTGCG GCACCCGAGG TTGGCAAGGA ACAGTCGGGC GTCGACGCAA GGCTCGTCGA TTCCACACTC TTCGTCGACG ACAGGGGCGG CGACATGACC GGTCTCGACC CGGAATCCGG CGTCCGCCCC GCTGCCGGTG ATTTTCCGCC GCTGCCGCAG GCGACGAACG GCAAGATCGC GCTCGACAAT GAAGCCATCA TCCGCATGGC CGACGGCAGC ATGTTCGTCA GCGACGAATA TGGTCCTTAT ATCTATCACT TCTCCGCTGA CGGCCACCTG CTCTCCGCCA CCCAGCCGCC GAAGGCGCTT TTGCCGATGC GCAAGGGCGC GTTGAGCTTT GCCTCCAACA ATCCCGGCCC CGGCGCCTCC GCTCCGGACC CGAAAGATCC GGAGACCGGC CGCCAGAACA ACCAGGGTCT CGAAGGCATG GCGATGACGC CTGATGGTAA GTTCATCATC GCTGTGCTGC AATCGGCTGC CCGCCAGGAC GGCGGCGATT CCGGCTCGAC CCGGCAGAAT ACCCGCGCGC TGATCTATGA CGCCGCCGAT CCCGATCACC TGAAACTGAT GCACGAATAT GTCGTGCCGC TGCCGGTCTT CAAGGACGCC AAGGACAAGA CGATGATCGC CGCCGAGAGC GAAATCGTCG CCCTGTCCGA CAAGACCTTC CTGATGCTCG CCCGTGACAG CGGCAACGGC CAGGGCCTGA AGGGCGACAC CTCGCTCTAC CGCAAGGTCG ACATCGTCGA CGTTTCCGCC GCGACCGACA TTGCCGGCAG CAATTTCGAT GACGGCAAGC CGATCGCGCC GAAGGGCGTC ATCGATCCCT CGCTGACGCC GGCGACGCTG ATACCCTTCA TCGACCTCAA CGACAAGGTC GACCTCGCCC GCTTCGGCCT GCACAACGGC GCGCCGAACG ACAAGAACAA TCTGTCGGAA AAATGGGAAG CCATGGGTCT GGCGAGCGTT CTCAACCCGA ACCTGCCGGA CGACTACTTC CTGTTCGTCG CCAATGACAA CGACTTTTTG ACGCAGGATG GTTTCCAGGT GGGTGCCGCC TACAAGGCCG ACGGCGGTGC CGACGTCGAC ACAATGTTCC AGGTCTTCCA GGTCACCCTT CCAGGTCTGA AGAAGTAA
|
Protein sequence | MKNVRPYTAR LFAAVLAASA AVPVVAMAEN SATVGGLTFV NKGLVGIGRI PANQRDKFGE TFGSGSGMAI DPAAWSRDGA GYKGTLYLLP DRGYNAVGTV DYRPRLNTIS IGLTPTAPGA APEVGKEQSG VDARLVDSTL FVDDRGGDMT GLDPESGVRP AAGDFPPLPQ ATNGKIALDN EAIIRMADGS MFVSDEYGPY IYHFSADGHL LSATQPPKAL LPMRKGALSF ASNNPGPGAS APDPKDPETG RQNNQGLEGM AMTPDGKFII AVLQSAARQD GGDSGSTRQN TRALIYDAAD PDHLKLMHEY VVPLPVFKDA KDKTMIAAES EIVALSDKTF LMLARDSGNG QGLKGDTSLY RKVDIVDVSA ATDIAGSNFD DGKPIAPKGV IDPSLTPATL IPFIDLNDKV DLARFGLHNG APNDKNNLSE KWEAMGLASV LNPNLPDDYF LFVANDNDFL TQDGFQVGAA YKADGGADVD TMFQVFQVTL PGLKK
|
| |
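The Start bp, End bp, Gene Length, and Protein Length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
start_bp = 3564340
end_bp = 3565827
reported_gene_length = 1488      # bp
reported_protein_length = 495    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino-acid count, assuming an unspliced CDS with one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == reported_gene_length)
print("protein length matches:", computed_protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported fields (1488 bp and 495 aa).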