Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4633 |
Symbol | |
ID | 8015377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4755971 |
End bp | 4757068 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827208 |
Product | protein of unknown function DUF917 |
Protein accession | YP_002978408 |
Protein GI | 241207312 |
COG category | [S] Function unknown |
COG ID | [COG3535] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.951403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.119918 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACGCA TACTCGTTGA GAAGGACGTG GAAGCTGCCG TCAAGGGCGG CTCCGTCTAT GCCGCCGGCG GCGGCGGCTG GGCCGATCAC GGGCGGATGC TTGGTTATGC CGCCGTCAAT GTCGGCAAGC CGGAGCTGGT CTCGATCGAC GAATTGCGGG ACGAGGACTG GATCGCGACT GCGGCTGCGA TCGGCGCGCC GGCCTCCACC ACGCCCTGGG AAATGCAAGG CATCGACTAT GTGAAGGCGG TGCAATTGCT GCAGGAGGCG CTGGGCGAAA AGCTTTCCGG GCTGATCATC GGCCAGAACG GCAAGTCCTC GACGCTGAAC GGCTGGCTGC CCTCGGCGAT CCTCGGCACC AAGGTAGTCG ACGCCGTCGG CGATATCCGC GCACATCCGA CGGGCGACAT GGGCTCGATC GGCATGGCCG GTTCGCCCGA GCCGATGATC CAGACCGCTG TCGGCGGTAA TCGCGCCGAG AACCGTTACA TCGAACTGGT GGTGAAGGGG GCGACGGCGA AGATCTCGCC GGTGCTGCGC GCCGCAGCCG ACCAATCCGG CGGCTTCATC GCCAGCTGCC GCAATCCGCT CCGCGCCTCC TATGTCCGCA GCCATGCAGC ACTCGGCGGC ATATCGATGG CGCTTGCGCT CGGCGAAGCG ATCATCGCGG CGGAGAAGCG CGGCGGATCT GATGTCATCG ACGCGATCTG CAAGACGACG GGCGGACATA TCCTTGCCGA AGGCGTCATC ACCCGCAAGG ACGTCGTCTA TACCAAGGAA GCCTTCGACA TCGGCACGAT CACCGTCGGC GCAGGCGAAA CGTCGGTGAC GCTGCATGTG ATGAACGAAT ATATGGCGGT GGACGATGCG GATGGCGGGC GGCTAGCGAC CTTCCCCGCG GTGATCACCA CGCTTTCACC AGAGGGCGAG CCGCTGAGTG TCGGCCAGCT CAAGGAGGGC GTGCATGTGT TCATCCTGCA TGTGCCGATG GATATCATTC CGCTGTCGGC AAGCGTGCTC GATCCGACCG TCTATCCCGT CGTCGAAAAG GCGATGGGGA TCGAGATCGC ACGCTATGCA CTGGCGACGA AGGCCTGA
|
Protein sequence | MGRILVEKDV EAAVKGGSVY AAGGGGWADH GRMLGYAAVN VGKPELVSID ELRDEDWIAT AAAIGAPAST TPWEMQGIDY VKAVQLLQEA LGEKLSGLII GQNGKSSTLN GWLPSAILGT KVVDAVGDIR AHPTGDMGSI GMAGSPEPMI QTAVGGNRAE NRYIELVVKG ATAKISPVLR AAADQSGGFI ASCRNPLRAS YVRSHAALGG ISMALALGEA IIAAEKRGGS DVIDAICKTT GGHILAEGVI TRKDVVYTKE AFDIGTITVG AGETSVTLHV MNEYMAVDDA DGGRLATFPA VITTLSPEGE PLSVGQLKEG VHVFILHVPM DIIPLSASVL DPTVYPVVEK AMGIEIARYA LATKA
|
| |