Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4337 |
Symbol | |
ID | 8015916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4458869 |
End bp | 4460725 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826913 |
Product | hypothetical protein |
Protein accession | YP_002978116 |
Protein GI | 241207020 |
COG category | [S] Function unknown |
COG ID | [COG4289] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.78015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0000641887 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCTATG ATCCCGCCAG CGCCAACCCG CTGGCCGGCA ATCCGCTGGA AACCCGCGCC GATATGAGCC GCGCCCTTCT TGCGCTCTTC GATCCGCTGC TTGCCTGTTT CTCGAACGGC AATGCCCGCG TCACGCTCAA TGGCGGCGGC GCCCATTTCG ACCGGGCGGC GGCCGATCTG GAAGGGTTCG CCCGCCCGCT CTGGGGATTG GCGCCGCTTG GCGCCGGCAA CGGCGACTTC GCCCACTGGC ATCGTTTTGC CGAAGGTCTC GCAAACGGCA CCGATCCCGC CCATCCCGAA TATTGGGGAA CGGTCAATGG CCGCGACCAG CGGATGGTCG AGCTTGCGGC TCTGGGTTTT GCGCTGGCTC TGGTGCCCGA AAAGATCTGG GAGCCGCTCG ATGCGCGCGC CCGTAGCAAT GTCATCGCCT ATCTCAAGCA TGCCCGGCAG TTCGATTATG CCGACAACAA TTGGAAATTC TTCCGGATCT TTGTCGATAT CGCGCTCGAT CGTCTCGGCG CCGATTTCGA CCGCAGCCTG ACGCGGCAAT ATCTTGAGGA GCTCGAAGGC TTTTATATCG GCGACGGCTG GTATCGCGAC GGAAACGTCC GCCGCATCGA CCACTACATT CCCTTCGCCA TGCATTTCTA TGGCCTGATC TATTCGAAGC TCGTCGACGA CGATTATGCA AAGCGCTACC GCGAGCGCGC GGTCCTCTTC GCTCGCGATT TCCGGCATTG GTTTGCCGCC GACGGGGCGA CGATCCCCTT CGGCCGCAGC CTGACCTATC GTTTTGCCTG CGCCGGCTTC TGGTCGGCGC TCGCCTTTGC CGATCTCGAG GCTCTGCCCT GGGGCGAGGT CAAGCATCTC TGCCTGCAGC ATCTGCGCTG GTGGAAGGAC AAGCCGATTG CCGATCGCGA CGGTGTGCTG TCGATCGGTT TCGGCTATCC GAACCTGCTG ATGTCGGAGA GTTACAATTC CGCCGGCTCG CCTTACTGGG CCTTCAAGGC CTTCCTGCCG CTGGCGATCG CTGAGGATCA CCCATTCTGG ACGGCGAAGG AGAAAGTGCC GGAACAAGCA CCTGAGATCG TCCCCCAACG TCATCCCGGC ATGGTGATCA TGCGGGCGGG CGGCGATGTC ATTGCGCTGT CGTCCGGCCA GGAAAACCTG CAGATGCGGT GCGGCACGGA AAAATATGCG AAGTTCGCCT ATTCGGCCCG CTACGGCTTC AGCGTCGAGG CCGATGAGCG TGCCTTTGCG CTTGCCGCCT TCGATTCAGC GCTTGCCTTC AGCGATGACG GCCTGCACTA CCGCGTCCGC GAAACGAACG AGGAAGCCAA GCTCGCAGGG GAGGTGCTTT ATGCGAAATG GTCGCCTTTT GCCGATGTCG ACGTTGAAAC CTGGCTCGTG CCCGCTGCAC CTTGGCATAT CCGCCTCCAT AGGATCAGGA CAAGCCGGCC GCTACGGATT GCCGAAGGCG GATTTGCCAT CGGCCGCCGG GACTTTGAGC TGGATACCCT GTCCGCTTCG GGTGGGTTTG CCTATGCGGT CGGCGAAGCC GACTTCACAG GCATTCTCGA TCTCGGCTCT TCGGTCAAAC GTTCGGGTGT TGTCCAGAAG GCAATGCCCA ACACCAATGT GATCGTCGCG AAAACCCTCG TGCCGCAGCT GCGCGGGCAG ATTCCGACCG GTGAAACCAT CCTGATGACG GCAGTGCTGG CGCTTGACGA TCCCGCCGCC CTCTTGTCTG CCTGGACGAG ACCGCCGAAA GCGCCTGAGA TTGCAGCGCT GGAGGCCCTG GTGAGGGAAA AGGGCGTGAC AGTCAGTGCC ATCGAAGCGC CCGGACAAAT GCCATGA
|
Protein sequence | MPYDPASANP LAGNPLETRA DMSRALLALF DPLLACFSNG NARVTLNGGG AHFDRAAADL EGFARPLWGL APLGAGNGDF AHWHRFAEGL ANGTDPAHPE YWGTVNGRDQ RMVELAALGF ALALVPEKIW EPLDARARSN VIAYLKHARQ FDYADNNWKF FRIFVDIALD RLGADFDRSL TRQYLEELEG FYIGDGWYRD GNVRRIDHYI PFAMHFYGLI YSKLVDDDYA KRYRERAVLF ARDFRHWFAA DGATIPFGRS LTYRFACAGF WSALAFADLE ALPWGEVKHL CLQHLRWWKD KPIADRDGVL SIGFGYPNLL MSESYNSAGS PYWAFKAFLP LAIAEDHPFW TAKEKVPEQA PEIVPQRHPG MVIMRAGGDV IALSSGQENL QMRCGTEKYA KFAYSARYGF SVEADERAFA LAAFDSALAF SDDGLHYRVR ETNEEAKLAG EVLYAKWSPF ADVDVETWLV PAAPWHIRLH RIRTSRPLRI AEGGFAIGRR DFELDTLSAS GGFAYAVGEA DFTGILDLGS SVKRSGVVQK AMPNTNVIVA KTLVPQLRGQ IPTGETILMT AVLALDDPAA LLSAWTRPPK APEIAALEAL VREKGVTVSA IEAPGQMP
|
| |