Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6156 |
Symbol | |
ID | 6983229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011370 |
Strand | - |
Start bp | 91901 |
End bp | 95005 |
Gene Length | 3105 bp |
Protein Length | 1034 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643399172 |
Product | hypothetical protein |
Protein accession | YP_002283928 |
Protein GI | 209552012 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATACC GAAAAATAAC AAAAGCGCAA AAGTCGCGAG TTCGCTCTTC AGCTCTTACA CCACAAAAGC GTCGAGTACG CGTCTCTAAG GGGGCGCACG CCACTTTACT ATCTATTCGG AATGAGGCTC CGACGGTGGA GGTACTCAGC CGGACGATCG AGTGGACCTT GGCGCGGTAT AAGTACCATC GTTATCTGGT CTCGTTAGTA CCACGAACCC CTTCGCAGCT TCCGGATTAC GTTTTAGCCA CAGGAGATCA TCTTGAGGAT GCGCTAAAGT GGCAATTAGC TGCAATCTCG GCGGCGCGCG CAAAAGTCGA GAGGCATAAT CGGAACGCTT ACACCAGCAT TGACCAGGTA ACCGACTACA GCGCCTCCCT TGAAAGTGCC AACGCGGAAA TGCGAACCTG TATGTGGAGC TATTCGGCAG CTTCACAAGC TCTTTTTTCG TTAGCAAAGA CCAAAGGCCT AGATGCCCAG CGCCGTTGGA TGCAGAGAAA CGTTTATGTT TCCAATCCCT CAATCTCAAA CATTATTCTC TACAGCAAGG GGATTGCGTC AGAAGGGGAT CGGCATCCTG TAGACATCAT GGATGTTTTG AACAGGTTCA TTTTCTCGAA AATGGTTGAC GACGAGTTGA ACCTCCTGCT GTATTACCTG ATATTCAATC CGCCTTTGAA TTTGGACGCG GCAGCAAAGC TTTCCCCCTT ACTTTTATAT TTTCCCTTAG TTGATCAATA CGAATTCCTG GCCAATCTCG TTACGTCCGA TCCAAATTCT TCGGGGCCAG AAGGTTTTCC ATACTCCGGG GAATTTATCG AACTCTTGTC CGCTACAGGT GATTGGAGAG GGAAATCGCG GGCATCGCTT GCCGAAAATA CTGAGCCGAC TTTCCTTTCA TTGCCTATCG TAAATCGGTG GTGCGGCTCT CTTCTCGACA GCATTGGGGT ACTCGACGGA ATTTCCTCCG CCCCTGATCG CGAACTTGAT ATCGCACTCG CAAATGAATT TTTCCATCAA CCGCAGTCTC CTCGAGCGTA CCTCGCAAGC TCGCTCTTCG CAATGAAATC CGCGCAAACT TTGGCAGACG TGAAGTCCGC GCTCTATCGC AGAGAAATTG CAAACGCGCA TTTCAATAGC ATGGGGCTCA ACTCGGATCA CATCGAGCGG CGACGTATAG AGTTTACTTT TGACGCTCTC TCAGTCGAGG TCGGAAAGTC TGAGTCTGTC TATGAAAATC GAGAGCTTTT ACGAATTGCT TGTATCTGCG GAATAAGCGA AGGGAGGACT CTTGAGACGC TTATTCTCCT GTTTAGCTAC ACAGGCCAGG ATCCGCTAGC TGCGGGATAT TTTCCAGCGA GTTTGTTCTC AAGCAGTATA ACAGAAGACG AGGTGGCCGA TATCGGACAC GATGCCAGAG TTGCGATTGC GCTTTCGCGT GTTGCTGGTA GCCTCGGAGA TGAGGGGCAA AATCTGGTCT ATATAGCAGT AGAACAGCAT CTCAGTGAGC GCGGGGTCAC CAAGCCGAGC GAGCTAGCTG TCGAAGGGCT TATCGACATC GCTTTTCTCA GTGAAGCGTG CACTTCCGCG TCCTTGAGGC AATCACTTGA GTTTCTGTCC AAGGCTGAGA TGGAGGAAGA GCGGATAAAG GTGCTCTTGA ATTTGGCGCA GGCGAATTCG GAGAATGAGG ACGAGTACAT TGACGAGGTT CACGCAATCA TCGGTCAACA GACCATCGAA GAGCTCCTTC AGAGGTTTCA TGTAGGCAAG GTACAATGCG ACGAGCAAGC GCTTGCAACA TGGGCGCTGA CAGAGTTGTC GCCAAAATTC AATCGCCTTA AGGATTTCAT CGACGCTGGC TTGCCGCCTG TCGAGAAGAA TGCTGACGTT GAGTTCATCG CCCATTTGAC TTCAGGAAAG TCTGAAACCT TCACGTTCAA AGTCCCGAAT AACGAGTCCC TTGATATTGC CCGGACCATA CTGGCCGAGC TGAACTCAAA GTATGCGTTG GACCCTCGCT ACGGTGTCGA TTCCTATTTG AGTCTCGGCA TGCGTCATGG TGCCGTCGAA GCTCACCTTC AGAGCCCGCT AATCGCAGAG AATATTCTTA CTACCAAGGA GGCCCTTGGT TATCCAGAAG ACTGCTTCTG GAAGGGGTAT TTCCTTGATA ACGGTTATGA GTACTACGGG GAGATGATTG GGCCGATTTT GGCTAGATTT TCGGAGAAGT TCGATAATAA ACTGGAAGCG ATCAAGAATG ATCTCCTGCA GGTGCGACGG CCCGATAAAC CAGAGGGGCT GATTGTTGCT GACTGGTCGG AAGCATCGGT CTTGTCCACG TGCGCCAGAT TCGCTGAGGT CGTTAACTTC GAGGCATTGA TCACGGAATT CACGTCAATT TTCTGGGCGA ACATAGAAGG TAATCTTGGC AACGCTCGCG AGTTCATTGA AAACGTGCTT TCGAACGAAC TAAATGAGCT GATCGACGAG TTAGAAGCCG ATGTGCGTCA AGCTACCGGA CAACAGAGGT TACCTCCGTT CTCGGATGCT CTGATGCGGG CGCGCGAGGA ACTCAGCAAC GCGGTGAAAG ATATATCTTC GTGGCATAAT GTTGCTCGCT CCACCCATGT CGAGCCTCTT GGATTGGTCG ATATCATTAG CGCGGCGCAG AAAATTGTCT GCCGCCTTTA TCCCGATTTT CAGCCGCGTG TGACGTTTTC TGGAGAAACA GGGATCTCAG TAACCTATTC GCTGCAGGTG CTAATCGAGG TCTTCAAAGC GCTGTTTACG AACGTGTACG CTCATTCTGA AGTCGAGACG CCGTCTGTCA ATGTACACAT GACAGTCTCA GGCGAAGACG CGCTGAATGT TGAATTTGCC AGTGACTGCA AGGATCTGAA CAAGGCCGAG CAAGCAGCCT TAGATAACAA TGAAAAAATC AAAACTGGCG AATACGAAAA AAAACTACCG AAAGAAGGTG GGTCCGGATT GGCGAAGGTT GCTCGTTCAA CTCTCCGGGA CGGCAAACCA AACACTATCA TTTCCGTTGA TCATGTTGCT AGAAAGTTTC GTGTGAGTAT GACGTTCAGA ATAATCCAGA TTTGA
|
Protein sequence | MAYRKITKAQ KSRVRSSALT PQKRRVRVSK GAHATLLSIR NEAPTVEVLS RTIEWTLARY KYHRYLVSLV PRTPSQLPDY VLATGDHLED ALKWQLAAIS AARAKVERHN RNAYTSIDQV TDYSASLESA NAEMRTCMWS YSAASQALFS LAKTKGLDAQ RRWMQRNVYV SNPSISNIIL YSKGIASEGD RHPVDIMDVL NRFIFSKMVD DELNLLLYYL IFNPPLNLDA AAKLSPLLLY FPLVDQYEFL ANLVTSDPNS SGPEGFPYSG EFIELLSATG DWRGKSRASL AENTEPTFLS LPIVNRWCGS LLDSIGVLDG ISSAPDRELD IALANEFFHQ PQSPRAYLAS SLFAMKSAQT LADVKSALYR REIANAHFNS MGLNSDHIER RRIEFTFDAL SVEVGKSESV YENRELLRIA CICGISEGRT LETLILLFSY TGQDPLAAGY FPASLFSSSI TEDEVADIGH DARVAIALSR VAGSLGDEGQ NLVYIAVEQH LSERGVTKPS ELAVEGLIDI AFLSEACTSA SLRQSLEFLS KAEMEEERIK VLLNLAQANS ENEDEYIDEV HAIIGQQTIE ELLQRFHVGK VQCDEQALAT WALTELSPKF NRLKDFIDAG LPPVEKNADV EFIAHLTSGK SETFTFKVPN NESLDIARTI LAELNSKYAL DPRYGVDSYL SLGMRHGAVE AHLQSPLIAE NILTTKEALG YPEDCFWKGY FLDNGYEYYG EMIGPILARF SEKFDNKLEA IKNDLLQVRR PDKPEGLIVA DWSEASVLST CARFAEVVNF EALITEFTSI FWANIEGNLG NAREFIENVL SNELNELIDE LEADVRQATG QQRLPPFSDA LMRAREELSN AVKDISSWHN VARSTHVEPL GLVDIISAAQ KIVCRLYPDF QPRVTFSGET GISVTYSLQV LIEVFKALFT NVYAHSEVET PSVNVHMTVS GEDALNVEFA SDCKDLNKAE QAALDNNEKI KTGEYEKKLP KEGGSGLAKV ARSTLRDGKP NTIISVDHVA RKFRVSMTFR IIQI
|
| |