Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5803 |
Symbol | |
ID | 8016601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 378705 |
End bp | 381437 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644827939 |
Product | cyanophycin synthetase |
Protein accession | YP_002979139 |
Protein GI | 241518511 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0769] UDP-N-acetylmuramyl tripeptide synthase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02068] cyanophycin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0493543 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAGAGG TCGGCGTCTA CCGTGGCCCG AACCTTCACA GTCGAACCAA AATGGTCCGC ATCCAGTTGG ACCTTGGGAA GTTGGAACAG TATCCGACAA ATCTTCTGCC TGGCTTCGTT GATGCCCTTC TGAAACATGT TCCCGGATTG CGCGAGCATG GTTGTAGCTA CGGCGAACCC GGAGGGCTAG TTCTGCGCAT GGAAGAAGGC ACGTGGCTCG GACATGTCGC CGAGCATGTT GCTATCGAAC TGCAGAACAT CGCGGGCGCT GATGTTGCCC GCGGCAAGAC CCGCTCGGTG ACCAACATGC CCGGCGTCTA CAATGTCATG TTTGAATATG AGAATGAGGA TCTCGGCCTG TTGGCGGGAC GCTTCGCGCT AGAGCTCGTC AATTTTCTCT TGCCCGCCGA TTTGCAGGGG CTCGCTGGCG CCGACATGAT AGCGCCGTCC CCCCTTGCGC AGTTCACGAT GGCTGAAGCT CTGCGCGTTC TCCGCGCGGC GCATTCTGAG ATTGCCTTCG GCCCCACGAC TGCATCGATC GTTCGTGAAG CGGAAGCGCG CTCCATCCCC TGGCGAAGAC TTGACAACAG CAGCCTTGTC CAGCTGGGCT ACGGCAAACA TCTCAAGCGC ATCAGAGCAA GCTGCAGCTC CCTCACCTCC GAGATTGCAG CGGAAATCGC CAGCGACAAG GAACTGACCA AGCGCCTGCT GATCGAGGCT GGCCTACCGG CGCCGTGGGG GACTGTTGTC AGTAGCGCGG AAGAAGCGGT AGAGGCAGCA CAGTCGCTCG GCCTGCCCGT CGTCATCAAG CCGGTGGACG GCAACCATGG TAGAGGTGTC AATATCGGTC TTTCGAGTCA GAGCGAGGTG GAGTGGGGTT TCGAGCAGGC TCGAGCGCAT AGCAGCCAGG TTCTAGTGGA GCAGCAGTTT GTCGGCGGGG ATCACCGCAT CCTTATTATC GGCGGCAGGC TGGTCGCCGC TGCAAAGCGT GTCCCCGCCC ACGTCATCGG CAACGGCAGG GTGACGATCG AGCAGTTGAT CAACCGCGAG AACGAAGATC CTCGCCGTGG CGAGGGGCAT GAGGCGGCAC TTACCCGGAT CTCGATCGAC GAATGCCTCA TCCACTACAT CGCCCGTTCC GGATTCACTC TCTCTAGCGT TCCGGAAAGG GGGAAACCGG TCATGCTCCT GCCAACTGCG AACCTGTCAA CCGGTGGCAC CGCCATCGAC TGCACCGAAG ACATTCATCC GGACAATGCA CTGATTGCCT GTCGCGCGGC GCAGATTATC GGCCTCGACA TAGCCGGCAT AGACTTCGTC GCGCCGGACA TCAGACGCTC GGTTCTTCAG ACGGGCGGGG GCATCATCGA GGTCAATGCG GGACCGGGGT TTCGCATGCA TCTTTATCCC TCCCAGGGCA AGCGGCGTGA CGTTGCAGGT GCGGTTTTGG ATCTTCTTTA TCCGCCAGGC GCGCCCAGCC GAGTGCCGGT ACTGGCCGTG ACAGGCACGA ACGGCAAAAC GACAACGACC CGAATGCTGG CTCATATCCT CGCCGCCGAT GGCCAGATAG TCGGAATGAC GTCTAGCAAC GGCGTCTACA TCGACGGCCG GCGTATCATG GAGGGCGACT GCACCGGTCC AAGGAGCGCG CGTGTTGTGC TGGGCGAACC GACCATAGAC GTCGCTGTTC TCGAAACGGC GCGCGGTGGA CTGCTTCGTG AGGGACTGGC ATTCAACGCT TGTGACATAG GCTGCGTGAC GAATGTGACG GCTGATCACC TTGGCCTTCG GGGTGTCCAT TCGGTGGAAG ACCTCGCCGC TGTAAAGTCG GTGGTGGTTG AAGCCGTTCA TGAAGATGGC TGGAGCATCC TCAACGCCGA TGACCCACTG GTCGCCGCCA TGCGCGAGGA AGCCGGTGGC CATATCTGCT ACTTCTCCAC GAAAGCACCC AAGCAGTGGC CGGACTTTTT GAAGGACCAC GTTTTCCACG GCGGCCGAGC CCTGGGATGC GACCTGGCTA CCGGCCTGTT CGACATGATA CTCTACGACA AGGCACAGAA AATGCTGATC TGCCGCGTCG ACGAGATCCC CGCCACCATG AACGGAATGG CTGCCTTTAA TGTCGAGAAT GCGCTGGCAG CGTCAGCGAT GGCCGTATGC AACGGTGTTC CTTTGCCCGT CATCCGTGAG GCGTTGCGTT CATTTGGAAC ATCCTACGAG CAGTCGGCTG GACGTCTTAA CATGGTCGAG CGCGAGGGGG TCCGTATAAT CGTGGATTAT GCCCATAATC CTGGAGGTCT TCGCGCCCTT GGGAGGCTGG TCGGAAAGCT TAGGACCGGG AACAGCAGCT GCATTGGCGT CGTCGGCATC GGCGGTGACA GACGCGACCA GGATATTTAT GAAATGGGCG AGATTGCGGC TAGCGTTTTT GACCGCTTGA TTCTGAAGGA GGACTATGAT TTGCGGGGAA GATCGGCTGG CGAGGTTGTT GCCATTCTTC GTCAAGGCGC GCTCAAAGCG GGCTTCGACG CTTCCAAACT CGAGGTCGTG CTCCGCGAAC ACGAGGCGGT CGAGCGCGCG CTGGGCTCAG CCCTAAATGA CGACTTAATA GTCGTCACCG CTGACGACAT CGCCCGTGTT TGGAAGCGGG TCTCTGCCCC GCCGGCCAAC GAACTCGATC CTCTCCCATA CAAGGGCGCT ATTCCACTCC AATCAGAAAT TCGGGTTTTC TGA
|
Protein sequence | MQEVGVYRGP NLHSRTKMVR IQLDLGKLEQ YPTNLLPGFV DALLKHVPGL REHGCSYGEP GGLVLRMEEG TWLGHVAEHV AIELQNIAGA DVARGKTRSV TNMPGVYNVM FEYENEDLGL LAGRFALELV NFLLPADLQG LAGADMIAPS PLAQFTMAEA LRVLRAAHSE IAFGPTTASI VREAEARSIP WRRLDNSSLV QLGYGKHLKR IRASCSSLTS EIAAEIASDK ELTKRLLIEA GLPAPWGTVV SSAEEAVEAA QSLGLPVVIK PVDGNHGRGV NIGLSSQSEV EWGFEQARAH SSQVLVEQQF VGGDHRILII GGRLVAAAKR VPAHVIGNGR VTIEQLINRE NEDPRRGEGH EAALTRISID ECLIHYIARS GFTLSSVPER GKPVMLLPTA NLSTGGTAID CTEDIHPDNA LIACRAAQII GLDIAGIDFV APDIRRSVLQ TGGGIIEVNA GPGFRMHLYP SQGKRRDVAG AVLDLLYPPG APSRVPVLAV TGTNGKTTTT RMLAHILAAD GQIVGMTSSN GVYIDGRRIM EGDCTGPRSA RVVLGEPTID VAVLETARGG LLREGLAFNA CDIGCVTNVT ADHLGLRGVH SVEDLAAVKS VVVEAVHEDG WSILNADDPL VAAMREEAGG HICYFSTKAP KQWPDFLKDH VFHGGRALGC DLATGLFDMI LYDKAQKMLI CRVDEIPATM NGMAAFNVEN ALAASAMAVC NGVPLPVIRE ALRSFGTSYE QSAGRLNMVE REGVRIIVDY AHNPGGLRAL GRLVGKLRTG NSSCIGVVGI GGDRRDQDIY EMGEIAASVF DRLILKEDYD LRGRSAGEVV AILRQGALKA GFDASKLEVV LREHEAVERA LGSALNDDLI VVTADDIARV WKRVSAPPAN ELDPLPYKGA IPLQSEIRVF
|
| |