Gene Rleg_4362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4362 
Symbol 
ID8015921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4490524 
End bp4491882 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content65% 
IMG OID644826938 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_002978141 
Protein GI241207045 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00774436 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGAACG GCTCCGCGCC AAAGCCCGCC ACCGCCCGCA AATCCGCCGG TCTTACGGGC 
AGCGTCCGTA TTCCGGGCGA CAAGTCGATC TCGCATCGTT CCTTCATGTT CGGCGGGCTC
GCCTCCGGCG AAACGCGGAT CACCGGTCTG CTCGAAGGCG AGGACGTGAT CAATACTGGC
CGGGCGATGC AGGCGATGGG CGCCAGGATC CGCAAAGAAG GCGAACAATG GGTGATCGAC
GGCACCGGCA ACGGTGCACT GCTTGCGCCT GACGCGCCGC TCGACTTCGG CAATGCCGGC
ACCGGCGTAC GTCTGACCAT GGGCCTCGTC GGCACCTACG ATTTTCGCTC CACCTTCACA
GGCGACGCCT CGCTTTCGAA ACGGCCGATG GGCCGCGTGC TCAATCCACT GCGCGAAATG
GGCGTGCAGG TCAGCGCCTC CGAGGGCGAC AGGCTGCCGG TGACGCTCAG AGGCCCGGGA
ACGCCGAGCC CGATCCGCTA TCGGGTGCCG ATGGCATCCG CCCAGGTGAA ATCGGCCGTG
CTGCTTGCCG GCCTCAATAC GCCGGGTATC ACCACGGTCA TCGAGCCGGT GATGACGCGC
GACCATACCG AAAAGATGCT GCAGGGTTTT GGTGCTGCCC TGTCCGTCGA GACCGATAGC
GAGGGTGTGC GCACAATCCG GCTCGAAGGG CGCGGCAAGC TGGCGGGCCA GGTGATCGAC
GTTCCGGGCG ACCCCTCCTC CACCGCCTTC CCGCTGGTTG CGGCGCTGCT TGTTCCCGGC
TCCGACATCA CCATCGTCAA CGTGCTGATG AACCCGACCC GCACCGGGCT GATCCTGACG
CTGCAGGAGA TGGGGGCCGA CATCGAGGTG GCGAATGCGC GTCTTGCCGG CGGCGAGGAC
GTGGCGGATC TGCGCGTGCG CCATTCCGAA CTCAAGGGCG TCACCGTTCC CGAAGAACGT
GCGCCGTCGA TGATCGATGA ATATCCTATC CTCGCCGTCG CTGCCTGTTT CGCCGAGGGC
GCGACGATCA TGAAGGGGCT GGAGGAACTG CGCGTCAAGG AGTCCGACCG TCTTTCCGCC
GTCGCCGATG GCCTCAAACT CAATGGCGTC GACTGCGACG AAGGCGAGGA TTTTCTCATC
GTGCGCGGCC GGCCAGACGG CAAGGGCCTC GGCAATGCTG CCGACGGCAG GGTCAGCACA
CATCTCGACC ATCGCATCGC CATGAGTTTC CTTGTCATGG GGCTTGCCTC GGAGCATCCT
GTCACGATCG ACGACGCGGC GATGATCGCG ACCAGTTTTC CGGAATTCAT GCAGCTGATG
ACCGGCCTTG GAGCGAAGAT CGCGGAGGTG CCGGAATGA
 
Protein sequence
MLNGSAPKPA TARKSAGLTG SVRIPGDKSI SHRSFMFGGL ASGETRITGL LEGEDVINTG 
RAMQAMGARI RKEGEQWVID GTGNGALLAP DAPLDFGNAG TGVRLTMGLV GTYDFRSTFT
GDASLSKRPM GRVLNPLREM GVQVSASEGD RLPVTLRGPG TPSPIRYRVP MASAQVKSAV
LLAGLNTPGI TTVIEPVMTR DHTEKMLQGF GAALSVETDS EGVRTIRLEG RGKLAGQVID
VPGDPSSTAF PLVAALLVPG SDITIVNVLM NPTRTGLILT LQEMGADIEV ANARLAGGED
VADLRVRHSE LKGVTVPEER APSMIDEYPI LAVAACFAEG ATIMKGLEEL RVKESDRLSA
VADGLKLNGV DCDEGEDFLI VRGRPDGKGL GNAADGRVST HLDHRIAMSF LVMGLASEHP
VTIDDAAMIA TSFPEFMQLM TGLGAKIAEV PE