Gene Information |
Locus tag | Rleg_0627 |
Symbol | |
ID | 8011808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 661784 |
End bp | 662881 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823217 |
Product | chorismate synthase |
Protein accession | YP_002974470 |
Protein GI | 241203374 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
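The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed numbers) and that the CDS ends in one stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 661784
end_bp = 662881

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the Gene Length (1098 bp) and Protein Length (365 aa) fields.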
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0254313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACA ATACATTCGG TCACCTCTTC CGCGTAACCA CCTGGGGCGA AAGCCATGGT CCGGCGCTCG GCTGCGTCGT CGACGGCTGC CCTCCGGGGC TCAGCTTCAA GCTCAAGGAC CTGCAGGTCT GGCTCGACAA GCGCAAGCCC GGCCAATCCC GCTTCGTGAC GCAGCGGCGC GAGGACGATC TGGTGAAGAT ACTATCAGGC GTCATGCTCG ACGCCGACGG CGAGACGATG ACGACCACCG GCACGCCGAT CTCGATGCTG ATCGAAAATA CCGACCAGCG CTCCAAGGAT TACGGCGAGA TCGCGAAACA ATACCGCCCC GGCCATGCCG ATTATACCTA CGACCTCAAA TACGGCATTC GCGACTATCG CGGCGGCGGC CGCTCCTCGG CGCGCGAGAC CGCTGCGCGG GTTGCCGCCG GCGGCATCGC CCGCCTTGTC GTACCAGGCG TTACCGTGCG CGGCGCGCTG GTGCAGATCG GCAAGTACAA GATCAACCGC CGCAACTGGG ACTGGGACCA GGTCGATCAG AACCCGTTCT TCTCGCCCGA TGCGGCGATC GTGCCGGTCT GGGAGGAATA TCTCGACGGC GTCCGCAAGA AGGGCTCCTC GATCGGCGCG GTCATCGAGG TGATTGCCGA AGGTGTGCCG GCCGGCCTCG GCGCGCCGAT CTATGCCAAG CTCGATCAGG ACATCGCCTC GCTGCTGATG TCGATCAACG CCGTCAAGGG CGTCGAGATC GGCAATGGTT TTGCCGCCGC CGAAACGTCA GGCGAAGACA ATGCCGACGA GATGCGCATG GGCAATGACG GCACGCCGAT CTTCCTTTCC AACAATGCCG GCGGTATTCT CGGCGGCATC TCCACCGGGC AGCCGGTGGT GGCGCGTTTC GCCGTCAAGC CGACCTCCTC GATCCTGACC GAACGCCAGT CGATCGATGC GGAGGGCAAG AATGTCGATA TCCGCACCAA GGGCCGGCAC GATCCCTGCG TTGGCATCCG CGCCGTGCCG ATCGGCGAGG CGATGGTCGC CTGCGCCGTC GCCGACCATT ATCTTCGCGA CCGCGGCCAG ACCGGCCGTC TGAAATAG
|
Protein sequence | MSHNTFGHLF RVTTWGESHG PALGCVVDGC PPGLSFKLKD LQVWLDKRKP GQSRFVTQRR EDDLVKILSG VMLDADGETM TTTGTPISML IENTDQRSKD YGEIAKQYRP GHADYTYDLK YGIRDYRGGG RSSARETAAR VAAGGIARLV VPGVTVRGAL VQIGKYKINR RNWDWDQVDQ NPFFSPDAAI VPVWEEYLDG VRKKGSSIGA VIEVIAEGVP AGLGAPIYAK LDQDIASLLM SINAVKGVEI GNGFAAAETS GEDNADEMRM GNDGTPIFLS NNAGGILGGI STGQPVVARF AVKPTSSILT ERQSIDAEGK NVDIRTKGRH DPCVGIRAVP IGEAMVACAV ADHYLRDRGQ TGRLK
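The relation between the two sequences can be sketched as a codon-by-codon translation. The listed gene sequence begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand; the sketch assumes that and does not reverse-complement. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard codon table is used here. The function name `translate` and the short test fragments are illustrative, not part of the record.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTCGCACA ATACATTCGG TCACCTCTTC"
print(translate(first_blocks))

# Last nine bases of the gene sequence: two codons followed by a stop.
print(translate("CTGAAATAG"))
```

The two fragments correspond to the first ten residues (MSHNTFGHLF) and the last two residues (LK) of the protein sequence listed above.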