Gene Information |
Locus tag | Rleg2_2812 |
Symbol | |
ID | 6981556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2856213 |
End bp | 2858402 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643397524 |
Product | anthranilate synthase |
Protein accession | YP_002282308 |
Protein GI | 209550391 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I; [COG0512] Anthranilate/para-aminobenzoate synthases component II |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase; [TIGR01815] anthranilate synthase, alpha proteobacterial clade |
|
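The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2856213
end_bp = 2858402
gene_length_bp = 2190
protein_length_aa = 729


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of codons excluding the final stop codon.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the 2190 bp span corresponds to 730 codons, one of which is the stop codon, which matches the reported 729 aa.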
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0309756 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAACGA TCCTGCGGGA TGATGGTGCG GAAATCTACG AGACCAAGGG CGGGATATCG GTCACGCGGC AGCGGCGTGC GATCCCCTAC GGCGACGCGG TCTCTTCCTA TATCGACAAG CTCGATGAGC GCCGCGGCGC GGTGTTTTCG TCGAACTACG AATATCCGGG ACGTTATACG CGCTGGGATA CCGCCGTCGT CGATCCGCCG CTCGGCATCT CCTCCTTCGG GCGCAACGTC TGGATCGAAG CCTATAACGA ACGCGGCGAG GTACTTCTCG GCTTCGTGAC CGAGCGGCTG AAAACGGTGT CCGATCTCGT GCTCGGCGCT TCCTCCGCCC GCCGTCTCGA CCTCTCGGTC AAAGCGCCGG ATCGGGTGTT CACCGAGGAA GAGCGCTCGA AGATGCCGAC GGTCTTTACC GTGCTGCGTG CCGTCACCGA CCTCTTCTAT TCGCAGGCGG ATGCGAGCCT TGGGCTTTAC GGCGCCTTCG GCTATGACAT CGCCTTCCAG TTCGATGCAA TCGATCTGAA GCTGACCCGG CCCTCCGACC AGCGCGATAT GGTGCTCTAC CTGCCGGACG AAATCCTCGT CGTCGACAAC TATGCCGCCA AGGCCTGGAT CGACCGTTAC GATTTCGAAA AGGGCGGCGT GACGACCGAG GGCAAGGCGC AAGACATCGC GCCGGAGCCT TTCAAGCAAA CGGATGCCAT TCCCCCGAAG AGCGATCACC GGCCGGGCGA ATATGCCGAT CTCGTCGTCA AGGCGAAGGA AAGTTTCCGC AAGGGCGATC TCTTCGAAGT CGTGCCCGGG CAGAAATTCA TGGAGCGCTG CGAAAGCAAG CCTTCCGACA TTTCCAAGCG GCTGAAGGCG ATCAATCCGT CTCCCTATTC CTTCTTCATC AATCTCGGCC ATCAGGAATA TCTGGTCGGT GCCTCGCCCG AGATGTTCGT GCGCGTGTCC GGCCGCCGGA TCGAGACCTG CCCGATCTCG GGCACGATCA AGCGCGGTGA CGACCCGATC GCCGACAGCG AGCAGATCCT GAAGCTCTTG AATTCCAAGA AGGACGAATC CGAGCTGACC ATGTGCTCCG ACGTCGACCG CAACGACAAG AGCCGCGTCT GCGAGCCTGG CTCGGTCAAG GTGATCGGCC GCCGGCAGAT CGAGATGTAT TCGCGCCTCA TCCACACGGT CGACCATATC GAGGGCCGGC TACGCGACGA TATGGACGCC TTCGACGGGT TCTTGAGCCA TGCCTGGGCC GTGACCGTCA CCGGCGCTCC GAAGCTCTGG GCGATGCGCT TTATCGAGAG CCGTGAAAAG AGCCCGCGCG CATGGTATGG TGGAGCGATC GGCATGGTCG GCTTCAACGG CGACATGAAC ACCGGCCTGA CGCTGCGCAC GGTGCGCATC AAGGACGGTA TCGCCGAGGT ACGCGCCGGG GCGACGCTGC TCAACGATTC CATTCCTGAA GAAGAAGAAG CCGAAACAGA ATTGAAGGCC TCTGCCATGC TTTCCGCCAT CCGCGACGCC AAGACCGGCA ATTCCGGCAA GACCCAGCGC GATGTCGCAA GCGTCGGCAA GGGCGTTAAC ATCCTGCTCG TCGACCATGA GGACAGCTTC GTCCACACGC TTGCCAATTA TTTCCGCCAG ACGGGGGCGA GCGTTTCGAC CGTGCGCACG CCGGTGCCGG AGGAAATCTT CGACCGGCTG AACCCGGACC TCGTCGTGCT GTCGCCCGGG CCTGGAACGC CCAAGGATTT CGACTGCAAG GCGACGATTA AGAAAGCGCG GGCGCGCAAC 
CTGCCGATCT TCGGCGTCTG CCTCGGCCTG CAGGCGCTCG CCGAGGCCTA TGGCGGGGAA CTGCGCCATC TGGCGCTGCC GATGCACGGC AAACCCTCGC GCATCCGCGT GCTGGAGCCG GGCATCGTCT TCTCCGGCCT GTCGAAGGAG GTGACGGTCG GCCGCTACCA CTCGATCTTC GCCGATCCCT CGACGCTGCC GCGCGACTTC ATCATCACCG CGGAAAGCGA AGACGGCACG ATCATGGGCA TCGAGCACGC CAAGGAGCCG ATCGCCGCGG TGCAGTTCCA CCCGGAATCG ATCATGACGC TCGGCGGCGA TGCTGGCATG CGGATGATCG AAAATGTCGT GGCGCATCTG GCCCGCAAGG CGAAGACCAA GGCTGCCTGA
|
Protein sequence | MVTILRDDGA EIYETKGGIS VTRQRRAIPY GDAVSSYIDK LDERRGAVFS SNYEYPGRYT RWDTAVVDPP LGISSFGRNV WIEAYNERGE VLLGFVTERL KTVSDLVLGA SSARRLDLSV KAPDRVFTEE ERSKMPTVFT VLRAVTDLFY SQADASLGLY GAFGYDIAFQ FDAIDLKLTR PSDQRDMVLY LPDEILVVDN YAAKAWIDRY DFEKGGVTTE GKAQDIAPEP FKQTDAIPPK SDHRPGEYAD LVVKAKESFR KGDLFEVVPG QKFMERCESK PSDISKRLKA INPSPYSFFI NLGHQEYLVG ASPEMFVRVS GRRIETCPIS GTIKRGDDPI ADSEQILKLL NSKKDESELT MCSDVDRNDK SRVCEPGSVK VIGRRQIEMY SRLIHTVDHI EGRLRDDMDA FDGFLSHAWA VTVTGAPKLW AMRFIESREK SPRAWYGGAI GMVGFNGDMN TGLTLRTVRI KDGIAEVRAG ATLLNDSIPE EEEAETELKA SAMLSAIRDA KTGNSGKTQR DVASVGKGVN ILLVDHEDSF VHTLANYFRQ TGASVSTVRT PVPEEIFDRL NPDLVVLSPG PGTPKDFDCK ATIKKARARN LPIFGVCLGL QALAEAYGGE LRHLALPMHG KPSRIRVLEP GIVFSGLSKE VTVGRYHSIF ADPSTLPRDF IITAESEDGT IMGIEHAKEP IAAVQFHPES IMTLGGDAGM RMIENVVAHL ARKAKTKAA
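The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The following sketch shows one way to do that and to compute a GC fraction. It uses only the first three blocks and the last block of the gene sequence as sample input; the function names `clean` and `gc_fraction` are hypothetical helpers. Applying `gc_fraction` to the full gene sequence would give a value to compare with the reported GC content field.

```python
def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


# First three blocks and final block of the gene sequence shown above.
head = "ATGGTAACGA TCCTGCGGGA TGATGGTGCG"
tail = "GGCTGCCTGA"

start_codon = clean(head)[:3]   # first codon of the CDS
stop_codon = clean(tail)[-3:]   # last codon of the CDS

print(len(clean(head)), start_codon, stop_codon)
print(round(100 * gc_fraction(head), 1))
```

The first and last codons extracted this way are consistent with the protein sequence starting with M and with the gene length including a stop codon.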