Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4942 |
Symbol | thyX |
ID | 6978036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 583001 |
End bp | 583951 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643394095 |
Product | FAD-dependent thymidylate synthase |
Protein accession | YP_002278913 |
Protein GI | 209546995 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1351] Predicted alternative thymidylate synthase |
TIGRFAM ID | [TIGR02170] thymidylate synthase, flavin-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.497472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATTGGAT TGACGACCGA GCAAGCTGCC GAAATCGAAT CTATCAGGGC TACCCCGTCA AGCACCCTGC GGCCGGTGTC GCCCGGCCTC GAAGAGATAC TCCACAAGTC GTTCCCGGTT CTGGACCACG GCATTATCAG GGTGGTCGAC TACATGGGCG ACGACAGCGC GATCGTCCAG GCAGCGCGGG TATCCTACGG CCGGGGTACT AAGCGGATCC AAGAGGACAG CGGACTTATC AAGTATCTCC TTCGCCATCT CCACACGACT CCATTCGAAA TGGCAGAGAT CAAGTTCCAT GTGAAGCTGC CGATCTTCGT CGCGAGGCAA TGGATCCGTC ACCGCATGGC GAGCGTGAAC GAATATTCGG CCAGGTATTC CGTACTCGAC AACGAATTCT ACATTCCGCA GCCCGAACAC CTGGCCGCGC AGTCGACGTT GAACCGTCAG GGACGCGGGG CGGTGCTGGA GGGTGACGAG GCGGCGGAAG TCATCTCACT TCTGCGCCGC GACGCCGAGC AGGCATACGA CGACTACTCG ACGATGCTGA ACGATCCCGG CAGCCCCGAT CATCGCGACG ACCGGTCCGG GCTCGCGCGC GAACTGGCCC GCATGAATCT TTCACTCAAC TTCTATACCC AATGGTACTG GAAGACCGAC CTGCACAATT TGATGAACTT CCTGCGGCTG AGGGCCGATT CGCACGCCCA ATTTGAAATA CGCGCCTATG CGGAGATCAT GCTGGAAATC ATGGCGGCCT GGGTTCCAAT CGCCCATTCG GCGTTCAAGG AATACAGGAT GGAAGCAGTT CAATTGTCGG GTACGGCCGT AAAGGCGGTC CGCCGGATGA TCGCCGGAGA AGCGGTCAGT CAGGAGAACA GTGGGCTGAG TGCACGCGAG TGGCAGGAAC TCAGTGCCGC CCTCGACCTC CCTTCGCTAA AGGCATCCTA A
|
Protein sequence | MIGLTTEQAA EIESIRATPS STLRPVSPGL EEILHKSFPV LDHGIIRVVD YMGDDSAIVQ AARVSYGRGT KRIQEDSGLI KYLLRHLHTT PFEMAEIKFH VKLPIFVARQ WIRHRMASVN EYSARYSVLD NEFYIPQPEH LAAQSTLNRQ GRGAVLEGDE AAEVISLLRR DAEQAYDDYS TMLNDPGSPD HRDDRSGLAR ELARMNLSLN FYTQWYWKTD LHNLMNFLRL RADSHAQFEI RAYAEIMLEI MAAWVPIAHS AFKEYRMEAV QLSGTAVKAV RRMIAGEAVS QENSGLSARE WQELSAALDL PSLKAS
|
| |