Gene Information |
Locus tag | TM1040_3201 |
Symbol | |
ID | 4075305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 196546 |
End bp | 197571 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638004710 |
Product | proline racemase |
Protein accession | YP_611437 |
Protein GI | 99078179 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3938] Proline racemase |
TIGRFAM ID | |
| |
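The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names `span_length` and `expected_protein_length` are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 196546
END_BP = 197571
GENE_LENGTH_BP = 1026
PROTEIN_LENGTH_AA = 341


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks agree with the values reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, End bp - Start bp + 1 gives 1026 bp, and 1026 / 3 - 1 gives 341 aa, matching the Gene Length and Protein Length fields.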
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0659896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.833754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGTT CCAAAACCAT TCATGTCATC TCCGCTCATG CAGAAGGTGA AGTCGGGGAT GTCATTGTTG GTGGCGTGCT GCCACCTCCG GGTGAAACGC TGTGGGAACA GCGCAATTTC ATCGCCAATG ACGAAACCTT GCGCAATTTT GTCCTGAATG AGCCACGCGG CGGCGTGTTT CGCCATGTGA ATCTTTTGGT TCCCCCCAAA CACCCAGAAG CCGTGGCGGC CTGGATCATT ATGGAACCTG AAGATACGCC ACCGATGTCG GGTTCAAATT CCATATGCGT TTCAACGGTT TTGCTCGACG GTGGGCTTGT GCCCATGCAG GAACCGGAAA CCCATATGGT TCTGGAAGCC CCCGGCGGCC TCGTCAGAGT CCGGGCCGAG TGTCACGATG GCAAGGCAAG GCGCATTTTC GTGCAGAACC TACCGAGTTT TGCCGACAAG ATCCAAGTGC CGCTCGACGT CAAAGGGCTT GGGACCATTA CCGTCGACAC AGCCTACGGA GGCGACAGTT TTGTGATCGT TGATGCCGAG GCTCTCGGAT TCGAGATCAC AGAGGTCGAA GCCAAGGACA TCGCCAATCT CGGTGTGCGT ATCACTGACG CAGCCAACGA ACAGCTTGGG TTCTGCCACC CCGAAAATCC GGAATGGGAT CACATTTCCT TCTGCGCGTT CTGCGGCCCG CTTCGTGAAA CTGAAACAGG TTTCACCAGC AAATCTGCCG TCGCCATTCA ACCTGGCAAA GTCGATCGCT CCCCTACCGG CACGGCAGTG TCAGCACGAA TGGCGTTGAT GTCTGCGCGC GGTCAGATGA CGGCGGGCCA GAGCTTTGAA GCGGTTTCTA TCATCGGGTC GAGTTTTACG GGCCAGATCT TGGACGTCAC GCAGGTTGGC ACGAAGCCGG CGATCATTCC AGAAATCAGC GGCCGCGGTT GGGTTACCGG CATACACCAA CATATGCTCG ACCCCGAAGA TCCCTGGCCC GAAGGCTACC GGCTGTCTGA CACCTGGGGA GCCTGA
|
Protein sequence | MRSSKTIHVI SAHAEGEVGD VIVGGVLPPP GETLWEQRNF IANDETLRNF VLNEPRGGVF RHVNLLVPPK HPEAVAAWII MEPEDTPPMS GSNSICVSTV LLDGGLVPMQ EPETHMVLEA PGGLVRVRAE CHDGKARRIF VQNLPSFADK IQVPLDVKGL GTITVDTAYG GDSFVIVDAE ALGFEITEVE AKDIANLGVR ITDAANEQLG FCHPENPEWD HISFCAFCGP LRETETGFTS KSAVAIQPGK VDRSPTGTAV SARMALMSAR GQMTAGQSFE AVSIIGSSFT GQILDVTQVG TKPAIIPEIS GRGWVTGIHQ HMLDPEDPWP EGYRLSDTWG A
|
| |
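The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It is illustrative only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares for elongation codons), and the sketch assumes the sequence begins with an ATG start codon as this gene does. Only the first 30 bases of the gene sequence are used as input.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
prefix = "ATGCGCAGTT CCAAAACCAT TCATGTCATC"
print(translate(prefix))  # MRSSKTIHVI, the first block of the protein sequence
print(gc_fraction(prefix))
```

Applying `gc_fraction` and `translate` to the full gene sequence should allow comparison with the reported GC content and protein sequence fields.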