Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3863 |
Symbol | |
ID | 5318300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 320381 |
End bp | 321382 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775675 |
Product | hydroxyproline-2-epimerase |
Protein accession | YP_001312608 |
Protein GI | 150376012 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3938] Proline racemase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.365358 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCC ACACCTTCTC CTGCATAGAC GGCCACACCT GCGGAAATCC CGTCCGCCTC GTCTCTGGCG GGGGGCCGCG CCTGGAAGGC GCGAACATGC TGGAAAAGCG CGCGCATTTC CTCAAGGAAT TCGACTGGAT CCGCACCGGC CTCATGTTTG AGCCGCGCGG TCACGACATG ATGTCGGGCT CCATTCTCTA TCCGCCGACG CGGCCCGATT GCGACGTCGC GGTGCTCTTC ATCGAGACCT CCGGCTGCCT GCCCATGTGC GGCCATGGTA CCATCGGCAC CATCACCATG GGGATCGAGA ACGGCCTGAT CACACCGCGT GAGCCCGGCA GGCTCTCCAT CGATGCCCCC GCCGGCAAGG TGGACATCAC CTACCGCCAG GAAGGCCGCT TCGTCGAGGA AGTGCGCCTC ACCAATGTTC CCGGCTTCCT TTATGCCGAA GGGCTGACGG CCGAGGTCGA GGGCCTGGGC GAAATCGTGG TGGACGTTGC CTATGGCGGC AATTTCTACG CCATCGTCGA GCCGCAGAAA AACTTCCGCG ACATGGCCGA TCATACGGCG GGGGAACTGG TGGGCTGGAG CCCGAAGCTG CGTGCCGCGC TGAACGAAAA ATACGAGTTC GTCCATCCCG AGCACCCGGA GATCCGGGGC TTGAGCCATA TCCAGTGGAC CGGAAAGCCG ACGCAATCCG AGGCACACGC CCGCAACGCG GTGTTCTACG GCGAGAAGGC CATCGATCGC TCGCCCTGCG GAACGGGCAC CTCGGCCCGC ATGGCGCAGC TCGCCGCCAA GGGAAAGCTG AAGGTGGGCG ACGAATTCGT TCACGAGTCG ATCATCGGAT CGCTCTTCAA GGGACGCGTC GAGGCGGCCG CGAAGGTCGC GGATCGCGAT GCGATCATCC CATCGATTGC CGGCTGGGCA AGGATGACCG GCATCAACAC CATTTTTATC GATGACCGCG ACCCCTTCGC CCATGGCTTC GTCGTAAAAT GA
|
Protein sequence | MATHTFSCID GHTCGNPVRL VSGGGPRLEG ANMLEKRAHF LKEFDWIRTG LMFEPRGHDM MSGSILYPPT RPDCDVAVLF IETSGCLPMC GHGTIGTITM GIENGLITPR EPGRLSIDAP AGKVDITYRQ EGRFVEEVRL TNVPGFLYAE GLTAEVEGLG EIVVDVAYGG NFYAIVEPQK NFRDMADHTA GELVGWSPKL RAALNEKYEF VHPEHPEIRG LSHIQWTGKP TQSEAHARNA VFYGEKAIDR SPCGTGTSAR MAQLAAKGKL KVGDEFVHES IIGSLFKGRV EAAAKVADRD AIIPSIAGWA RMTGINTIFI DDRDPFAHGF VVK
|
| |