Gene Information |
Locus tag | Smed_4452 |
Symbol | |
ID | 5318604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 936403 |
End bp | 937482 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776254 |
Product | hypothetical protein |
Protein accession | YP_001313187 |
Protein GI | 150376591 |
COG category | [R] General function prediction only |
COG ID | [COG1073] Hydrolases of the alpha/beta superfamily |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
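
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 936403
END_BP = 937482


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1080, matching Gene Length
    print(protein_length(length))  # 359, matching Protein Length
```

The subtraction of one codon only holds because the record lists the gene as unspliced and not a pseudogene; a spliced or truncated feature would need different handling.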
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0556925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGC ACGTTGACAC GGACGACGTT TGCAACACCA GCAGGCGCAA CCTGATGAAG GGAACAGGCC TCGCCGTCGC GGCGATAAGC ATGATCCCGG CTGTGACGGC AACAGGGGCG TTCGCGCAGA GCCCTGCCTG GGACAAGGTG TTTCCGAAAA GCGAGAACGT CGATCACCAG AAGGTTTCGT TCAAGAACCG CTACGGCATC ACGATCGCCG GCGACCTCTA CCTGCCGAAG AACCGCGGCA GTCAACCCTT AGCAGCTCTT GCGGTCGCCG GCCCCTTTGG TGCCGTGAAG GAACAATCTT CGGGATTGTA CGCTCAGACC ATGGCCGAAC GCGGCTTCGC GGCGCTGGCC TTCGACCCTT CCTTTACCGG TGAAAGTGGT GGCGAGCCAC GCAACGTCGC TTCGCCGGAC ATCAACACGG AAGACTTCAG TGCCGCGGTC GATTACCTGG GACTGCAGCC CACCATCGAC CGCGAGCGGA TCGGCGTGAT CGGCATTTGC GGCTGGGGTG GCATGGCCCT GAACGCCGTC GCCGCTGACA AGCGCGTCAA GGCGGTCGTG GCCAGCACCA TGTACGACAT GACCCGTCTG ATGTCCAAAG GCTACAACGA CAGTGTCACG CAGGAGCAGC GGACGCAGAC GCTGGAGCAG TTGAGCCGCC AGCGCTGGGC GGACGCGGAG AAGAACGGGC CAGCCTATCA GCCGCCCTAC AATGTACTCA AGGGAGGCGA GGCTCAGTTC CTGGTCGACT ATCACGATTA CTACATGACG CCCCGCGGCT ACCATCCGCG CGCTGTCAAC TCCGGCAATG CCTGGACGCA GACCACGCCC CTGTCGTTCA TGAACATGCC GATCCTGACC TACATCGCCG AGATTTCCCC GCGCCCGCTT CTGCTCATCC ACGGCGAGAA CGCCCATTCG CGATACTTCA GCGAAACAGC CTTTGCCGCC GCAGCGGAGC CAAAGGAGCT GATGATCATC CCGAACGCCA ACCATACCGA TCTCTACGAC CGCATGGACA AGATCCCGTT CGACCGGATC GCCAAGTTCT TCGGGCAGCA TCTGGCCTAG
Protein sequence | MSEHVDTDDV CNTSRRNLMK GTGLAVAAIS MIPAVTATGA FAQSPAWDKV FPKSENVDHQ KVSFKNRYGI TIAGDLYLPK NRGSQPLAAL AVAGPFGAVK EQSSGLYAQT MAERGFAALA FDPSFTGESG GEPRNVASPD INTEDFSAAV DYLGLQPTID RERIGVIGIC GWGGMALNAV AADKRVKAVV ASTMYDMTRL MSKGYNDSVT QEQRTQTLEQ LSRQRWADAE KNGPAYQPPY NVLKGGEAQF LVDYHDYYMT PRGYHPRAVN SGNAWTQTTP LSFMNMPILT YIAEISPRPL LLIHGENAHS RYFSETAFAA AAEPKELMII PNANHTDLYD RMDKIPFDRI AKFFGQHLA
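
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched as follows. This is a hedged illustration under stated assumptions: the sequences are pasted in the space-separated 10-character blocks shown above, the input contains only A, C, G and T, and the first codon is ATG (as it is for this gene), so the alternative start codons permitted by translation table 11 need no special handling. The amino acid assignments used are those of the standard code, which table 11 shares for elongation codons. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence above.
    print(translate("ATGAGCGAGC ACGTTGAC"))  # MSEHVD
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm the two fields agree; `gc_percent` on the same input can be compared against the GC content field.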