Gene Information |
Locus tag | Smed_5519 |
Symbol | |
ID | 5319821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 484148 |
End bp | 485440 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640777273 |
Product | epoxide hydrolase domain-containing protein |
Protein accession | YP_001314205 |
Protein GI | 150377610 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
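The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(484148, 485440)
print(length_bp, protein_length(length_bp))  # 1293 430
```

The strand does not affect this arithmetic: a minus-strand gene spans the same number of bases between its start and end coordinates.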
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAGA TACAGGCACA GGTGACCCGT CGAACTCTTC TTGCTGCGGC CGCGACCGCA GGCGCGGTGG GGATGGTTCC CCAAGCTTTT GCCGCCACCG AGGCTCCGGT GATCAAGCCG TTCAAGTTCA GGGCTACCGA TGAACAGCTG ATTGATCTCC ATCGCCGGGT GGCAGCCACC CGTTGGCCGG ACGAGGAAAA CGTAGCAGAC GACTCCCAAG GGGTGCGTCT CGAAACGATA AAGAAGCTCG CGAAGCACTG GAAACACCAT GACTGGCGCA ACGTCGAGGC CCGCCTCAAC GCGTTTCCGC AGTTCACGAC GGAGATCGAC GGACTCGACA TTCACTTCAT CCATGTGAAG TCGAAGCATG AAAACGCGCT ACCGATCATC ATCACCCATG GGTGGCCAGG CTCCGTGATC GAGCAGCTGA AGATCATAAA GCCCCTGACC GATCCGACCG CCTATGGCGG CACCGAAGCA GACGCCTTCC ACGTCGTTAT TCCGTCCCTC CCCGGCTACG GCTTTTCCGG CAAGCCTCGG GAGACCGGGT GGAACCCGCC GCGGATCGCG AAGGCCTGGG CCGTGCTCAT GGAGCGTCTG GGATATACGA AGTACGTCGC CCAGGGCGGC GACTGGGGTA ACGCGGTGAC TGAACTCATG GCCGTCCAGC AGCCCCCTGG CCTACTCGGC ATACACACGA ACATGGCCGC CACCGTTCCG GCCGAGATTT CGAAGTCCTT GGCCGCAGGA ACGCCCCCGG AAGGCTTGTC TGCCGACGAA AGGCGGGCGT GGGATCAGCT CGACGATTTC AACAAGAATG GCCTCGGCTA CGCTATCGAG ATGAACAACA GGCCGCAGAC ACTCTACGGC ATCGTGGACT CGCCGATCGG TCTTGCGGCG TGGATGCTCG ACCACGACAT CCGCAGCTAC CGCATGATCG CCAGGTCGAT TGACGGAGAA AAAGAGGGCC TTAGCCCTGA CGACGTCCTC GACAACGTTT CCCTGTACTG GCTGACAAAC ACGGCAATTT CTTCCGCGCG TCTTTATTGG GACAATGCCC ATCATCCGAG CGGCGGCTTC TTCGACCCCC GCGGCATCAA GATCCCCGTC GCTGTCAGTG CGTTTCCGGA CGAAATCTAC CAGGCGCCGC AGAGCTGGGC AAAGAAGGCA TATCCGAAGC TCATCCACTA CAACCGCCTG CCGAAGGGCG GCCACTTCGC AGCCTGGGAG CAGCCTGCGC TTTTCACCTC GGAACTCCGT GCCTCATTCA AGTCGCTTCG CGACCAGATT TGA
|
Protein sequence | MSQIQAQVTR RTLLAAAATA GAVGMVPQAF AATEAPVIKP FKFRATDEQL IDLHRRVAAT RWPDEENVAD DSQGVRLETI KKLAKHWKHH DWRNVEARLN AFPQFTTEID GLDIHFIHVK SKHENALPII ITHGWPGSVI EQLKIIKPLT DPTAYGGTEA DAFHVVIPSL PGYGFSGKPR ETGWNPPRIA KAWAVLMERL GYTKYVAQGG DWGNAVTELM AVQQPPGLLG IHTNMAATVP AEISKSLAAG TPPEGLSADE RRAWDQLDDF NKNGLGYAIE MNNRPQTLYG IVDSPIGLAA WMLDHDIRSY RMIARSIDGE KEGLSPDDVL DNVSLYWLTN TAISSARLYW DNAHHPSGGF FDPRGIKIPV AVSAFPDEIY QAPQSWAKKA YPKLIHYNRL PKGGHFAAWE QPALFTSELR ASFKSLRDQI
|
| |
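The sequences above are printed in space-separated groups of ten characters. A minimal sketch for removing that grouping and computing the GC fraction of a nucleotide string is shown below; the helper names are hypothetical, and the example input is only the first three groups of the gene sequence rather than the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three 10-base groups of the gene sequence listed above.
prefix = clean_sequence("ATGTCGCAGA TACAGGCACA GGTGACCCGT")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Applying the same two functions to the complete gene sequence gives a value that can be compared with the GC content field of the record.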