Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0457 |
Symbol | |
ID | 5321291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 493538 |
End bp | 495361 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640789392 |
Product | TPR repeat-containing protein |
Protein accession | YP_001326149 |
Protein GI | 150395682 |
COG category | [R] General function prediction only |
COG ID | [COG4785] Lipoprotein NlpI, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.559359 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAGA ATACATTCCT TCGCCTCCTC AGCGGCGCGG CAATGCTGGT TCTTGCGACG GTGGCCGGCA ACCTGCAGGC CTTTGCCGAG GAGAAGGCGC CGGTCGAAGA CGCCGAACCT TTCGACATCA AGAGCGTCAA CACCTTTGCC GGTGCCTTCC TTGCGGCGCG CACGGCCGAC GTGGACCGCG ACTTCGCGGC GGCAACCGAC CTATATCGCA CTGCACTGAG GTTCGAGCCG GGCAACACCG AGGTGAAGCA GCGGCTGATG ATCACGCTGT TGATGAGCGG CAGGTTCGAT GAAGGCGCCA AGATCGCCGA AGAGCTGAAA TCGGATCCTG CCGTTGAAAG GATCACCACG GTGGTCCGAG CGATCGAAGC GATCCGTAAG CGCGAATACC GCAATGCGCA AAAGCTCTTG AAGTACGAGG GGCCGAACGA CCTCGATCGG CTGATGAGCT CGCTGCTTTC CGCTTGGGCG AAGTTCGGTC AGGGCCGACC GAAAGAGGCC CTGGCGCAGA TCAAAAATCT CCAGGGTCCG GAATGGTTCC GGATCTTCAA GAACTATCAT GCCGGAGCGA TCGCGCTTGC GGCCGGCGAC AAGGCAACCG CGCGGACCCG GCTGAACGAC GCCGTTCTCG ACCGCGAAGG CGGCGGCGCG GCACCGGATA CCTTCATGCG GGCGGTGGAG GCGCTGGCGC GGTTCGAGGC ACGCGAAGGC AACAAGCAGA AGGCGTTGGA TACGATCGCC GTCGGCGAAA ATCTCGTAAA CAACTACACC CCGCTCGAGG CTCTCAGAAA GAGTGTCGAA GAAGGAAAGA CGCAGGAACA GCAGGTTCGC AACGCCGTGC AGGGTGCCGC CGCCGTGCTC TTTTCCATCG GTGGTGCGCT CAACCGGGAG GGGGCGGAAG ACATCGTTTC GCTTTACCTC CAAACGGCGC GGCGGCTTGA CCCGGAAAGC GCCGATATTC TGGTGATGCT CGGCGGCATC GCCGAAAATC TGAAGAAGCC GGACGAGGCG ATCGAGCTTT ACAAGAGCGT GCCGGAAAAC TCGCCGATGC GTCGCCTTTC CGAGCTGCAG CTTGGCCTGA GTCTCGCGGG CATCGGAAAG GTCGAAGAGG CGAAGAAGCA CCTGAAAGGG CTGATCGACG TCGATCCCAA GAATATCCGC AATTATCTCG CCTATGGCAG CGTGCTTTCC GACGCCAAGG ACTACAAGGC CATGGGCGAG CTTTACGATC GGGCGGTCGA GGCGATCGGT CCCGTTCCAA AGCGCAGCGA CTGGACCATC TTCTTCCAGC GCGGCATCGC CTATGAGCGG CAGAAGCTTT GGGAGAAGGC CGAGCCGAGC TTCCTCAAGG CGCTGGAGCT CAACCCGGAT CAGCCCCAGG TGCTCAACTA TCTCGGCTAC TCCTGGGTCG ATATGAACGT AAAGCTCGAA GAGGGCCTCG ACATGATCCG CAAGGCGGTC GAACTCAAGC CCGACGACGG CTATATCGTC GACTCGCTCG GCTGGGCGTA TTTCCGCATG GGCCGTTTCG ATGAGGCCGT GGCCGAACTC GAGCGCGCGG CCGAGCTGAT GGCCGGCGAC GCGACGATCA ACGATCACCT TGGCGACGCC TACTGGCGCG TCGGCCGGAA ACTCGAAGCG GTGTTCCAGT GGAATCAGGC GCTGGAGCTG AAGCCTGAGG AGGCTGAAAT TCCCAAGATA AAGGCGAAGA TCGAAAATGG TCTGCCGCCG TTGAAAGAGT CGGTTCCGGC CGCGGCGGAT GCCAAGGAAA AGCTTCCGAA AAAAACGGAG CCGGCGCCGG ACAAGAAGTC CTGA
|
Protein sequence | MRQNTFLRLL SGAAMLVLAT VAGNLQAFAE EKAPVEDAEP FDIKSVNTFA GAFLAARTAD VDRDFAAATD LYRTALRFEP GNTEVKQRLM ITLLMSGRFD EGAKIAEELK SDPAVERITT VVRAIEAIRK REYRNAQKLL KYEGPNDLDR LMSSLLSAWA KFGQGRPKEA LAQIKNLQGP EWFRIFKNYH AGAIALAAGD KATARTRLND AVLDREGGGA APDTFMRAVE ALARFEAREG NKQKALDTIA VGENLVNNYT PLEALRKSVE EGKTQEQQVR NAVQGAAAVL FSIGGALNRE GAEDIVSLYL QTARRLDPES ADILVMLGGI AENLKKPDEA IELYKSVPEN SPMRRLSELQ LGLSLAGIGK VEEAKKHLKG LIDVDPKNIR NYLAYGSVLS DAKDYKAMGE LYDRAVEAIG PVPKRSDWTI FFQRGIAYER QKLWEKAEPS FLKALELNPD QPQVLNYLGY SWVDMNVKLE EGLDMIRKAV ELKPDDGYIV DSLGWAYFRM GRFDEAVAEL ERAAELMAGD ATINDHLGDA YWRVGRKLEA VFQWNQALEL KPEEAEIPKI KAKIENGLPP LKESVPAAAD AKEKLPKKTE PAPDKKS
|
| |