Gene Information |
Locus tag | Smed_1942 |
Symbol | |
ID | 5322801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1994093 |
End bp | 1995973 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790880 |
Product | TPR repeat-containing protein |
Protein accession | YP_001327611 |
Protein GI | 150397144 |
COG category | [S] Function unknown |
COG ID | [COG5616] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
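The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not the database's own method. It assumes the start and end positions are 1-based and inclusive, that the gene span includes the stop codon, and that the CDS is unspliced (as the table states). The function names are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 1994093
end_bp = 1995973
gene_length_bp = 1881
protein_length_aa = 626


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon.

    Assumes an unspliced CDS whose span includes the stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # Gene Length agrees with Protein Length
```

Under these assumptions, 1995973 - 1994093 + 1 gives 1881 bp, and 1881 / 3 - 1 gives 626 codons, matching the reported gene and protein lengths.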
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.420919 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGATG AGAGAATACT CCCGAATGCG GCCCTCGCAA AGATTGCAGG CGAATCGGTC GAGAACCTGT CAACTCATCG GATTTCGGTC ACACATGGCC AAGAGATGGA CGGCATGAGA CGGCTGGCGG CGATCTTGGA CGCGGACGTC GTCGGCTATA GCCGGCTGAT GGGACTGGAC GAAGCCGGCA CTTATCGGGC GGTCAAGCAT TGCCACAACG CCTTCATTCT GCCGTTGGTC GAGGCCCATA ACGGCAGGAT CGTCAAGCAG GCGGGCGATG GAATGCTCGC AGAGTTCGCA AGCGTACTCG ACGCCGTCGC TTGCGCCATC GCGATCCAAC GAACAATGCA CGATCAGGCC GGGAGCGCGG AAACTGAGCG TCTGGAATTG CGCATCGGCG TCCACCTCGG CGATATCGTC GCGGACGACG GCGATATTCA CGGCGAAGGC ATTGCCGTTG CCGGGCACCT TCAGGAAATG GCGCCGCCCG GCGGCATCTG CGTGTCGCAG CAAGTCTATG ACCAGGTTTC CTCGAAACTG GATATCCAGA TGGGAGACCT CGGCTGCAAG ACGTTTGCCG ATATTCCCGG CCCGCTGCAC GTCTGGTGCT GGCAGCCGGG CGCAACACGG GAAGAATCGC CCGCACCAAA GCAGAACCGG CCGCGTCCTG ACATGAAGCG GCCGTCGATC GCCGTCCTGC CATTCGTCAA CTTGTCGAGC GTCGACGAAC AGGAGCATTT CTCCGACGGC TTCACGGAGG AGTTGATTTC CACCCTGGCC CGATGCCGTT GGCTGCGCGT CGTCGCGCGC AACTCCTCCT TCACTTTCAA GGGGGTAACT GTCGACGTGA GAAAGGTTGC GTCCGACTTG GGCGTTAAAT ACGTGATCGA GGGCAGCATA CGCCGCGCGG CAAACCGGAT CCGCATCACG GCGCAATTGC TGAGCGGCGA AACCGGCATG CTGCTCTGGG CAGAGCGCTA CGACCGCATG CTGGACGACG TTTTCGTGCT GCAGGATGAG ATCGCGGGAC AGATCACCGG TACTGTGGAA CCCGAACTCG GCTTCATCGA ATTCGCAGCG CTGCGCGGCC AAAGCGCGAC GGACATGGAT GCCTGGAATA TCTATCTCAA GGGGTTATGG CACCTCTACA AGTTTGATCT TGAGAATCTG AGGATTTCCA AAGAGCTGTT CGAGCGAGCG ATCGACCTCG AACCCGCTTT TGCCCAGGCC TATGCGCGCC TTGCTTATGT CCATATACAG CTCGGCTGGT ATGGCCCTCT TGAGGAGCGG GGCGATCGGA TCGCCGACGC GACGGCGCTG GCTGAACGTG CGACCGCGCT CGACGACCGT GAGCCGGCAG CGCATCTGGC ACTCGGCCGG GCACGGGCAC TCGGCGGCCA GCCGGAGCGT GGGATCGAAC ACCTGCGCAA CGCACTGAGG CTTGTCCCAA GCTTCGCCCA GGGCCACTTT GCTCTCGGGC AAGCGCTTTG TTATGTGGGC CGCCCCGAAG AGGGCATCAC CGCGATCAAC GAGGCGTTCC GGCTGAGCCC TCGAGATCCG CATCTGTGGA CGTTTCACAA CATGGTCGCC ATCGCCCAAT ACCAGGCGGG TCGCTTCGCG CAAGCCGCCG AGGCGGCCCG CGCCTCTCTA CTCAAGGAAA ATGCCACGTT CTGGCCCGCA ATGGTGCTGG CAGCGTCCCT CGGCGCCCAG GAGCGGAAGG GCGAGGCCCG CGCGGCGGTG GCGGAGCTTT TGCGCCGGCG GCCGGACATG ACCGCGAAAA CGGCCCGCGC CGAATTCTAC 
TTCGGCAGCG TGCCGGCCAT GTCCGAGAAA TTCATCGACC GCTTCGTCAG CGATCTGCAC CGCGCCGGCG TGCCTGATTG A
|
Protein sequence | MLDERILPNA ALAKIAGESV ENLSTHRISV THGQEMDGMR RLAAILDADV VGYSRLMGLD EAGTYRAVKH CHNAFILPLV EAHNGRIVKQ AGDGMLAEFA SVLDAVACAI AIQRTMHDQA GSAETERLEL RIGVHLGDIV ADDGDIHGEG IAVAGHLQEM APPGGICVSQ QVYDQVSSKL DIQMGDLGCK TFADIPGPLH VWCWQPGATR EESPAPKQNR PRPDMKRPSI AVLPFVNLSS VDEQEHFSDG FTEELISTLA RCRWLRVVAR NSSFTFKGVT VDVRKVASDL GVKYVIEGSI RRAANRIRIT AQLLSGETGM LLWAERYDRM LDDVFVLQDE IAGQITGTVE PELGFIEFAA LRGQSATDMD AWNIYLKGLW HLYKFDLENL RISKELFERA IDLEPAFAQA YARLAYVHIQ LGWYGPLEER GDRIADATAL AERATALDDR EPAAHLALGR ARALGGQPER GIEHLRNALR LVPSFAQGHF ALGQALCYVG RPEEGITAIN EAFRLSPRDP HLWTFHNMVA IAQYQAGRFA QAAEAARASL LKENATFWPA MVLAASLGAQ ERKGEARAAV AELLRRRPDM TAKTARAEFY FGSVPAMSEK FIDRFVSDLH RAGVPD
|
| |
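The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The sketch below is illustrative only and uses just the first 30 and last 12 nucleotides of the gene sequence shown above. It builds the codon table from the compact amino-acid string for translation table 11, whose codon assignments match the standard code (table 11 differs in which codons may act as start codons, which does not matter here because this gene begins with ATG). The `translate` helper is a hypothetical name, not part of any library.

```python
# Compact codon table: bases in TCAG order, one amino-acid letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 12 nucleotides, copied from the gene sequence above.
first_30 = "ATGCTGGATG AGAGAATACT CCCGAATGCG"
last_12 = "GTGCCTGATTGA"

print(translate(first_30))  # MLDERILPNA
print(translate(last_12))   # VPD*
```

The first ten residues match the start of the protein sequence (MLDERILPNA), and the final codons give VPD followed by the TGA stop codon, matching the protein's C-terminal residues. For whole-sequence work, an established library translation routine would be a more robust choice than this hand-built table.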