Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5121 |
Symbol | |
ID | 5319423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 75035 |
End bp | 76033 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776899 |
Product | type II secretion system protein |
Protein accession | YP_001313831 |
Protein GI | 150377236 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4965] Flp pilus assembly protein TadB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0725578 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACCT CTTTGCTTTT TCCTGTTTTC GTATTTCTCC TGGCTTCCAC CAGCATCGGC GGCGTGATGG TGGCGGCGTT CTATCCGCGT GTCTCCAAGG CAAGCGCCTA TCGCCAGAGG CTCGCGCGCA TGTCGGCACC GGCGGAAAAT AAGCGAAGCG AGCCTACGGA GCCTGATGGC CGCGACCGCC GTCGCTCGGT TGAAAAGACG TTGCGCGAGA TAGAGGAAAA GCGCCAGGCG AATGCGCGCA AAGGAAAGAC GACGCTGACG GCCAGACTCC GTCAGTCCGG CCTCCATTGG TCCCCCAAGA CCTATTTCGT GGTCTGTGCC TGCGCCGCAT TGGCCAGCTG GTGCGCGATG CTGCTGCTGG ATACGGGCGC GCTGGTGTCG GCAGGATTCG CCATTTCAGG CGGGCTGCTT CTGCCGCACG TCTATGTGAA CATCAAGCGC AATGCCCGCT TTGCCAAGTT CACTGCCGAA TTCCCGAATG CCGTCGACGT GATCGTCAGG GGGCTCAAGG CAGGTCTGCC GATGCCGGAC TGCCTGAGGG TGATCGCCAC CGAAGCGCAG GAGCCGGTCA AAGGCGAGTT CCTCGCGATC GTCCAGGACC AGACGCTGGG TATCCCCGTG GACGAGGCCG TGAAACGCAT GAGCGAACGC ATGCCGCTGG CCGAAGCGCA TTTCTTTGCC ATCGTCATCG CCATTCAGAG CCGCACCGGC GGCAGCCTTT CGGAAGCGCT GGGGAACCTT TCCAAGGTCC TGCGCGAGCG GAAGAAGATG AAAGCCAAGA TCAAGGCGAT GAGCTCGGAA GCGAAATCCT CCGCAGGCAT CATCGGTGCT TTGCCGTTCC TCGTCGCGGG TGCCGTTTAC TTCGCGAGCC CGGACTACAT GGCACTCCTG TTCGCGACCG TGACGGGCAA GATCGTCCTT GTCGGCTGCG GCCTGTGGAT GGGCATCGGT ATCCTCGTCA TGCGCAAAAT GATCAACTTC GACTTCTGA
|
Protein sequence | MATSLLFPVF VFLLASTSIG GVMVAAFYPR VSKASAYRQR LARMSAPAEN KRSEPTEPDG RDRRRSVEKT LREIEEKRQA NARKGKTTLT ARLRQSGLHW SPKTYFVVCA CAALASWCAM LLLDTGALVS AGFAISGGLL LPHVYVNIKR NARFAKFTAE FPNAVDVIVR GLKAGLPMPD CLRVIATEAQ EPVKGEFLAI VQDQTLGIPV DEAVKRMSER MPLAEAHFFA IVIAIQSRTG GSLSEALGNL SKVLRERKKM KAKIKAMSSE AKSSAGIIGA LPFLVAGAVY FASPDYMALL FATVTGKIVL VGCGLWMGIG ILVMRKMINF DF
|
| |