Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5549 |
Symbol | |
ID | 8016440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 136330 |
End bp | 137868 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644827716 |
Product | type II and III secretion system protein |
Protein accession | YP_002978916 |
Protein GI | 241518288 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4964] Flp pilus assembly protein, secretin CpaC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00090409 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATCAGCG CGATAGTATG CAACGCGATG GATGCGGCAT TGAACATTAG GGTTCATCCG CTCCGCAAGA CGGATTTAGG GAAGTCCTCG GTCCATGCCG GCTGGATTGC GCATACAGTT CTCCTGCTCA CGCTGCTGTC GTCGGCCACA TATTGCCGTG CCGAGAATGT CATCAGCGTT ACAGAGACCG GTGTGAATGT CACGCGGCAT GTCAGCATCG GTCTAAATAA GACGCTCATC GTTGAACTTC CTCAGGACGC TCATGACATT GTTGTTTCAG ATCCAGGCGT GTCTGACGCG ATAGTGCGAG CGACACGCAC CATCTTTCTC TTTGGCAAAA AGGTCGGGCA AACGAATATC TTCATCCTCG ATGCAAACAA GCGACCAATC GTCAATATCG ATATCGCGGT TGAGAGAGAC ATTGCCGGGC TGGAGACGGA TTTGCGTCGT TTGATACCGG ATGCGGCCAT CAAGGTTGAG ATTATTTCCG ATAATATTGT CCTCACCGGC ACCGTAAGGT CGGCACAGGA CTCGGCACAG GCCGCCGATC TTGCATCCGC TTTCGTCAAA GGCGGGGAGG CGACGACCCG CACCCAAAGT GCGTCGAGCG GGGGTAGTCA AGGATCCGTG GCCCTTGTTG CGGAAGATCG GCAGGAATCC AAAATCATCA ACCTTCTTCG CATCGCAGCC GACGATCAGG TGATGCTCAA AATGACGATA GCGGAGGTAA AGCGAGAGAT CCTGAAGCAA CTGGGCTTTG ACAATGAGCT AAAAAACGCT GGTGGCTCAA CAATTGCCCA GCTGGGAACG GCATCGACGG ATGCGACGAC CGCGACCTCC GGCGGCGGCC TATCAGCCCT ATTCAGCGGA TCCTTCGGCA AGCATGGCCT CTCAACGACA TTGAATGCGC TTGAACAAGC GAAAGTGGTT CGAACCTTGG CCGAGCCGAC ATTGACGGCC GTCTCGGGTC AATCGGCATC GTTTCAAGCA GGTGGCGAGG TCCTCTATTC AAACACCGAC CGCGATGGCA ATACGACCCA AACCCCGTAC AGCTACGGCA TCAGCCTCTC ATTCAAGCCC ATCGTCCTAA CGTCCGGTCG GATCAGCCTG CAGATCTCAA CCGAGGTGTC CGAACCGGTT ACCTCTATAT CCGGTTCATC CCCGACCTAC GGCAAGCGTT CGACCAGCAC CACCGTTGAA CTGCCATCGG GTGGTTCGAT CGCTCTGGCG GGGCTAATCC GAGATAACTT CAACAGGACC TCCAATGGGA CGCCGGTCTT GAACAAGATT CCAGGGTTTG GCGCTTTGTT TCGCCAGACA AGCTTCGAGA GAAACGAGAC TGAACTCGTA ATTATCGCCA CACCCTATTT GGTTCGCCCT GTCGCGGCAA AAGATCTGAA TCGGCCTGAT GACAATCTTA GCCCAGCTGA TGATGCCTCT CAGGGGTTAC TCGACCGGAT CAACAAGCTC TACGGCAACG GCAAAACCTT GGAGCCAACA GCGCAATATC ACGGCACCGT CGGCTTCATA TACAAGTGA
|
Protein sequence | MISAIVCNAM DAALNIRVHP LRKTDLGKSS VHAGWIAHTV LLLTLLSSAT YCRAENVISV TETGVNVTRH VSIGLNKTLI VELPQDAHDI VVSDPGVSDA IVRATRTIFL FGKKVGQTNI FILDANKRPI VNIDIAVERD IAGLETDLRR LIPDAAIKVE IISDNIVLTG TVRSAQDSAQ AADLASAFVK GGEATTRTQS ASSGGSQGSV ALVAEDRQES KIINLLRIAA DDQVMLKMTI AEVKREILKQ LGFDNELKNA GGSTIAQLGT ASTDATTATS GGGLSALFSG SFGKHGLSTT LNALEQAKVV RTLAEPTLTA VSGQSASFQA GGEVLYSNTD RDGNTTQTPY SYGISLSFKP IVLTSGRISL QISTEVSEPV TSISGSSPTY GKRSTSTTVE LPSGGSIALA GLIRDNFNRT SNGTPVLNKI PGFGALFRQT SFERNETELV IIATPYLVRP VAAKDLNRPD DNLSPADDAS QGLLDRINKL YGNGKTLEPT AQYHGTVGFI YK
|
| |