Gene Information |
Locus tag | Rleg_0480 |
Symbol | |
ID | 8011675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 500049 |
End bp | 503138 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644823072 |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | YP_002974325 |
Protein GI | 241203229 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain; [TIGR01451] conserved repeat domain; [TIGR02601] autotransporter-associated beta strand repeat |
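The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(500049, 503138)
print(length_bp, protein_length(length_bp))  # prints "3090 1029"
```

Under these assumptions the computed values agree with the Gene Length (3090 bp) and Protein Length (1029 aa) fields.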
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.876293 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGAAA GGATATCTGG AGCGGCAATC CAAGGGATGG TCATGCTGCG ACTTACGGGA TTGGCGAGCA CTGCGGCGCT GGTGCTGGCT GTAGGGCCGG GATGGGCGCA GGTCATCACC GGCAACGACA CGGAGATCGT CGATGGCAAC GACCCGGGCG GTACCGGCGC GGGCACGCAA CCGAGTCCAT GGACAATCAA TACCAACCTC ATCGTGGGCG ATCAGAATGG TGACGATGCA GCCCTTGTCA TCCGGAATGG CGGCATAGTC AGCAATGACA TCGGCGTGCT CGGTGTCGAT CCCGGCGCTG CAGGAACGGT AACGGTTACG GGGACGGGCT CGGCCTGGAC CAATTCCGAC GACCTCTACG TCGGCCATCG GGGTGTCGGC GTGCTCAATA TCGAAGATGG CGGCGTTGTC GACAATATAT TCGGCCGTAT CGGCTATTTC TCTGGCGCCA GCGGCACGGT GACGGTCACC GGCACCGGCT CGACCTGGAC CAACGCCCAG GATCTTTATA TCGGCGACAG CGGCACCGGC ACTCTGACCA TTTCGAATGG CGGCACGGTC AGCAGCACTG CCGGGCTTAT CAGCAACGAC ACGACTGCCA TCGGCGAGGT CATCGTCACA GGCACGAACT CGATCTGGAG CAATTCCAGC TATATCTCGG TCGGCGAGGC GGGAGCGGGA ACGCTGACCA TTTCGAATGG CGGCTCGGTC ACTGCGAGTG AAGGTTATGT CGGCTATAGT TCGAACGGCA ACGGCGTGGT GAGCGTGACC GATACGGGGT CGAGCTGGAT CAATTCCGGC GCGCTGTTCG TGGGCGAATT CGGTTCAGGC AGTATGAGCG TCGAGAATGG CGGCACGGTT TCGGCTTCCG AGGTTATCAT CGCCGACGAT TCGGGCGCCA CGGGAACTGT GCGGATCGCC GGAAGCGCCG CAAACGGGCG CGGCGTCCTG GAGACCGGTT ATATCGAGAG AGGCGGTGGT GACGCGGACC TCGTTTTCGA TGGCGGTATT CTCAGCGCAA CGGGCAACGA GGCGAATTTC CTGCGCGGTT TCAACGCCGG CGAGGTAACG ATTGATGCTG GCGGCGCCTT TATCGATACG AACGGCTTTG CCGTCGGCAT TGCCACGGAT TTGCAAGGCG CCGGTGGCTT GACCAAAAAG GGCAGCGGCA CACTGACGCT TTCGGGCACC AGCAGTTTTA CCGGTGTGAC GACGGTCGAG GCCGGCACGT TGCAGGCGGG CAGCGCCGGT GCCTTCGTGC AGAATGGCGC CTATGCGGTC AATGGCGGCA TATTCGATCT CGGCGGCTTC GACCTGACGA TGGCGCAGCT TTCGGGAAGC GGCGGCGAGA TCGTCATCGG CAGCGCCGAA CTCACGCTCG ATCAGGCGAA CAACACCACC TATGGCGGCA TACTCTCCGG CAGCGGCGAC TTCACGATGC TGGGCAGCGG CACCTTGCGC CTGACCGGCA ACAGTTCGGG CTTTGCCGGC ACGACCACGG TTTCGGACGG ACGCCTGATC GTCAATGGCA GCCTTGGCGG CATCTTGATG ATGACGGGCG GCACGCTGGC CGGGTCGGGT CATATCGGTA CGGTGACCGC CGGCGCAGGC GTCACCATCG CGCCGGGCAA CTCCATCGGG ACATTGACCA TCGGCGGCAA CCTCACCCTC GATCCCAGTT CCACCTATGA GGTCGAGGTC GATCCGGCTG GCACTGCCAG CGATTTGATC TCGGTCACGG GGACCGCGTT CCTCAACGGC GCCAGCGTTA CCCATGTTGG GATGAATGGC 
GACTACCAGC CTTTCTCCAC CTATACGATC CTTACCGCCG CTGGCGGGAT CAACGGAACA TTCGGCGCCG TCACCTCCGA TTATGCCTTC CTGGCGGCGG AGCTCTCCTA TGATCTGAAC AACGTCTATC TGGAGATCGA ACGCAACAAC GTGCGTTTCA GCGACATGGC ACGGACGCGC AACCAGATGG CCGCGGCAGA GGCTGCAGAG AATCTCGGAA CTGGCAACGA TATCTACGAC GCCATCGTCA CATTGCCCGA TGACGAGCCG CTGATTCAGG CCAGTTACGA CGCGCTTTCC GGTGAAATCC ATGGCTCGAT CAAAACGGCG CTGATTACGC AAAGCCTCGT TGTCCGCCAG GCCGCCAACG AACGTCTGCG TTCCGCGTTT AGCGACGCCA GTGCTGGCGT AATCCCGATA CAGGCTTTCT GGCCGGGCGG TCCGGAACTC ATCGCTGCCA ATCCTTCGGA CGCGCCGGTT TTCTGGAGCA CGGCTTTTGG CGGCGCAAGC GAGACACGCA CGGACGGCAA TGCCGCCACC CTCAACCACC AGACCGGCGG GCTTCTCGCT GGCGTCGACG CCATGTTCGA TGACGTCAGG CTCGGCCTGA TGGCCGGTTA CAGCAACTCC CAATTCGACC CGCGGCACCG AAGCTCATCG GGATCGAGCG ACGATTATCA CCTCGGCCTT TACGCGGGCA CGCAATGGGG CGGTCTCGCC TTCCGCACCG GTCTCGCTCA TACGTGGCAC GAGATCGAGA CCAACCGCAG CGTCGCTATT GGAAGCTTCG AGGACAGGCT GGAAGCAAGC TATAATGCCG GCACGCTGCA GGCATTTGCG GAGCTGGGCT ATCGGTTTGA TACGGCGGCG GCCACTTTCG AGCCTTTCGT CAATCTCGCC CATATCGGCA TTCGAACGGC GGGTTTCACC GAAGGGGGCG GGGCGGCAGC GCTCGACAGC TCCAGCCGCA TGACCAACAC CACCATCACC ACGCTTGGCC TGCATGCCGA AATGGAGGTT CGCTTGGGCG AGACGAACGC CACCCTGCGC GGCATGTCAG GCTGGCGGCA TGCCGCCGGC GACATCGTTC CGGTGTCGAC GCATGCTTTT GCCGGAGGCG ACGCGTTCAC CGTCGCCGGA GTGCCGGTGG CAGAGAACGC CTTCGTTCTT GACGCCGGGC TCGACTTCGA CCTCACCGAA AGCGCCATCC TCGGCATCGC CTATTCCGGC CAGATTGCCG ACAACGCGCA GCAGCATGGG GCCAAAGCGA CGCTGTCGGT GAAATTCTAA
|
Protein sequence | MGERISGAAI QGMVMLRLTG LASTAALVLA VGPGWAQVIT GNDTEIVDGN DPGGTGAGTQ PSPWTINTNL IVGDQNGDDA ALVIRNGGIV SNDIGVLGVD PGAAGTVTVT GTGSAWTNSD DLYVGHRGVG VLNIEDGGVV DNIFGRIGYF SGASGTVTVT GTGSTWTNAQ DLYIGDSGTG TLTISNGGTV SSTAGLISND TTAIGEVIVT GTNSIWSNSS YISVGEAGAG TLTISNGGSV TASEGYVGYS SNGNGVVSVT DTGSSWINSG ALFVGEFGSG SMSVENGGTV SASEVIIADD SGATGTVRIA GSAANGRGVL ETGYIERGGG DADLVFDGGI LSATGNEANF LRGFNAGEVT IDAGGAFIDT NGFAVGIATD LQGAGGLTKK GSGTLTLSGT SSFTGVTTVE AGTLQAGSAG AFVQNGAYAV NGGIFDLGGF DLTMAQLSGS GGEIVIGSAE LTLDQANNTT YGGILSGSGD FTMLGSGTLR LTGNSSGFAG TTTVSDGRLI VNGSLGGILM MTGGTLAGSG HIGTVTAGAG VTIAPGNSIG TLTIGGNLTL DPSSTYEVEV DPAGTASDLI SVTGTAFLNG ASVTHVGMNG DYQPFSTYTI LTAAGGINGT FGAVTSDYAF LAAELSYDLN NVYLEIERNN VRFSDMARTR NQMAAAEAAE NLGTGNDIYD AIVTLPDDEP LIQASYDALS GEIHGSIKTA LITQSLVVRQ AANERLRSAF SDASAGVIPI QAFWPGGPEL IAANPSDAPV FWSTAFGGAS ETRTDGNAAT LNHQTGGLLA GVDAMFDDVR LGLMAGYSNS QFDPRHRSSS GSSDDYHLGL YAGTQWGGLA FRTGLAHTWH EIETNRSVAI GSFEDRLEAS YNAGTLQAFA ELGYRFDTAA ATFEPFVNLA HIGIRTAGFT EGGGAAALDS SSRMTNTTIT TLGLHAEMEV RLGETNATLR GMSGWRHAAG DIVPVSTHAF AGGDAFTVAG VPVAENAFVL DAGLDFDLTE SAILGIAYSG QIADNAQQHG AKATLSVKF
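The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is an illustrative example rather than the database's own method. It assumes the sequence is pasted as whitespace-separated 10-character blocks, as shown above. It also relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation. Because this gene starts with ATG, no special handling of alternative start codons is included. All names (`clean`, `gc_content`, `translate`) are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; trailing bases are ignored."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First two blocks of the gene sequence above.
print(translate("ATGGGCGAAA GGATATCTGG"))  # prints "MGERIS"
```

Applied to the full gene sequence, `translate` should yield the protein sequence followed by a terminal `*`, and `gc_content` can be compared against the GC content field.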