Gene Rleg_0480 details

Gene Information

Locus tag: Rleg_0480
Symbol:
ID: 8011675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 500049
End bp: 503138
Gene Length: 3090 bp
Protein Length: 1029 aa
Translation table: 11
GC content: 62%
IMG OID: 644823072
Product: outer membrane autotransporter barrel domain protein
Protein accession: YP_002974325
Protein GI: 241203229
COG category: [S] Function unknown
COG ID: [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
            [TIGR01451] conserved repeat domain
            [TIGR02601] autotransporter-associated beta strand repeat
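
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the terminal stop codon (the listed gene sequence does end in TAA). The function names are hypothetical helpers.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 500049
end_bp = 503138
gene_length_bp = 3090
protein_length_aa = 1029


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 3090
print(expected_protein_length(gene_length_bp))  # 1029
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields.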


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.876293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAAA GGATATCTGG AGCGGCAATC CAAGGGATGG TCATGCTGCG ACTTACGGGA 
TTGGCGAGCA CTGCGGCGCT GGTGCTGGCT GTAGGGCCGG GATGGGCGCA GGTCATCACC
GGCAACGACA CGGAGATCGT CGATGGCAAC GACCCGGGCG GTACCGGCGC GGGCACGCAA
CCGAGTCCAT GGACAATCAA TACCAACCTC ATCGTGGGCG ATCAGAATGG TGACGATGCA
GCCCTTGTCA TCCGGAATGG CGGCATAGTC AGCAATGACA TCGGCGTGCT CGGTGTCGAT
CCCGGCGCTG CAGGAACGGT AACGGTTACG GGGACGGGCT CGGCCTGGAC CAATTCCGAC
GACCTCTACG TCGGCCATCG GGGTGTCGGC GTGCTCAATA TCGAAGATGG CGGCGTTGTC
GACAATATAT TCGGCCGTAT CGGCTATTTC TCTGGCGCCA GCGGCACGGT GACGGTCACC
GGCACCGGCT CGACCTGGAC CAACGCCCAG GATCTTTATA TCGGCGACAG CGGCACCGGC
ACTCTGACCA TTTCGAATGG CGGCACGGTC AGCAGCACTG CCGGGCTTAT CAGCAACGAC
ACGACTGCCA TCGGCGAGGT CATCGTCACA GGCACGAACT CGATCTGGAG CAATTCCAGC
TATATCTCGG TCGGCGAGGC GGGAGCGGGA ACGCTGACCA TTTCGAATGG CGGCTCGGTC
ACTGCGAGTG AAGGTTATGT CGGCTATAGT TCGAACGGCA ACGGCGTGGT GAGCGTGACC
GATACGGGGT CGAGCTGGAT CAATTCCGGC GCGCTGTTCG TGGGCGAATT CGGTTCAGGC
AGTATGAGCG TCGAGAATGG CGGCACGGTT TCGGCTTCCG AGGTTATCAT CGCCGACGAT
TCGGGCGCCA CGGGAACTGT GCGGATCGCC GGAAGCGCCG CAAACGGGCG CGGCGTCCTG
GAGACCGGTT ATATCGAGAG AGGCGGTGGT GACGCGGACC TCGTTTTCGA TGGCGGTATT
CTCAGCGCAA CGGGCAACGA GGCGAATTTC CTGCGCGGTT TCAACGCCGG CGAGGTAACG
ATTGATGCTG GCGGCGCCTT TATCGATACG AACGGCTTTG CCGTCGGCAT TGCCACGGAT
TTGCAAGGCG CCGGTGGCTT GACCAAAAAG GGCAGCGGCA CACTGACGCT TTCGGGCACC
AGCAGTTTTA CCGGTGTGAC GACGGTCGAG GCCGGCACGT TGCAGGCGGG CAGCGCCGGT
GCCTTCGTGC AGAATGGCGC CTATGCGGTC AATGGCGGCA TATTCGATCT CGGCGGCTTC
GACCTGACGA TGGCGCAGCT TTCGGGAAGC GGCGGCGAGA TCGTCATCGG CAGCGCCGAA
CTCACGCTCG ATCAGGCGAA CAACACCACC TATGGCGGCA TACTCTCCGG CAGCGGCGAC
TTCACGATGC TGGGCAGCGG CACCTTGCGC CTGACCGGCA ACAGTTCGGG CTTTGCCGGC
ACGACCACGG TTTCGGACGG ACGCCTGATC GTCAATGGCA GCCTTGGCGG CATCTTGATG
ATGACGGGCG GCACGCTGGC CGGGTCGGGT CATATCGGTA CGGTGACCGC CGGCGCAGGC
GTCACCATCG CGCCGGGCAA CTCCATCGGG ACATTGACCA TCGGCGGCAA CCTCACCCTC
GATCCCAGTT CCACCTATGA GGTCGAGGTC GATCCGGCTG GCACTGCCAG CGATTTGATC
TCGGTCACGG GGACCGCGTT CCTCAACGGC GCCAGCGTTA CCCATGTTGG GATGAATGGC
GACTACCAGC CTTTCTCCAC CTATACGATC CTTACCGCCG CTGGCGGGAT CAACGGAACA
TTCGGCGCCG TCACCTCCGA TTATGCCTTC CTGGCGGCGG AGCTCTCCTA TGATCTGAAC
AACGTCTATC TGGAGATCGA ACGCAACAAC GTGCGTTTCA GCGACATGGC ACGGACGCGC
AACCAGATGG CCGCGGCAGA GGCTGCAGAG AATCTCGGAA CTGGCAACGA TATCTACGAC
GCCATCGTCA CATTGCCCGA TGACGAGCCG CTGATTCAGG CCAGTTACGA CGCGCTTTCC
GGTGAAATCC ATGGCTCGAT CAAAACGGCG CTGATTACGC AAAGCCTCGT TGTCCGCCAG
GCCGCCAACG AACGTCTGCG TTCCGCGTTT AGCGACGCCA GTGCTGGCGT AATCCCGATA
CAGGCTTTCT GGCCGGGCGG TCCGGAACTC ATCGCTGCCA ATCCTTCGGA CGCGCCGGTT
TTCTGGAGCA CGGCTTTTGG CGGCGCAAGC GAGACACGCA CGGACGGCAA TGCCGCCACC
CTCAACCACC AGACCGGCGG GCTTCTCGCT GGCGTCGACG CCATGTTCGA TGACGTCAGG
CTCGGCCTGA TGGCCGGTTA CAGCAACTCC CAATTCGACC CGCGGCACCG AAGCTCATCG
GGATCGAGCG ACGATTATCA CCTCGGCCTT TACGCGGGCA CGCAATGGGG CGGTCTCGCC
TTCCGCACCG GTCTCGCTCA TACGTGGCAC GAGATCGAGA CCAACCGCAG CGTCGCTATT
GGAAGCTTCG AGGACAGGCT GGAAGCAAGC TATAATGCCG GCACGCTGCA GGCATTTGCG
GAGCTGGGCT ATCGGTTTGA TACGGCGGCG GCCACTTTCG AGCCTTTCGT CAATCTCGCC
CATATCGGCA TTCGAACGGC GGGTTTCACC GAAGGGGGCG GGGCGGCAGC GCTCGACAGC
TCCAGCCGCA TGACCAACAC CACCATCACC ACGCTTGGCC TGCATGCCGA AATGGAGGTT
CGCTTGGGCG AGACGAACGC CACCCTGCGC GGCATGTCAG GCTGGCGGCA TGCCGCCGGC
GACATCGTTC CGGTGTCGAC GCATGCTTTT GCCGGAGGCG ACGCGTTCAC CGTCGCCGGA
GTGCCGGTGG CAGAGAACGC CTTCGTTCTT GACGCCGGGC TCGACTTCGA CCTCACCGAA
AGCGCCATCC TCGGCATCGC CTATTCCGGC CAGATTGCCG ACAACGCGCA GCAGCATGGG
GCCAAAGCGA CGCTGTCGGT GAAATTCTAA
 
Protein sequence
MGERISGAAI QGMVMLRLTG LASTAALVLA VGPGWAQVIT GNDTEIVDGN DPGGTGAGTQ 
PSPWTINTNL IVGDQNGDDA ALVIRNGGIV SNDIGVLGVD PGAAGTVTVT GTGSAWTNSD
DLYVGHRGVG VLNIEDGGVV DNIFGRIGYF SGASGTVTVT GTGSTWTNAQ DLYIGDSGTG
TLTISNGGTV SSTAGLISND TTAIGEVIVT GTNSIWSNSS YISVGEAGAG TLTISNGGSV
TASEGYVGYS SNGNGVVSVT DTGSSWINSG ALFVGEFGSG SMSVENGGTV SASEVIIADD
SGATGTVRIA GSAANGRGVL ETGYIERGGG DADLVFDGGI LSATGNEANF LRGFNAGEVT
IDAGGAFIDT NGFAVGIATD LQGAGGLTKK GSGTLTLSGT SSFTGVTTVE AGTLQAGSAG
AFVQNGAYAV NGGIFDLGGF DLTMAQLSGS GGEIVIGSAE LTLDQANNTT YGGILSGSGD
FTMLGSGTLR LTGNSSGFAG TTTVSDGRLI VNGSLGGILM MTGGTLAGSG HIGTVTAGAG
VTIAPGNSIG TLTIGGNLTL DPSSTYEVEV DPAGTASDLI SVTGTAFLNG ASVTHVGMNG
DYQPFSTYTI LTAAGGINGT FGAVTSDYAF LAAELSYDLN NVYLEIERNN VRFSDMARTR
NQMAAAEAAE NLGTGNDIYD AIVTLPDDEP LIQASYDALS GEIHGSIKTA LITQSLVVRQ
AANERLRSAF SDASAGVIPI QAFWPGGPEL IAANPSDAPV FWSTAFGGAS ETRTDGNAAT
LNHQTGGLLA GVDAMFDDVR LGLMAGYSNS QFDPRHRSSS GSSDDYHLGL YAGTQWGGLA
FRTGLAHTWH EIETNRSVAI GSFEDRLEAS YNAGTLQAFA ELGYRFDTAA ATFEPFVNLA
HIGIRTAGFT EGGGAAALDS SSRMTNTTIT TLGLHAEMEV RLGETNATLR GMSGWRHAAG
DIVPVSTHAF AGGDAFTVAG VPVAENAFVL DAGLDFDLTE SAILGIAYSG QIADNAQQHG
AKATLSVKF
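
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper and the variable names are hypothetical, and the two inputs are the first and last 30 bases of the gene sequence listed above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with unknown bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGGGCGAAA GGATATCTGG AGCGGCAATC"  # start of the gene sequence
last_30 = "GCCAAAGCGA CGCTGTCGGT GAAATTCTAA"   # end of the gene sequence

print(translate(first_30))  # MGERISGAAI
print(translate(last_30))   # AKATLSVKF
```

Both results match the corresponding ends of the protein sequence. Passing the full gene sequence to the same function would be the natural next check.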