Gene Rleg_5549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5549 
Symbol 
ID8016440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp136330 
End bp137868 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content55% 
IMG OID644827716 
Producttype II and III secretion system protein 
Protein accessionYP_002978916 
Protein GI241518288 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00090409 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGATCAGCG CGATAGTATG CAACGCGATG GATGCGGCAT TGAACATTAG GGTTCATCCG 
CTCCGCAAGA CGGATTTAGG GAAGTCCTCG GTCCATGCCG GCTGGATTGC GCATACAGTT
CTCCTGCTCA CGCTGCTGTC GTCGGCCACA TATTGCCGTG CCGAGAATGT CATCAGCGTT
ACAGAGACCG GTGTGAATGT CACGCGGCAT GTCAGCATCG GTCTAAATAA GACGCTCATC
GTTGAACTTC CTCAGGACGC TCATGACATT GTTGTTTCAG ATCCAGGCGT GTCTGACGCG
ATAGTGCGAG CGACACGCAC CATCTTTCTC TTTGGCAAAA AGGTCGGGCA AACGAATATC
TTCATCCTCG ATGCAAACAA GCGACCAATC GTCAATATCG ATATCGCGGT TGAGAGAGAC
ATTGCCGGGC TGGAGACGGA TTTGCGTCGT TTGATACCGG ATGCGGCCAT CAAGGTTGAG
ATTATTTCCG ATAATATTGT CCTCACCGGC ACCGTAAGGT CGGCACAGGA CTCGGCACAG
GCCGCCGATC TTGCATCCGC TTTCGTCAAA GGCGGGGAGG CGACGACCCG CACCCAAAGT
GCGTCGAGCG GGGGTAGTCA AGGATCCGTG GCCCTTGTTG CGGAAGATCG GCAGGAATCC
AAAATCATCA ACCTTCTTCG CATCGCAGCC GACGATCAGG TGATGCTCAA AATGACGATA
GCGGAGGTAA AGCGAGAGAT CCTGAAGCAA CTGGGCTTTG ACAATGAGCT AAAAAACGCT
GGTGGCTCAA CAATTGCCCA GCTGGGAACG GCATCGACGG ATGCGACGAC CGCGACCTCC
GGCGGCGGCC TATCAGCCCT ATTCAGCGGA TCCTTCGGCA AGCATGGCCT CTCAACGACA
TTGAATGCGC TTGAACAAGC GAAAGTGGTT CGAACCTTGG CCGAGCCGAC ATTGACGGCC
GTCTCGGGTC AATCGGCATC GTTTCAAGCA GGTGGCGAGG TCCTCTATTC AAACACCGAC
CGCGATGGCA ATACGACCCA AACCCCGTAC AGCTACGGCA TCAGCCTCTC ATTCAAGCCC
ATCGTCCTAA CGTCCGGTCG GATCAGCCTG CAGATCTCAA CCGAGGTGTC CGAACCGGTT
ACCTCTATAT CCGGTTCATC CCCGACCTAC GGCAAGCGTT CGACCAGCAC CACCGTTGAA
CTGCCATCGG GTGGTTCGAT CGCTCTGGCG GGGCTAATCC GAGATAACTT CAACAGGACC
TCCAATGGGA CGCCGGTCTT GAACAAGATT CCAGGGTTTG GCGCTTTGTT TCGCCAGACA
AGCTTCGAGA GAAACGAGAC TGAACTCGTA ATTATCGCCA CACCCTATTT GGTTCGCCCT
GTCGCGGCAA AAGATCTGAA TCGGCCTGAT GACAATCTTA GCCCAGCTGA TGATGCCTCT
CAGGGGTTAC TCGACCGGAT CAACAAGCTC TACGGCAACG GCAAAACCTT GGAGCCAACA
GCGCAATATC ACGGCACCGT CGGCTTCATA TACAAGTGA
 
Protein sequence
MISAIVCNAM DAALNIRVHP LRKTDLGKSS VHAGWIAHTV LLLTLLSSAT YCRAENVISV 
TETGVNVTRH VSIGLNKTLI VELPQDAHDI VVSDPGVSDA IVRATRTIFL FGKKVGQTNI
FILDANKRPI VNIDIAVERD IAGLETDLRR LIPDAAIKVE IISDNIVLTG TVRSAQDSAQ
AADLASAFVK GGEATTRTQS ASSGGSQGSV ALVAEDRQES KIINLLRIAA DDQVMLKMTI
AEVKREILKQ LGFDNELKNA GGSTIAQLGT ASTDATTATS GGGLSALFSG SFGKHGLSTT
LNALEQAKVV RTLAEPTLTA VSGQSASFQA GGEVLYSNTD RDGNTTQTPY SYGISLSFKP
IVLTSGRISL QISTEVSEPV TSISGSSPTY GKRSTSTTVE LPSGGSIALA GLIRDNFNRT
SNGTPVLNKI PGFGALFRQT SFERNETELV IIATPYLVRP VAAKDLNRPD DNLSPADDAS
QGLLDRINKL YGNGKTLEPT AQYHGTVGFI YK