Gene Rleg2_4188 details


Gene Information

Locus tag: Rleg2_4188
Symbol:
ID: 6982961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4361518
End bp: 4363017
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 63%
IMG OID: 643398919
Product: type II and III secretion system protein
Protein accession: YP_002283676
Protein GI: 209551759
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4964] Flp pilus assembly protein, secretin CpaC
TIGRFAM ID:
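
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The function names are hypothetical and are not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4361518, 4363017
gene_length_bp, protein_length_aa = 1500, 499

# Each comparison is True when the fields agree under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_residues(gene_length_bp) == protein_length_aa)
```

If either comparison came out False, the likely causes would be a different coordinate convention or a feature that is not a complete coding sequence.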


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAATT CAACGCGGCG CGCCGGGCTT CTCCTCACAG GTTTTTTCTC GCTGGCGGTC 
GGTATCTCCG GTATTGCGCC GGCCTCTTTC GCGCCGCTTC TGGGCTCCAG CGAGGCGCGT
GCCGATTCCG AGAACCTGGT TCGCATCTCG CAGACCGGCC GCGATGCCCA TCGTCGGCTG
AAGCTCGGGC TGAACAAGGC CGTTGTCGTC GATCTGCCGG AGGATGCGCA TGATATTCTC
GTCTCCGATC CGACCATGGC CGATGCCGTC ACCCGCACCT CGCGGCGCAT CTACCTGTTC
GGCAAGAAGG TCGGCCAGAC GAATATTTTC GTTTTCGGCG CCGGCGGGCA GGAGATCGTC
AATCTCGACA TCGAGATCGA GCGCGATGTT TCCGGCCTCG AAGTCAATCT CCACCGCTTC
ATTCCAGACT CCAACATCAA TGTCGAAATC GTCTCCGACA ACATCGTGCT GACCGGCACC
GTGCGCACGC CGCAGGATGC CACGCAGGCG GCCGATCTGG CGCAAGTCTT CCTGAAGGGC
GGCGAGGCCA CGACCAGAAC CGAGACGGCA TCGGGTACCG GCGGCGACAG CTCGGTGGCG
CTTTTTGCTG AAGGCCGCCA GACCTCGCAG GTCGTCAACC TGCTGCAGAT CGAAGGCGAA
GACCAGGTCA CCCTCAAGGT GACGATCGCC GAGGTTCGTC GCGAGGTGCT GAAGCAGCTC
GGCTTCGACA ATCTGGTTTC CAATTCCTCC GGCATGACGG TCGCCCAGCT CGGCAGCCCC
AGCGCCGACA GCGCCACATC CGTCGTTGGC GGTGGCCTGG CGGCGCTCTT TAAGAGCTCG
ATCGGGAAAT ATGACATTTC GACCTACCTC AACGCGCTGG AGCAGGCCAA GGTCGTCAAG
ACGCTCGCCG AGCCGACGCT GACGGCAATA TCGGGCCAGG CCGCGACCTT CAATTCCGGC
GGCCAACAGC TCTATTCGAC AACCGACAGC AACGGCAACG TCACCGTCGT GCCGTTCAAC
TACGGTATCA ACCTCGCCTT CAAGCCGGTC GTGCTCTCAT CGGGACGCAT CAGTCTGCAG
ATCAAGACCA ATGTCTCCGA ACCGGTCGCC GGCAGCAGCG GCGCGACCTA TCAGCGCCGC
TCGGCGGAAA CCTCGGTGGA ACTGCCCTCG GGCGGCTCCA TCGCGCTGGC CGGCCTGATT
CGCGACAACG TCTCGCAGAC GATGGGCGGC ACACCTGGCG TATCGAAAAT CCCGCTGCTC
GGTACCCTCT TCCGCCAGAA GGGGTTCGAG CGTCAGGAAA CCGAGCTTGT CATCATCGCG
ACGCCCTATC TGGTGCGCCC GGTGGCGCGC AATCAACTCA ATCGGCCGGA CGATAATTTC
AGCCCCGAGA ACGACGGTGC GACCTTCTTC CTCAACCGTG TCAACAAGGT CTATGGCCGC
CGCGAGGCGC CCGTCGCCGA TGCGCAGTTC CACGGATCGA TCGGGTTCAT CTACAAATGA
 
Protein sequence
MGNSTRRAGL LLTGFFSLAV GISGIAPASF APLLGSSEAR ADSENLVRIS QTGRDAHRRL 
KLGLNKAVVV DLPEDAHDIL VSDPTMADAV TRTSRRIYLF GKKVGQTNIF VFGAGGQEIV
NLDIEIERDV SGLEVNLHRF IPDSNINVEI VSDNIVLTGT VRTPQDATQA ADLAQVFLKG
GEATTRTETA SGTGGDSSVA LFAEGRQTSQ VVNLLQIEGE DQVTLKVTIA EVRREVLKQL
GFDNLVSNSS GMTVAQLGSP SADSATSVVG GGLAALFKSS IGKYDISTYL NALEQAKVVK
TLAEPTLTAI SGQAATFNSG GQQLYSTTDS NGNVTVVPFN YGINLAFKPV VLSSGRISLQ
IKTNVSEPVA GSSGATYQRR SAETSVELPS GGSIALAGLI RDNVSQTMGG TPGVSKIPLL
GTLFRQKGFE RQETELVIIA TPYLVRPVAR NQLNRPDDNF SPENDGATFF LNRVNKVYGR
REAPVADAQF HGSIGFIYK
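
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not a definitive implementation, of how the gene sequence could be cleaned, checked for a plausible start and stop codon, and used to compute GC content. The helper names are hypothetical. The stop codons are those of translation table 11, and the start codon set is limited to three commonly used bacterial starts as a simplifying assumption.

```python
# Assumption: a commonly used subset of table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(dna: str) -> bool:
    """True if length is a multiple of 3 with a start and a stop codon."""
    seq = clean_sequence(dna)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Only the first and last display lines of the gene sequence are pasted
# here to keep the example short; paste all lines for a real check.
excerpt = """
ATGGGCAATT CAACGCGGCG CGCCGGGCTT CTCCTCACAG GTTTTTTCTC GCTGGCGGTC
CGCGAGGCGC CCGTCGCCGA TGCGCAGTTC CACGGATCGA TCGGGTTCAT CTACAAATGA
"""
print(len(clean_sequence(excerpt)))
print(looks_like_complete_cds(excerpt))
print(round(gc_fraction(excerpt), 2))
```

Run on the full gene sequence, the GC fraction can be compared with the GC content field of the record, and the cleaned length with the Gene Length field.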