Gene Rleg_4476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4476 
Symbol 
ID8015239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4606852 
End bp4608351 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content63% 
IMG OID644827052 
Producttype II and III secretion system protein 
Protein accessionYP_002978253 
Protein GI241207157 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.988373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAATT CAACGCGGCG CGCCGGACCT CTCCTGACAG GTTGCCTTTC GCTGGCAATC 
GGCGCCTCTG GTATGGTGCC GGCCTCTTTC GCCCCACTCT TCGGTGCAGG CGAAGCGCGC
GCCGATTCCG ATAGCCTGGT CCGGATCTCG CAGACCGGCT CCAATGCCCA TCGCCGGCTG
AAGCTCGGGC TCAACAAGGC CGTCGTTGTC GATCTGCCGG AGGATGCGCA TGATATCCTC
GTCTCCGATC CGACCATGGC TGATGCCGTG ACCCGCACCT CGCGGCGCAT CTACCTGTTC
GGCAAGAAGG TCGGCCAGAC GAATATCTTC GTCTTCGGCG CCGGTGGACA GGAGATCGTC
AATCTCGACA TCGAGATCGA GCGCGACGTC TCCGGCCTCG AAGTCAATCT CCGCCGCTTC
ATTCCCGACT CCAACATCAA CGTCGAAATC GTCTCCGACA ACATCGTGCT GACCGGCACC
GTGCGCACGC CGCAGGATGC CACACAGGCC GCCGATCTGG CACAGGTCTT CCTGAAGGGC
GGCGAGGCGA CGACCAGAAC CGAAACGGCA TCCGGCACCG GCGGCGACAG CTCTGTGGCG
CTTTTTGCCG AAGGTCGCCA GAGTTCGCAG GTCGTCAACC TGCTGCAGAT CGAGGGCGAG
GACCAGGTCA CCCTCAAGGT GACGATCGCC GAGGTCCGCC GCGAGGTGCT GAAGCAGCTC
GGCTTCGACA ATCTGGTTTC CAATTCTTCC GGCATGACGG TCGCTCAGCT CGGCAGTCCC
AGCGCCGACA GCGCCGCGTC GACGGTCGGT GGCGGTCTGG CGGCGCTCTT CAAGAGCTCG
ATCGGCAAAT ACGACATTTC GACCTACCTC AACGCGCTGG AACAGGCCAA GGTCGTCAAG
ACGCTCGCAG AACCGACGCT GACGGCGATA TCGGGCCAGG CCGCGACCTT CAATTCCGGT
GGCCAGCAGC TTTATTCGAC GACCGACAGC GACGGCAATG TCACCGTCGT ACCGTTTAAC
TACGGTATCA GCCTCGCCTT CAAACCGGTC GTGCTGTCAT CGGGCCGCAT CAGCCTGCAG
ATCAAGACCA ACGTCTCCGA ACCGGTGGCC GGCAGCGGCA ACGCCACCTA TCAGCGCCGC
TCGGCGGAAA CCTCGGTGGA GCTGCCTTCG GGCGGCTCGA TCGCGCTCGC CGGCCTTATC
CGCGACAATG TTTCCCAGAC GATGGGCGGC ACGCCCGGTG TCTCGAAGAT CCCGCTACTC
GGGACGCTCT TCCGCCAGAA GGGTTTCGAG CGTCAGGAAA CCGAGCTCGT CATCATCGCC
ACGCCCTATC TGGTGCGCCC GGTGGCGCGC AACCAGCTCA ACCGGCCGGA CGACAATTTC
AGCCCTGAGA ACGACGGTGC GACCTTCTTC CTCAACCGTG TCAACAAGGT CTATGGCCGC
CGCGAGGCGC CCGTTGCAGA CGCGCAGTTC CACGGCTCGA TCGGGTTCAT CTACAAATGA
 
Protein sequence
MGNSTRRAGP LLTGCLSLAI GASGMVPASF APLFGAGEAR ADSDSLVRIS QTGSNAHRRL 
KLGLNKAVVV DLPEDAHDIL VSDPTMADAV TRTSRRIYLF GKKVGQTNIF VFGAGGQEIV
NLDIEIERDV SGLEVNLRRF IPDSNINVEI VSDNIVLTGT VRTPQDATQA ADLAQVFLKG
GEATTRTETA SGTGGDSSVA LFAEGRQSSQ VVNLLQIEGE DQVTLKVTIA EVRREVLKQL
GFDNLVSNSS GMTVAQLGSP SADSAASTVG GGLAALFKSS IGKYDISTYL NALEQAKVVK
TLAEPTLTAI SGQAATFNSG GQQLYSTTDS DGNVTVVPFN YGISLAFKPV VLSSGRISLQ
IKTNVSEPVA GSGNATYQRR SAETSVELPS GGSIALAGLI RDNVSQTMGG TPGVSKIPLL
GTLFRQKGFE RQETELVIIA TPYLVRPVAR NQLNRPDDNF SPENDGATFF LNRVNKVYGR
REAPVADAQF HGSIGFIYK