Gene Rleg_5010 details

Gene Information

Locus tag: Rleg_5010
Symbol:
ID: 8007601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 395682
End bp: 396962
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 66%
IMG OID: 644821925
Product: type III effector Hrp-dependent outers
Protein accession: YP_002973185
Protein GI: 241113350
COG category: [S] Function unknown
COG ID: [COG3395] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
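
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 395682
end_bp = 396962

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1281 426
```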


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.447343
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.232988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTATTT TCCTCGGATC AATCGCTGAC GACTATACGG GCGCATCCGA CCTCGCCAAC 
ACCCTGACGA AGAACGGTCT GCGCACGGTA CAGACGGTCG GCATTCCCGA CCCGTCGCTG
GCACTACCGG ATGTCGACGC CGTGGTCGTT TCCCTGAAGA TCCGCTCCGT CTCGGCCTCG
GACGCCGTGA CGGCGGCGGC GAGCGCCGAG CGATGGCTGC GCCAGCGGGG TGCCGGCCAT
GTGCTCTACA AGATCTGCTC GACCTTCGAT TCCACCGATG CCGGCAATAT CGGTCCGGTC
ACCGAAGCCT TGAGGGAGGC GGCGGGCGGC GGCGTCGTGC TGGTGACGCC CGCCTTTCCG
GAAACGGGAC GCACCGTCTA TCTTGGCCAT CTCTTCGTCG GCGGACAGCC GCTGAATGAA
AGCCCGCTCA AGGATCACCC CCTCAATCCG ATGCATGACG CCAATCTCGT TCGGGTGCTC
ACGCGACAAT CGCGCAATGC TGTCGGGCTG GTCGATCTCA CGACCATCGG CGCCGGACCC
GGCGCCGTCA AAATGAGGCT TGATTCCTTT CGCACCGCAG GCGTCACCGC TGTTATCGCC
GATGCGATTT TCGAACGCGA TCTGGAAACG CTCGGCGAGA TTGCGTTGGA AATGCCGGTG
TCCACCGGTG CGTCCGGCCT CGGCCTCGGC CTTGCCCGCG CGCTCGTCCG CTCCGGTCGG
ATTTCCTCCG GCGGCGCAAC GACGGAGGAC GCCATTCGCC CGGTGGGCGG GCTTTCCGCG
ATCGTTGCCG GCAGTTGCTC CAAGGCGACG CTCCGCCAGC TCGACATCGC CGAACGGTCG
ATGCCCGTCC TGCGGCTCGA CCCGGAGCGG CTGCTTGCCG GTCCCGATGA AATCGCCGCG
GCGATTTCCT GGGCCGGAGA CCGCATCTCC GCCGGCCCCG TCGTCATCGC CGCGAGTGCT
GCGCCTGAAA CCGTGTCCCG GCTGCAATCG CTCCATGGAC GAGAGGCCTC CGGCCACGCG
ATCGAGACCG CGACGTCGAT CATCACAGCC GAACTGGTGG AGAGAGGCGT GCGGCGCCTG
GTGGTCGCCG GCGGCGAAAC CTCGGGCGCG GCCGTCGACA GGCTCGCCAT TCCGGCATTT
CTGATCGGCC CGGAGATTGC GCCCGGCGTG CCGGTGCTGC GCACAGTCGG CAATGCGCAG
GGCGACATGC TTCTGGCGCT GAAATCAGGA AACTTCGGAG GCGAGGATTT CTTTACGGCA
GCGCTGGCGA TGATGCGCTG A
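
The gene sequence is displayed in space-separated groups of 10 bases, 60 per line, so it has to be cleaned before any analysis. The following is a minimal sketch of how the grouping could be stripped and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGGCTATTT TCCTCGGATC AATCGCTGAC GACTATACGG GCGCATCCGA CCTCGCCAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two helpers would give a figure comparable to the GC content reported in the Gene Information table.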
 
Protein sequence
MAIFLGSIAD DYTGASDLAN TLTKNGLRTV QTVGIPDPSL ALPDVDAVVV SLKIRSVSAS 
DAVTAAASAE RWLRQRGAGH VLYKICSTFD STDAGNIGPV TEALREAAGG GVVLVTPAFP
ETGRTVYLGH LFVGGQPLNE SPLKDHPLNP MHDANLVRVL TRQSRNAVGL VDLTTIGAGP
GAVKMRLDSF RTAGVTAVIA DAIFERDLET LGEIALEMPV STGASGLGLG LARALVRSGR
ISSGGATTED AIRPVGGLSA IVAGSCSKAT LRQLDIAERS MPVLRLDPER LLAGPDEIAA
AISWAGDRIS AGPVVIAASA APETVSRLQS LHGREASGHA IETATSIITA ELVERGVRRL
VVAGGETSGA AVDRLAIPAF LIGPEIAPGV PVLRTVGNAQ GDMLLALKSG NFGGEDFFTA
ALAMMR
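
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first and last few codons. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical and written for illustration only.

```python
from itertools import product

# Codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 bases of the gene sequence.
print(translate("ATGGCTATTT TCCTCGGATC AATCGCTGAC"))  # prints: MAIFLGSIAD
# Last displayed line of the gene sequence, ending in the TGA stop codon.
print(translate("GCGCTGGCGA TGATGCGCTG A"))  # prints: ALAMMR*
```

The two outputs match the start and end of the protein sequence shown above, with the trailing `*` marking the stop codon that is not part of the 426-residue protein.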