Gene Rleg2_3198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3198 
Symbol 
ID6981950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3287673 
End bp3288662 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content62% 
IMG OID643397915 
Producttol-pal system protein YbgF 
Protein accessionYP_002282691 
Protein GI209550774 
COG category[S] Function unknown 
COG ID[COG1729] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02795] tol-pal system protein YbgF 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TTGTCGTGGC AGGCATGCTG TGCCTCGCGG CTGTGACCGG GAGCGAACGG 
GCGGCCTATT CCGCCTCGTT CTTTGGGCTG CATCTCGGCG GCCGGTCCAC GGAAAACCAA
GCGGCGCCGC CTGTTGTCAA GGTGCAGAGC GGCGATGCCG AGGTTCGCGT GCAACAGCTC
GAAGAGCAGC TTCGGCAGTT GAATGGCCGG ATCGAGGAGA TGAGCTTCCA GCTGCTGCAG
ATGCAGGAGA CGATCCGCAA GCAGCAGGAA GACAATGAAT TCCGCTTCCA ACAGTTGGAA
AAGACGGGTG CCAGCGGCGG CGGTGCCAAA GCTCCTGTCA AGAAGAGCGA GACCGATACC
GCTCCGGCAG CGTCCGGCGG CGATGACGTT GCCAGGGTGA TCCAAGCACC GCAGGGAGCC
GAAACGGCTC CCTCCACCAA CGTGCCCAGC AATACTGGCC TCGGGCAGCC GCCGAAGGAG
CTCGGCTCGA TCGATTTCGA TCAGAACGGC AACCCGGTCG GCGGAACCGT CGATGCAAAT
GCGGGAGTGG GCTCCGGCCC TATCCCCAAT GCCAATCCCG GCGCGCCGCA GCAGACTGCC
TCGCTTGGCG GTGAGGCGGA CCAGTACAAG TCGGCCTATG GTCACGTCTT ATCAGGCGAT
TACAGCACGG CCGAACAAGA GTTCACCCAG TACATCACCC GCTACCCGAG CAGCGCGCGG
GCGGCGGACG CCAATTTCTG GCTTGGCGAA GCGCTCTATT CGCAGGGCAA GTACAATGAG
GCGGCCAAGA CCTTCCTCAA TGCGCACCAG AAATACGCTA CATCGGAAAA GGCGCCCGAG
ATGCTGTTGA AGCTCGGCAT GTCGCTGGCC GCCCTCGACA ATACCGAGAC GGCCTGTGCG
ACGCTGCGCG AAGTCTCGAA GCGGTATCCG AAGGCTTCGC GTGCCGTCAT AAGCAAGGTT
GCGAGCGAAC AGAAGCGCCT CGCCTGCTAA
 
Protein sequence
MKKLVVAGML CLAAVTGSER AAYSASFFGL HLGGRSTENQ AAPPVVKVQS GDAEVRVQQL 
EEQLRQLNGR IEEMSFQLLQ MQETIRKQQE DNEFRFQQLE KTGASGGGAK APVKKSETDT
APAASGGDDV ARVIQAPQGA ETAPSTNVPS NTGLGQPPKE LGSIDFDQNG NPVGGTVDAN
AGVGSGPIPN ANPGAPQQTA SLGGEADQYK SAYGHVLSGD YSTAEQEFTQ YITRYPSSAR
AADANFWLGE ALYSQGKYNE AAKTFLNAHQ KYATSEKAPE MLLKLGMSLA ALDNTETACA
TLREVSKRYP KASRAVISKV ASEQKRLAC