Gene Rleg2_5530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5530 
SymbolflgK 
ID6978624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1178683 
End bp1180143 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content62% 
IMG OID643394629 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_002279447 
Protein GI209547529 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0345523 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCA CCTCCGGCTT GAACAGCGTC CAGAGTATTT TCAACAATAC GGGCCAGCAG 
AGCAGCGTCG TCTCGACCAA CATCGCCAAT GTCGGGAATT CCGACTATGT GAGGCGGGAG
GCGTCGATCA CGACGTCTCT CTCAGGCGCC CAGGTCGTCA GCATCAGCCG GGCGCAGGAA
ACTGCGCTGC TGGCGCAATA TCTGCAATCG AACTCCAAGG ACAGCGCCCA GCAGACGCTG
GTGACCGGCC TCGAAAGCTT GAAGTCGCTG ATGGGCGGCA ACGATTACGA GACCTCGCCA
AGCACATACC TCTCAGCATT TCAGCAGGCG CTGCAGACCT TTGCCACATC GCCGAGCAGC
ACGACCGCGG CGCAATCGGC CGTCACCGCC GCGCGGGATC TCGCCAATTC GCTGAATACC
GCAAGCGACG GCGTCCAGTC GATCAGGGCC GATGCCGACG CGGAGATCGC CACGCAGGTC
TCCTCGCTGA ATACACTGCT GTCGCAGTTC GAGACGGCCA ACAATGCCGT CAAGTTGGCG
ACGGCGACAG GCGCCGATAC CTCCTCGGCG CTCGACGAAC GAGAAAAACT GTTGAAGCAG
ATCTCCTCGA TCGTCGGCGT CACCACCGCC GTGCGCGACA ACAACGACAT GGCGCTCTAC
ACCTCCGACG GCACGGTGCT GTTCGAGACC GTACCGCGCA CCGTCACATA CGTGCCGACG
ACAACCTATG TGGCCGGAAC GGAGGGCAAT TCGGTCTATA TCGACGGCGT CGCACTCGAC
GCCGGCGAGG GGTCGACGAC AAGCGCCTCG GGCGGCCTGC AGGCGCTGCT GCAGCTTCGC
GACGACATCG CGCCGACATT CCAGGCCCAG CTCGACGAGA TCGCCAAGTC GCTCGTCCAG
GCCTTCTCGG AAACCGACGG CAGCACCAGC GCGCCCGGAC TTTTCGTCTG GACCACCGCG
TCGGGGACAT CAGGGGGAAC ACCGTCGGAT TCCGACGATA TCACCGGCAT CGCGTCGTCG
ATCTCGGTCA ATCTTGCCGT CGTTACCAGC GAGGGCGGTG ATGCCACGAA GCTGCGCGAC
GGAACGATCA GCGGCATCAC CGATCTCAAC AGCGCAGGAG ACAGCGGTTT CTCGGACAAT
CTCGACGCCC TCTATACGGC GTTGACGAAA CAGCGCTCGT TCTCCTCAGA CGCCGGTCTT
TCCACCACGC AAAGCCTGAT GGATTACGCC AGTTCCTCCA TCGGCTGGCT GGAACAATAT
CGCAGCGATG CGACGTCGGC TTCCGAAAAC ACGACTGCCG CGCTGTCGCG CTCCGACGAG
GCCTATTCCA ACGAAGCCGG CGTCAACCTC GACGAGGAGC TGACGCTCCT CCTCGATATC
GAACAATCCT ACAAGGCGGC GACGAAGATC CTGAACGTCA TCGACGAGAT GTTCAAGTCG
CTCCTCGACA TAGCGAGCTA G
 
Protein sequence
MSLTSGLNSV QSIFNNTGQQ SSVVSTNIAN VGNSDYVRRE ASITTSLSGA QVVSISRAQE 
TALLAQYLQS NSKDSAQQTL VTGLESLKSL MGGNDYETSP STYLSAFQQA LQTFATSPSS
TTAAQSAVTA ARDLANSLNT ASDGVQSIRA DADAEIATQV SSLNTLLSQF ETANNAVKLA
TATGADTSSA LDEREKLLKQ ISSIVGVTTA VRDNNDMALY TSDGTVLFET VPRTVTYVPT
TTYVAGTEGN SVYIDGVALD AGEGSTTSAS GGLQALLQLR DDIAPTFQAQ LDEIAKSLVQ
AFSETDGSTS APGLFVWTTA SGTSGGTPSD SDDITGIASS ISVNLAVVTS EGGDATKLRD
GTISGITDLN SAGDSGFSDN LDALYTALTK QRSFSSDAGL STTQSLMDYA SSSIGWLEQY
RSDATSASEN TTAALSRSDE AYSNEAGVNL DEELTLLLDI EQSYKAATKI LNVIDEMFKS
LLDIAS