Gene Rleg_5250 details


Gene Information

Locus tag: Rleg_5250
Symbol: flgK
ID: 8007424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 661660
End bp: 663120
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 61%
IMG OID: 644822158
Product: flagellar hook-associated protein FlgK
Protein accession: YP_002973418
Protein GI: 241113583
COG category: [N] Cell motility
COG ID: [COG1256] Flagellar hook-associated protein
TIGRFAM ID: [TIGR02492] flagellar hook-associated protein FlgK
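
The coordinate and length fields in this record agree with one another, and a short sketch can make that check explicit. It assumes 1-based inclusive coordinates (the reported values imply this, since 663120 - 661660 + 1 = 1461) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(661660, 663120)
    print(length, protein_length(length))
```

With the values from this record, the two helpers reproduce the reported Gene Length (1461 bp) and Protein Length (486 aa).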


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.516905
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTCA CCTCCGCCTT GAACAGCGTG CAAAGTATTT TCAACAATAC GGGCCAGCAA 
AGCAGCGTCA TCTCGACGAA TATCGCCAAT GTCGGAAATT CCGACTATGT GAGACGGGAG
GCGTCGGTTA CGACGTCTCT TTCCGGCGCC CAGGTCGTCA GCATCAGCCG GGCGCAGGAA
ACTGCGCTGC TTGCCCAGTA TCTGCAAACG AATGCCAAGG ACAGCGCCCA GCAGACGCTG
GTGACCGGTC TCGAAAGCCT GAAGTCGCTG GTGGGCGGCA ATGACTACGA GACCTCGCCG
AGTACCTATC TCACGGCATT CCAGCAGGCG CTCCAGACAT TCGGCACGTC GCCGAGCAGC
ACGACCGCCG CGCAATCGGC CGTGACCGCC GCGCAGGATC TCGCCAATTC GCTAAATACC
GCAAGCGACG GCGTCCAGTC GATCAGAGCC GAGGCGGATG CGGAGATCGC CACACAGGTC
TCCACCTTGA ATACGCTGCT GTCGCAGTTC GAGGCGGCCA ACAATGCAGT CAAGCTGGCG
ACAGCGACCG GCACCGATAC ATCCTCGGCA CTCGACGAGC GTGAAAAGCT GCTGAAGCAG
ATCTCCTCGA TCGTCGGCGT CACCTCAACC GTGCGCGACA ATAACGATAT GGCGCTCTAT
ACCTCTGATG GCACCGTGCT GTTCGAGACC ATTCCCCGCA CCGTCACGTT CGCTCCGACG
GCAACCTACG TTGCCGGAAC CGAAGGCAAT TCCATCTATA TCGACGGTGT TGCGCTCGAC
GCCGGCGAGG GATCGACGAC GAGTGCCTCG GGCAGCCTGC AGGCGCTGCT GCAGCTTCGC
GACGAGATCG CCCCGACATT CCAGGCCCAA CTCGACGAGA TCGCCAAATC GCTCGTCCAG
ATCTTCTCGG AAACCGACGG CAGCACGAGC GCGCCGGGAC TTTTCGTGTG GACGACGGCG
TCGGGCGCAA CCGGGGCAAC ACCTGCTGCT TCCGACGATA CGACAGGGAT CGCCTCCACC
ATCTCGGTCA ATCTAGCCGT CGTCACGAGC GAGGGCGGTG ATGCGACAAA GCTGCGCGAT
GGCTCCATCA GCGGCATCAC CGATCTCAAC ACCGCGGGAG ACAGCGGCTT CTCGGACAAT
CTCGACGCCC TGTATCAGGC GCTGACGGAA CAGCGCTCGT TCTCTTCCGA CGCCGGTCTC
TCCACGTCAC AAAGCCTGAC GGACTACGCC AGCGCCTCCA TCGGCTGGCT CGAACAATAT
CGAAGCGATG CCACGTCGGC CTCCGAAACA ACGGCTGCGG CCTTGTCACG CTCCGACGAG
GCCTATTCCA ACGAAACCGG CGTCAACCTC GACGAGGAAC TGACGCTGCT TCTCGACATC
GAACAATCCT ACAAAGCGGC GACGAAGATC CTGAACGTCA TCGACGAGAT GTTCCAGTCG
CTCCTCGACA TAGCGAGCTA G
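
The record reports a GC content of 61%. A figure like this could be recomputed from the blocked sequence with a sketch along the following lines. It assumes the sequence is pasted as printed, with spaces and line breaks between 10-base blocks, and that any character other than A, C, G, or T is ignored. The function name `gc_content` is a hypothetical helper, not part of the source database.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace, line breaks, and any other characters are skipped,
    so the 10-base blocked layout can be passed in unchanged.
    """
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence in this record.
    print(gc_content("ATGTCGCTCA"))
```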
 
Protein sequence
MSLTSALNSV QSIFNNTGQQ SSVISTNIAN VGNSDYVRRE ASVTTSLSGA QVVSISRAQE 
TALLAQYLQT NAKDSAQQTL VTGLESLKSL VGGNDYETSP STYLTAFQQA LQTFGTSPSS
TTAAQSAVTA AQDLANSLNT ASDGVQSIRA EADAEIATQV STLNTLLSQF EAANNAVKLA
TATGTDTSSA LDEREKLLKQ ISSIVGVTST VRDNNDMALY TSDGTVLFET IPRTVTFAPT
ATYVAGTEGN SIYIDGVALD AGEGSTTSAS GSLQALLQLR DEIAPTFQAQ LDEIAKSLVQ
IFSETDGSTS APGLFVWTTA SGATGATPAA SDDTTGIAST ISVNLAVVTS EGGDATKLRD
GSISGITDLN TAGDSGFSDN LDALYQALTE QRSFSSDAGL STSQSLTDYA SASIGWLEQY
RSDATSASET TAAALSRSDE AYSNETGVNL DEELTLLLDI EQSYKAATKI LNVIDEMFQS
LLDIAS
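
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows one way to check that correspondence using only the Python standard library. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G), for which table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TTT, TTC, TTA, TTG, TCT, ... order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocked layout used in
    this record can be passed in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGTCGCTCA CCTCCGCCTT GAACAGCGTG"))
```

Applied to the first 30 bases of the gene sequence, this gives MSLTSALNSV, the first ten residues of the protein sequence. Applied to the final line of the gene sequence, it gives LLDIAS followed by the TAG stop codon.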