Gene Rleg_0371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0371 
SymbolflgK 
ID8011577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp380230 
End bp381717 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content60% 
IMG OID644822966 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_002974221 
Protein GI241203125 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.294715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCA CATCCGCACT TAATACCGCG CAGAACATAT TCAACAATAC GGGTACTCAG 
AGCAGTGTCG TATCGAACAA TATCTCGAAT GCGGGCAACA AAGATTACGT GCGCCGGCAG
GCGATGCTCA CCACGTCCTT GAATGGCGCA CAGGTCGTCA AGATCGATCG GGCGCAGGAA
GAGGCGCTGC TGCGCCAATA CCTGAAGACA TCCTCTCAGG ACAGTGCCCA GCAGGCATTG
CTCGGCGGTC TCGAGGACCT CAAGTCGATC GTGGGTGGCA ACGACTACGA AACGTCGCCA
TCCACCTATC TCGGTGTTTT CCAGCAGAAG CTTCAGGCCT TCCGCACGAC GCCGGGCAGC
ACCGTCGCTG CTCAGGGCGC CATCACCGCC GCGCAGGACG TCGCCAACTC GCTGAACAAT
GCCTCGCAAT CCGTTCAGAA CGTCCGCGCC ACCGCCGACA AGCAGATCGC CACCGACGTC
GATAAACTGA ATACGCTTCT CAACGACTTC GAGAAGGCCA ACAACGCGGT GAAGACCGCC
ACGGCCTCGG GTGCGGATGC ATCCGGCGCG CTCGACGAGC GTGAGAAGGC TCTCAAGCAG
ATTTCGCAGA TCGTCGGCGT CAACACGACG ACGCGCGACA ATAACGACAT GGTGCTGAGC
ACCTCTGACG GGACGATCCT CTTCGAGACG ATCCCGCGCA AGGTGACGTT CAAGTCTCAG
GATGTCTATA CCGCGACCAT CACCGGCAAT TCGGTCTATG TTGACGGTGT GGCACTTCCA
CGCGGCAGCG GATCGACGAC GACGGGGCAG GGAAGCCTTC AGTCTCTCCT CCAGGTCCGC
GACGAGATCG CCCCGAATTT CCAGAAGCAG CTCGACGAAG TCGCCCGCGG CCTGGTCTCG
CTCTTCAAGG AGCAGAACAC GGCAGCCGGC CCGGCCTATG TGCCCGGCCT CTTCACCTGG
AGCGGCGGCA CGGTCGATAC CGGCGCCACC GCGGTTGCCG GCATGGCGGC GACCATCACA
GTCAGCAGCC GCGTCATCAC CTCGCAAGGC GGCGATCCGA TGCGCTTGCG CGATGGCGGC
GTCAACGCCA CCGGCCTCGT CCTGAACACG TCAGGCGCCA GCGGTTATAC GACGGAACTC
GATCGTCTCT ATACCGCATT GGGCTCCGAC ATCGATTTCG ATCCAGCGGC GGGGACGCCT
GTCGGGTTCG ACGCCACGAC AGGCATCGAT TCAAACGTCA GCATCATGGA ATTCGCCACA
AATTCCCTCG GCTGGCTCGA ACAGTACCGT AGCAATGCCA CGACGGCAGC GGAAAACACC
TCGGCGGCGC TGTCACGCTC TGACGAAGCC TATTCCAACG AGACGGGCGT CAACCTCGAC
GAGGAGCTGA CGCTGCTCCT CGACATAGAG CAGTCCTACA AGGCGGCGAC CAAGATCTTG
AACGCCGTTG ACGAAATGTT GAAGTCATTG CTGGATATTG CGAGTTAA
 
Protein sequence
MSLTSALNTA QNIFNNTGTQ SSVVSNNISN AGNKDYVRRQ AMLTTSLNGA QVVKIDRAQE 
EALLRQYLKT SSQDSAQQAL LGGLEDLKSI VGGNDYETSP STYLGVFQQK LQAFRTTPGS
TVAAQGAITA AQDVANSLNN ASQSVQNVRA TADKQIATDV DKLNTLLNDF EKANNAVKTA
TASGADASGA LDEREKALKQ ISQIVGVNTT TRDNNDMVLS TSDGTILFET IPRKVTFKSQ
DVYTATITGN SVYVDGVALP RGSGSTTTGQ GSLQSLLQVR DEIAPNFQKQ LDEVARGLVS
LFKEQNTAAG PAYVPGLFTW SGGTVDTGAT AVAGMAATIT VSSRVITSQG GDPMRLRDGG
VNATGLVLNT SGASGYTTEL DRLYTALGSD IDFDPAAGTP VGFDATTGID SNVSIMEFAT
NSLGWLEQYR SNATTAAENT SAALSRSDEA YSNETGVNLD EELTLLLDIE QSYKAATKIL
NAVDEMLKSL LDIAS