Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0331 |
Symbol | |
ID | 6979045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 334744 |
End bp | 335706 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643395043 |
Product | flagellin domain protein |
Protein accession | YP_002279856 |
Protein GI | 209547939 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.798731 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTA AGATTACCAG CGCGGCCGCG GTGAATGCGC TTGCGGTGCT ACGCAGCATC AACAAAGAAG CCAGCCAGAC CCAGCAGCAA GTGTCCTCGG GATACCGTAT CGAGACGGCC GCCGACGATG CGTCCTACTG GTCGGTCGCG ACCGTCATGC GCTCGGACAG TACCAATCTT GGGACGATCG GGGATGCCCT CGGTCTCGGG GCTGCCAAGG TGGACGCGAC CTACACGGCG ATGAATTCGG CGATCGATCT TATGGGCCAG ATTCGCGCCA AGCTGGTTGC GGCAAGAGAG CCGGGCACCG ACAAGGATAA AATCAATGCC GAGATCGACG AATACAAGCA GCAGTTGCAG ACGATCGTCG AGTCGACCTC GTTTGCGGGC GAGAACTGGC TCCTGAACGG AAATACCACC GCGCCCCCGA CATGGTCGGT CATCTCCAGC TTCGTGCGCG CTCCCACCGG CGAATATCAG GCACGGACCA TCGATTTCCC GTCGTCGCAG ACCATCCTCG TCGACAAGAA CAATGCCAGC GGCGGGCTGT TTACCAAGGC GGTCGATGCC AATGCGATCA ACAATAGCGG CGCGACGGCG CGCAACTACT ACCTGCTGAA CGCCAACTCG ACGACGCCGG CGACAGGCAC GGAAATTGCC ATCGACAAGA ATACGACCGA CGCGCAACTG ACCGATATGC TGGATGTGAC CGACTCGCTG CTTTCCTCGC TGACGACGAC GGCCGCTTCC ATCGGCGTGA TGAAGACGCG CATCGACGAT CAGATAGACT ATACGGCCGA TCTCTCCGAT TCGATCGACA AGAGCGTCGG TGCGCTCGTC GATACCGACA TGGACGAGGC TTCGATCCGG CAGAAGGCGA TCGAAACCCA GAAGCAGATG GCCGTCGAAG CGATCTCGAT CCTCAACACG GCTTCGAGCA AGATTCTGAT CCTGCTGGAA TAA
|
Protein sequence | MTVKITSAAA VNALAVLRSI NKEASQTQQQ VSSGYRIETA ADDASYWSVA TVMRSDSTNL GTIGDALGLG AAKVDATYTA MNSAIDLMGQ IRAKLVAARE PGTDKDKINA EIDEYKQQLQ TIVESTSFAG ENWLLNGNTT APPTWSVISS FVRAPTGEYQ ARTIDFPSSQ TILVDKNNAS GGLFTKAVDA NAINNSGATA RNYYLLNANS TTPATGTEIA IDKNTTDAQL TDMLDVTDSL LSSLTTTAAS IGVMKTRIDD QIDYTADLSD SIDKSVGALV DTDMDEASIR QKAIETQKQM AVEAISILNT ASSKILILLE
|
| |