Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3922 |
Symbol | |
ID | 6982686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4070735 |
End bp | 4071904 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643398645 |
Product | flagellin domain protein |
Protein accession | YP_002283410 |
Protein GI | 209551493 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.172476 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.265134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAACG GCTATCATCA GGCTTATGCT GCCTGGAAAC GGGATCCCGA AGCCTTCTGG CGCGAAGCCG CCGCCGACAT CGACTGGTTT AAACCGCCGG CGCGGGTGTT TTCGCCCGAG GAAGGCGTCT ATGGCCGCTG GTTCTCAGGG GCTGAAACCA ATACCTGCCA CAATTGCCTC GACCGGCATG TGACTGCCGG GCGTGGTGGC GAGATGGCGG TTATTTTCGA CAGTGCGATG ACCGGCGAGA AGCGCCGTTT CACCTACGAC GAAGTCCTTG ACGAAGTGAA GGCCATCGCC GCGACGCTTG TTGATCTCGG GATCGGTCGG GGCGATCGCG TCATCCTCTA TATGCCGATG GTGCCGCAGG CGGTGTTTTC GATGCTCGCC TGCGCCCGCA TCGGTGCGGT TCACTCCGTC GTCTTCGGTG GTTTTGCCGC CAGCGAGCTT GCTGCCCGCA TCGATGATTG CGGTGCGAAG CTGGTGATCA CCGCGAGCTG CGGGCTCGAG CCCGGCCGCA TCGTTGCCTA TAAGCCCCTG GTCGACCAGG CGCTCACGTT GGCGCGTTCG AAGCCGGAGC GCTGTCTGGT GCTGCAGCGG CCGGAGCTTC GGGCGGATCT CGTCAGCGGC CGCGATCAGG ATTTCGAGGC GGCGGTGGCG CAGCATCGCG GCGCCGAGAT CGCCTGTGTT CCGGTCAAGG CGACCGACCC GCTCTATATC CTTTACACCT CGGGAACCAC CGGCCAGCCG AAGGGCGTCG TGCGCGACAC CGGCGCGATC GAGACGAGCA GCGGGATCAT CGGCACGGCC TTCAACGGAA CCTATGGCGG CACCTTCATC GTTATGGCGT CGATCTATGA TCTCGATATC ACCGGTTTCA CCCAGGGCCA GCTCGATTCA GCCCTGACCG GCGTCGAACT GGTTTTGGGT GCCATGACCG CCGCCGGCTC GGCTCTCGGC TCGATCTCGA CCCGTATCCA GCTGCAGGAA AATTTCGTCA GCGGTCTTCA CGATTCGATC GACTCCGGCG TCGGCCGCCT GGTCGATGCC GATATGGAAG AGGAATCGAG CAAGCTGTCG GCGCTGCAGA CGCAGCAGCA GCTCGCCGTC CAGTCGCTGT CGATCGCCAA CAGCTCGGCG CAGAACATCC TCACCCTGTT CCGCAGCTAA
|
Protein sequence | MQNGYHQAYA AWKRDPEAFW REAAADIDWF KPPARVFSPE EGVYGRWFSG AETNTCHNCL DRHVTAGRGG EMAVIFDSAM TGEKRRFTYD EVLDEVKAIA ATLVDLGIGR GDRVILYMPM VPQAVFSMLA CARIGAVHSV VFGGFAASEL AARIDDCGAK LVITASCGLE PGRIVAYKPL VDQALTLARS KPERCLVLQR PELRADLVSG RDQDFEAAVA QHRGAEIACV PVKATDPLYI LYTSGTTGQP KGVVRDTGAI ETSSGIIGTA FNGTYGGTFI VMASIYDLDI TGFTQGQLDS ALTGVELVLG AMTAAGSALG SISTRIQLQE NFVSGLHDSI DSGVGRLVDA DMEEESSKLS ALQTQQQLAV QSLSIANSSA QNILTLFRS
|
| |