Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2941 |
Symbol | |
ID | 4077092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3111581 |
End bp | 3112588 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638008270 |
Product | flagellar hook-associated protein FlgL family protein |
Protein accession | YP_614935 |
Protein GI | 99082781 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.383256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.308963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGTAC AATCTTTTGG AGACATGGCG CAATATCTCT TCCTGCGTCG CCGCTCGGTC GAGCTGAACT CGACCCTTGA TACGCTCACT CAGGAAATGA GCACTGGGAT CGCCTCAAAC CTGCCAGAGC GGCTAGGTGG AGACCTCGGT TTTGTGGTCG ATCTGGAGCG TTCCATCTCG AAAATGGACA GCTATAAGGT TGCCGCGCAA GAAGCCGGTC TTTTTGCCTC GACCGCGCAG AGCTATCTGG AGCGGATCAA CGAGAGCGCC CTGAAGCTCG GCTCCGACAT TCTGGCGCTT TCGAGCACAT CAAATGACAC GACATCTCAG GAGATGGCGG CGCAGTCTGA GAACTTCCTG ACCGAGACGA TCTCCAATTT GAACGGGAAG TTTTCGGGCC GAAGCCTCTT TGCGGGAACG GATACCAGCG TCACGCCGCT GGAGAGCCCA AACACGATGA TGACAGCGCT GGTGGCGGAA GTTGGTGCTC TGACTGATGT CGATGACATT TTTCAGGCTG TTGAGGATTG GTTCAACGAT CCAGCGGGCT TTGATGCAAT CATGTACAAT GGGTCCACCA GCAGCATGAG CGCGGTTCGT ATCGGCGCCA ATGAAGAGGT GAACTTTGCA TTGCGGGCAG ACGATGAGCG GCTGAAACAA GCCATGCAGA GCTTTGCGCT TGGGGCACTG GCGAGCGAGG ACTATCTATC CCACCTCTCC ACCGTCGAAG CGTCTGCTCT TGTGAAACGT GCCGGGACTG AGCTGATGAA TGCCCAGGCA GAGCTGACAG ATGTGCAGTC CGACCTCGGG TTTGTCGAGG CGCGGATCGA GGAAACGCAG GTTCGAAACA CCGCTGCGCT GACCACGATG GGGACGACCC TGAACGATCT CATCCTCGCG GACCCGGCTG AGACCGCAAG CCGTGTGGAA GAAGTCCAGT TCCAGCTCGA AGCGCTCTAT TCGATTACCG TGCGCTCCTC GCAACTCAAT CTGTTGAACT ACATGTAA
|
Protein sequence | MHVQSFGDMA QYLFLRRRSV ELNSTLDTLT QEMSTGIASN LPERLGGDLG FVVDLERSIS KMDSYKVAAQ EAGLFASTAQ SYLERINESA LKLGSDILAL SSTSNDTTSQ EMAAQSENFL TETISNLNGK FSGRSLFAGT DTSVTPLESP NTMMTALVAE VGALTDVDDI FQAVEDWFND PAGFDAIMYN GSTSSMSAVR IGANEEVNFA LRADDERLKQ AMQSFALGAL ASEDYLSHLS TVEASALVKR AGTELMNAQA ELTDVQSDLG FVEARIEETQ VRNTAALTTM GTTLNDLILA DPAETASRVE EVQFQLEALY SITVRSSQLN LLNYM
|
| |