Gene Information |
Locus tag | Caul_1017 |
Symbol | flhA |
ID | 5898472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1077295 |
End bp | 1079406 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561499 |
Product | flagellar biosynthesis protein FlhA |
Protein accession | YP_001682645 |
Protein GI | 167644982 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1298] Flagellar biosynthesis pathway, component FlhA |
TIGRFAM ID | [TIGR01398] flagellar biosynthesis protein FlhA |
|
|
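The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on the + strand.
START_BP = 1077295
END_BP = 1079406


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of amino acids encoded by a CDS that includes its stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2112
print(protein_length(length_bp))  # 703
```

Both values agree with the Gene Length (2112 bp) and Protein Length (703 aa) rows of the record.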
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.308757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACG CCGCCGTCCC CAAGACCGCT TCCTCGATGC CCAGCGCCAG TTCGCTGTGG GCGGGGATCC TGCGCGGCGA AATGGGTCTG GCCCTGGGCG TCGTCGGCAT CATCGTCCTG CTGATCATCC CGGTTCCGGC GATGATGCTG GACCTGCTGC TGGCCATTTC GCTGACCGGC GCCGTGCTGA TCCTGATGAC GGCGGTGCTG ATCAAGAAGC CGCTGGAATT CACCTCGTTC CCGACCGTCC TGCTGGTCGC CACCCTCTAT CGCCTGGGGC TGAACATCGC CTCGACGCGC CTGATCCTCG GCCACGGCCA GGAAGGCGCG CACGGGGCCG GCGCGGTGAT CGCGGCGTTC GGCAACCTGA TGATGCAGGG CAATTTCGTC ATCGGGGTGA TCGTCTTCAT CATCCTGGTG GTGGTGAACT TCATGGTCGT CACCAAGGGT TCGGGCCGGA TCGCCGAAGT CGCCGCCCGC TTCACCCTGG ACGCCATGCC CGGCAAGCAG ATGGCCATCG ACGCCGACCT CTCTACCGGC CTGATCGACC AGGACACCGC CAAGCAGCGC CGCAAGGACC TGGAGCAGGA ATCGACCTTC TTCGGGGCTA TGGACGGCGC GTCCAAGTTC GTGAAGGGCG ACGCCGTCGC CGGCCTGATC ATCACCGCCA TCAATGTCAT CGGCGGCATC CTGATCGGCG TCGTCCAGCA CAAGATGCCG ATCGGCGAGG CCTCGGCCAG CTACACCCTG ATGACCATCG GCGACGGCTT GGTCAGCCAG ATCCCGGCCC TGATCATCTC GATCGCCGCC GGCCTGGTGG TGTCCAAGGC CGGGGTCGAG GGCAGCGCCA ACGCCGCCCT GACCACCCAG CTGGCCATGA ACCCGGTGGC CTTGGGCATG GTCTCGGCCT CGTCGGGCGT CATCGCCCTG ATCCCGGGCA TGCCGATCAT TCCCTTCGCG GCCCTGGCGG CGGGTTCGGG ATACCTGGCC TATCGCCGCG CCCTCAAGGC CAAGGAGCCC AAGCCGCTCG ACCCCGCCGC CCTGGCCGCC CTGGCCGAGG CCGCCGCCGA GCCCGAGGAG GAGCCGATCA GCGCCTCCCT GGCCATCGAC GACGTCAAGA TCGAGCTGGG CTACGGCTTG CTGACCCTGA TCAACGACCT GGACGGCCGC AAGCTGACCG ACCAGATCCG CGCCCTGCGC AAGACCCTGG CCACCGAGTT CGGCTTCGTC ATGCCGCCGG TGCGGATCCT CGACAACATG CGCCTGGCCA ACCAAGGCTA TGCGATCCGC ATCAAGGAGA TGGAAGCCGG GGCCGGCGAG GTCCGCCTGG GCTGCCTGAT GGCCATGGAC CCGCGCGGCG GCCAGGTGGA GCTGCCCGGC GAGCACGTGC GCGAACCCGC CTTCGGCCTG CCCGCCACCT GGGTCGAGGA AGCCATGCGC GAGGAAGCCA CCTTCCGCGG CTACACCATC GTCGATCCCG CCACCGTGCT GACCACGCAC CTGACCGAGA TCCTCAAGGA GAACATGGCC GACCTGCTCT CCTACGCCGA GGTGCAGAAA CTGCTGAAGG ACCTGCCCGA GGGCCAGAAG AAGCTGGTCG ACGACCTGAT CCCCTCGGTG GTCAGCGCCG CGACCATCCA GCGCGTCCTG CAGGCCCTGC TCAAGGAGCG CGTGTCGATC CGCGACCTGC CGCAGATCCT GGAAGGCGTC GGCGAGGCCG CGCCGCACAC CGCTTCGGTG GTGCAACTGA CCGAGCAGGT CCGCGCCCGC CTGGCCCGCC AGCTGTGCTG GGCCAATCGC 
GGCGAGGACG GCGCCCTGCC GATCATCACC CTGTCGGCCG AGTGGGAGCA GGCCTTCGCC GAGGCCCTGG TCGGCCCGGG CGAGGACAAG CAACTGGCCC TGGCGCCCTC GCGCCTCCAG GAGTTCATCC GCGGCGTGCG CGACGCCTTC GACCAGGCCG CCATGGCCGG CGACCAGGCC GTGCTGCTGA CCAGCCCCGG CGTGCGCCCC TATGTCCGCT CGATCATCGA GCGCTTCCGC GGCCAGACCG TGGTGATGAG CCAGAACGAG ATCCATCCGC GTGCGCGGCT GCGCACGGTG GGGATGGTCT AG
|
Protein sequence | MADAAVPKTA SSMPSASSLW AGILRGEMGL ALGVVGIIVL LIIPVPAMML DLLLAISLTG AVLILMTAVL IKKPLEFTSF PTVLLVATLY RLGLNIASTR LILGHGQEGA HGAGAVIAAF GNLMMQGNFV IGVIVFIILV VVNFMVVTKG SGRIAEVAAR FTLDAMPGKQ MAIDADLSTG LIDQDTAKQR RKDLEQESTF FGAMDGASKF VKGDAVAGLI ITAINVIGGI LIGVVQHKMP IGEASASYTL MTIGDGLVSQ IPALIISIAA GLVVSKAGVE GSANAALTTQ LAMNPVALGM VSASSGVIAL IPGMPIIPFA ALAAGSGYLA YRRALKAKEP KPLDPAALAA LAEAAAEPEE EPISASLAID DVKIELGYGL LTLINDLDGR KLTDQIRALR KTLATEFGFV MPPVRILDNM RLANQGYAIR IKEMEAGAGE VRLGCLMAMD PRGGQVELPG EHVREPAFGL PATWVEEAMR EEATFRGYTI VDPATVLTTH LTEILKENMA DLLSYAEVQK LLKDLPEGQK KLVDDLIPSV VSAATIQRVL QALLKERVSI RDLPQILEGV GEAAPHTASV VQLTEQVRAR LARQLCWANR GEDGALPIIT LSAEWEQAFA EALVGPGEDK QLALAPSRLQ EFIRGVRDAF DQAAMAGDQA VLLTSPGVRP YVRSIIERFR GQTVVMSQNE IHPRARLRTV GMV
|
| |
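The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC percentage such as the one in the GC content row could be recomputed; `clean_sequence` and `gc_percent` are hypothetical helper names, and the exact rounding used by the source database is not stated in the record, so a recomputed value may differ slightly from the reported 69%.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record above.
# This is only a 30-base excerpt, not the full 2112 bp gene.
first_blocks = "ATGGCTGACG CCGCCGTCCC CAAGACCGCT"

print(len(clean_sequence(first_blocks)))  # 30
print(round(gc_percent(first_blocks), 1))
```

To check the full gene, paste the complete Gene sequence value into `gc_percent`; the whitespace between blocks is ignored.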