Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2089 |
Symbol | |
ID | 5899544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2237252 |
End bp | 2238931 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562578 |
Product | general secretory pathway protein E |
Protein accession | YP_001683715 |
Protein GI | 167646052 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.86871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAGTG TCGAGCGGCG ATCCTTTCAA AACTTTATTA TAAACAGCGA TCTGATTACC GCTGAGTCCT TGGTGCGGGC CAAGGCGGTT CAGTGCGAGA GCGGCGAGCG CATGGACGCG GTCCTGACGC GGCTGGGGTT GGTCACCGAG CGCGCCTTGG CCGACGCTTT GGCCAAGGCG ACGGGCCTAT CGCAAGTCGA TGGCGACGAT TTTCCGGCTT CGGCGGTCGG CGGGGAGCGC GTTTCGCCGC GCTTCCTGCG CGAGACCAAG GCCATTCCCC TCGCCCTCGA CGACGAGGCG CTGCGCGTGG CCCTGATCGA TCCGCTCGAC GACTACGTGA TCGCGGCGCT GAGCTACGCG TTCGAGCGGC CCGTGAAGCC GGCGGTCGCG CGGGCTGGCG ATCTGGACGC GGCGCTGGAC CGTCTCTACG GGCCGGCGAC CGACGCGGTC GCCGAGGCGG CCGACGAGGC CGACGAGGCC GACCTCGATC GGCTGAAGGA TCTGGCCAGC GACGCGCCGG TGGTGCGCGC GGTGAACGCC CTGATTTCGC GCGCCGCCGA ATTGCGTGCG TCCGACATCC ATGCCGAGCC CACCGAGGAC GGCCTGAAGG TACGCTTCCG GATCGATGGA GTGCTGGTCG ATCAGGAAAT CCTGCCTCAC CAGGTGAAGG CCGCCTTCGT CTCGCGCGTG AAGGTCCTGG CCAATCTGAA CATCGCCGAA CGCAGGCTGC CGCAGGACGG GCGGATGAGG CTGGCGGTGC GAGGGCAGGA GATTGACCTG CGGGTTGCCA CCGCGCCGAC CCTGCACGGC GAAAGCGTGG TGCTGCGGCT CCTCGACCGC TCGAACCTGT CGCTGGACTT CGACGCCCTG GGATTCGACG ATACGATCCT GCCGTCGTTC CAGGATGTCC TGGCGCGGCC CCACGGAATC GTTCTGGTCA CCGGTCCCAC CGGCAGCGGC AAGACGACCA CGCTTTACGC CGCGCTCGCG TCGCTGAACT CGCCCACGCG CAAGATCCTC ACCATCGAAG ACCCGATCGA ATACCGGCTG GCCGGCGTGA ACCAGACGCA GGTCAGCCCG CAGATCGGCC TGACCTTCGC CACCGCCCTG CGGTCCTTCC TGCGTCAGGA CCCCGACGTG ATGATGGTGG GCGAGATCCG GGACCTGGAG ACCGCGCAGG TCGCGGTCCA GTCGGCCCTG ACCGGCCACA CCATCCTGTC GACCCTGCAC ACCAACAGCG CGGCCGCCGC GGTGACCCGG CTGATCGACA TGGGCATGGA GCCCTTCCTG ATCAGCTCCA CCGTCAATGC CGTCCTGGCC CAGCGGCTGG TGCGCAGGTT GTGCCGCAGC TGTCGCACGT CGCACGTGGC CGATGCGCGA GAGCTGTCGG TGCTGGAGGC CCAGGCCGGC GAACGGGGCC GCGACCCGAT CCGTCTGTGG AGCGCGCCCG GCTGCGCCGA TTGCGGCCAT GCCGGCTTCA AGGGGCGGCT GGCCATCCTG GAACTGTTGC CGGTGGACGA CAGGATCGCG CGCCTAGTGC TGGCCCGCGC CGAGGCTCGC GAGATCGAGC GCGCCGCGGT CGCGGCCGGC ATGCGCACGA TGCTGCAGGA CGGCATGGCC AAGGCGATGG CGGGTCTGAC AACCATCGAC GAAGTCCTGC GCGTGACAAG GGAGGACTAA
|
Protein sequence | MVSVERRSFQ NFIINSDLIT AESLVRAKAV QCESGERMDA VLTRLGLVTE RALADALAKA TGLSQVDGDD FPASAVGGER VSPRFLRETK AIPLALDDEA LRVALIDPLD DYVIAALSYA FERPVKPAVA RAGDLDAALD RLYGPATDAV AEAADEADEA DLDRLKDLAS DAPVVRAVNA LISRAAELRA SDIHAEPTED GLKVRFRIDG VLVDQEILPH QVKAAFVSRV KVLANLNIAE RRLPQDGRMR LAVRGQEIDL RVATAPTLHG ESVVLRLLDR SNLSLDFDAL GFDDTILPSF QDVLARPHGI VLVTGPTGSG KTTTLYAALA SLNSPTRKIL TIEDPIEYRL AGVNQTQVSP QIGLTFATAL RSFLRQDPDV MMVGEIRDLE TAQVAVQSAL TGHTILSTLH TNSAAAAVTR LIDMGMEPFL ISSTVNAVLA QRLVRRLCRS CRTSHVADAR ELSVLEAQAG ERGRDPIRLW SAPGCADCGH AGFKGRLAIL ELLPVDDRIA RLVLARAEAR EIERAAVAAG MRTMLQDGMA KAMAGLTTID EVLRVTRED
|
| |