Gene Caul_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2089 
Symbol 
ID5899544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2237252 
End bp2238931 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content68% 
IMG OID641562578 
Productgeneral secretory pathway protein E 
Protein accessionYP_001683715 
Protein GI167646052 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.86871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAGTG TCGAGCGGCG ATCCTTTCAA AACTTTATTA TAAACAGCGA TCTGATTACC 
GCTGAGTCCT TGGTGCGGGC CAAGGCGGTT CAGTGCGAGA GCGGCGAGCG CATGGACGCG
GTCCTGACGC GGCTGGGGTT GGTCACCGAG CGCGCCTTGG CCGACGCTTT GGCCAAGGCG
ACGGGCCTAT CGCAAGTCGA TGGCGACGAT TTTCCGGCTT CGGCGGTCGG CGGGGAGCGC
GTTTCGCCGC GCTTCCTGCG CGAGACCAAG GCCATTCCCC TCGCCCTCGA CGACGAGGCG
CTGCGCGTGG CCCTGATCGA TCCGCTCGAC GACTACGTGA TCGCGGCGCT GAGCTACGCG
TTCGAGCGGC CCGTGAAGCC GGCGGTCGCG CGGGCTGGCG ATCTGGACGC GGCGCTGGAC
CGTCTCTACG GGCCGGCGAC CGACGCGGTC GCCGAGGCGG CCGACGAGGC CGACGAGGCC
GACCTCGATC GGCTGAAGGA TCTGGCCAGC GACGCGCCGG TGGTGCGCGC GGTGAACGCC
CTGATTTCGC GCGCCGCCGA ATTGCGTGCG TCCGACATCC ATGCCGAGCC CACCGAGGAC
GGCCTGAAGG TACGCTTCCG GATCGATGGA GTGCTGGTCG ATCAGGAAAT CCTGCCTCAC
CAGGTGAAGG CCGCCTTCGT CTCGCGCGTG AAGGTCCTGG CCAATCTGAA CATCGCCGAA
CGCAGGCTGC CGCAGGACGG GCGGATGAGG CTGGCGGTGC GAGGGCAGGA GATTGACCTG
CGGGTTGCCA CCGCGCCGAC CCTGCACGGC GAAAGCGTGG TGCTGCGGCT CCTCGACCGC
TCGAACCTGT CGCTGGACTT CGACGCCCTG GGATTCGACG ATACGATCCT GCCGTCGTTC
CAGGATGTCC TGGCGCGGCC CCACGGAATC GTTCTGGTCA CCGGTCCCAC CGGCAGCGGC
AAGACGACCA CGCTTTACGC CGCGCTCGCG TCGCTGAACT CGCCCACGCG CAAGATCCTC
ACCATCGAAG ACCCGATCGA ATACCGGCTG GCCGGCGTGA ACCAGACGCA GGTCAGCCCG
CAGATCGGCC TGACCTTCGC CACCGCCCTG CGGTCCTTCC TGCGTCAGGA CCCCGACGTG
ATGATGGTGG GCGAGATCCG GGACCTGGAG ACCGCGCAGG TCGCGGTCCA GTCGGCCCTG
ACCGGCCACA CCATCCTGTC GACCCTGCAC ACCAACAGCG CGGCCGCCGC GGTGACCCGG
CTGATCGACA TGGGCATGGA GCCCTTCCTG ATCAGCTCCA CCGTCAATGC CGTCCTGGCC
CAGCGGCTGG TGCGCAGGTT GTGCCGCAGC TGTCGCACGT CGCACGTGGC CGATGCGCGA
GAGCTGTCGG TGCTGGAGGC CCAGGCCGGC GAACGGGGCC GCGACCCGAT CCGTCTGTGG
AGCGCGCCCG GCTGCGCCGA TTGCGGCCAT GCCGGCTTCA AGGGGCGGCT GGCCATCCTG
GAACTGTTGC CGGTGGACGA CAGGATCGCG CGCCTAGTGC TGGCCCGCGC CGAGGCTCGC
GAGATCGAGC GCGCCGCGGT CGCGGCCGGC ATGCGCACGA TGCTGCAGGA CGGCATGGCC
AAGGCGATGG CGGGTCTGAC AACCATCGAC GAAGTCCTGC GCGTGACAAG GGAGGACTAA
 
Protein sequence
MVSVERRSFQ NFIINSDLIT AESLVRAKAV QCESGERMDA VLTRLGLVTE RALADALAKA 
TGLSQVDGDD FPASAVGGER VSPRFLRETK AIPLALDDEA LRVALIDPLD DYVIAALSYA
FERPVKPAVA RAGDLDAALD RLYGPATDAV AEAADEADEA DLDRLKDLAS DAPVVRAVNA
LISRAAELRA SDIHAEPTED GLKVRFRIDG VLVDQEILPH QVKAAFVSRV KVLANLNIAE
RRLPQDGRMR LAVRGQEIDL RVATAPTLHG ESVVLRLLDR SNLSLDFDAL GFDDTILPSF
QDVLARPHGI VLVTGPTGSG KTTTLYAALA SLNSPTRKIL TIEDPIEYRL AGVNQTQVSP
QIGLTFATAL RSFLRQDPDV MMVGEIRDLE TAQVAVQSAL TGHTILSTLH TNSAAAAVTR
LIDMGMEPFL ISSTVNAVLA QRLVRRLCRS CRTSHVADAR ELSVLEAQAG ERGRDPIRLW
SAPGCADCGH AGFKGRLAIL ELLPVDDRIA RLVLARAEAR EIERAAVAAG MRTMLQDGMA
KAMAGLTTID EVLRVTRED