Gene Caul_4191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4191 
Symbol 
ID5901653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4555715 
End bp4557301 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content72% 
IMG OID641564713 
Productpilus assembly protein CpaE 
Protein accessionYP_001685813 
Protein GI167648150 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4963] Flp pilus assembly protein, ATPase CpaE 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGC GCACCGATCA CGACCCCTTC GACCTGGGCT TCGAGGCCGA CGACGAGTTC 
GCCGCGCCGG GTTCCGATCC GTGGCGCGCC GCGTCGTCGC CGCCGTCGCG CGCCGAGGAT
CCGTTCGCCG ACTTCCCGCC CGCCCGCCCG GGCGAGAGCG CGCCCTCGGC GTTCATGGAC
CTGCCGCCGT CCGCGCCGCC CTATGTTCCA AAGGCGCCGG CCGCCGCCGT CGCGTCGCAG
TCGCCGTCGC CGACGGTCGC CCCAGCCGCA CCCAAGGTCG CCCAGCCCGT GGCGGTCGAC
GCCGCCATCG CCGTCGCGCC CGCGGTCCAT CCGGTCGGCA GCACCCAGGC CATGGTCCAG
GAGGTGGTCG CCGCCGCCGA GGCCGACATG GGCGAGGCGG CCGTGCCGCG CATCACCATC
CACGCCTTCT GCGCCCGGCC CGAGACCGTC GCCCTGGTCG AGGCCGCCTC GGCCGACCGT
CGCATGGTTC GCGCCTCGAC CGTCGCCCGG CCCGGCGGCC TGGCCGCCGC CGTCGACTAC
TACCAGAACC AGTCCACCCC CTCGCTGGTG CTGGTCGAGA GCCTGGATTC CGCGCCGCTG
ATGCTGTCCC TGCTGGACGG CCTGGCCCAG GTCTGCGACC CGGGCACCAA GGTCGTGGTC
ATCGGCCAGA CCAACGACAT CGCCCTCTAT CGCGAACTGA TGCGCCGCGG CGTCAGCGAA
TACCTGACCC AGCCGTCCGG CCCGCTGCAG ATCATCCGCG CGGTGTCGAA CCTCTATGCC
GATCCGTCCG CGCCGTTCGT CGGCCGGCAG ATCGCCTTCG TCGGCGCCAA GGGCGGCGTC
GGCTCCTCGA CCCTGGCCCA CAACTTCGCC TGGTCGATGG CCGAGCGCAT CCAGGCCGCC
ACCGTGATGG TCGACCTGGA CCTGGCGTTC GGGACCGCCG GCCTCGACTT CAACCAGGAC
CCGCTGCAAG GCATCATCGA CGCCCTGGGC CAGCCCGAAC GGCTGGACGC GGTGCTGATG
GACCGGATGA TGGTCCGCTG CGGCGACCGC CTGTCGCTGT TCGCCGCGCC GGGCGCCCTG
GACCAGGACT ACGAGATCCC TGCCGACGCC TTCGAGGAAG TCACCCAGAA GATCCGCGGC
GCCGCGCCGT TCGTGGTGCT GGACCTGCCG CACAGCTGGT CGGCCTGGAC GCGCCGGGTG
CTGATCTCGA GCGACGACCT GGTGGTGGTG GCGACGCCCG ACCTGGCCTC CCTGCGCAAC
GCCAAGAACA TCGTCGACCT GGTCCGCCAG GCCCGACCCA ACGACGCGCC GCCCCGCCTG
GTGCTCAACC AGGTCGGCGT TCCGGGACGT CCCGAGATTC CGGTCAAGGA CTTCGGCGAG
GCCCTGGGCC TGACGCCCTC CCTGGTGCTG CCCTTCGATC CCAAGCCCTT CGGCATGGCC
GCCAACAACG GCCAGATGGT CGCCGAGGTG GCCCCCAAGT CGAAGGCCGC CGAGGGCATC
GACCACCTGG CCCGGCTGAT CAGCCGTCGC GAGCCGCCGC CGGCCCAGAA GGCCTCGGTG
CTCTCCGGCC TGTTCAAGAA GAAGTAG
 
Protein sequence
MTKRTDHDPF DLGFEADDEF AAPGSDPWRA ASSPPSRAED PFADFPPARP GESAPSAFMD 
LPPSAPPYVP KAPAAAVASQ SPSPTVAPAA PKVAQPVAVD AAIAVAPAVH PVGSTQAMVQ
EVVAAAEADM GEAAVPRITI HAFCARPETV ALVEAASADR RMVRASTVAR PGGLAAAVDY
YQNQSTPSLV LVESLDSAPL MLSLLDGLAQ VCDPGTKVVV IGQTNDIALY RELMRRGVSE
YLTQPSGPLQ IIRAVSNLYA DPSAPFVGRQ IAFVGAKGGV GSSTLAHNFA WSMAERIQAA
TVMVDLDLAF GTAGLDFNQD PLQGIIDALG QPERLDAVLM DRMMVRCGDR LSLFAAPGAL
DQDYEIPADA FEEVTQKIRG AAPFVVLDLP HSWSAWTRRV LISSDDLVVV ATPDLASLRN
AKNIVDLVRQ ARPNDAPPRL VLNQVGVPGR PEIPVKDFGE ALGLTPSLVL PFDPKPFGMA
ANNGQMVAEV APKSKAAEGI DHLARLISRR EPPPAQKASV LSGLFKKK