Gene Caul_4193 details

Gene Information

Locus tag: Caul_4193
Symbol:
ID: 5901655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4558016
End bp: 4559653
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 69%
IMG OID: 641564715
Product: type II and III secretion system protein
Protein accession: YP_001685815
Protein GI: 167648152
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4964] Flp pilus assembly protein, secretin CpaC
TIGRFAM ID:
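
The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4558016, 4559653)
print(length)                  # 1638
print(protein_length(length))  # 545
```

Both results match the Gene Length and Protein Length fields of the record.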


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCC GGCTCATCTC CGCCTCGCTC GCGGCCGTCC TGGTCCTGGC GCCGTCGACC 
GCCGTCCTCG CCGACGGACC GGTGGGCGGC ACCCATACCT ATCGGCCGCC GGCCCGTGTC
GCCCCGCGCA TCATAGACAC GCCGACCCTG GCGCCGCCCG CCGACCAGGT TCTGCGCATC
GACCTGACCG CCTCTGGCGC CCAGAGCCTG ACCCTGGCGC GCGGCAAGTC GGCGATCGTC
GAGCTGCCGG TCGACGTGCG CGACATGCTG GTGACCAATC CGGCCGTGGC CGACGCCGTG
CTGCGCGGCC CGCGCCGGAT CTACGTGCTG GGCATGGGCC AGGGCGTGAC CGACGCGGTG
TTCTTCGACG CCGCCGGCCG CAAGATTCTC AGCCTGGCCA TCCGCGTCGA TCAGGATTCC
TCCGCGCTGC AGGACACCCT GCGCCGCGTG CTGCCCGCCG CGAACATCGA GGTCCAGTCG
ATCCGCGACA GCGTCATCCT GACCGGCATG GTCGCCAATG TCGGCGAATC CACCATGGCC
TCGCAGATCG CCGCCCGCTT CGTCGACAAG CCCGATAACG TGCTGAACAT GCTGACCATC
GCCGGCAAGG ACCAGGTGAT GCTCAAGGTC CGCATTGTCG AGGTCCAGCG CAACATCATC
AAACAGCTGG GCGTCGACTC CAGCGCCGTG CTCGGCCAGC TGGGCGAGAC CCAGTACGCC
TTCGGCCTGT CGCCCAGCTA CGGCATCAAC GGCAAGCTGC TGGGCGGCCT GACCGGCGGA
TACAAGGCCG ACACCACCAA GCAACCTACG CTTCAGGTGC CCTGTAACGC CGCTCTGCCG
GACGGCGCCA AGTGCCTGCA GATCGTTCGC GACAACAGCG TCTACTCCAA TGGCGACACG
GCCACGATCA CCGACACCGC CGGCAGCGCC GGTCTGAACT CGGCCAAGGG CATGATCCAG
GCGTTCGAGC GCGTGGGCCT GGTGCGCACC CTGGCCGAAC CCAACCTGAC CGTGGTGTCC
GGCGAGGCCG GCAAGTTCCT GGTCGGCGGC GAGTTCCCGG TGCCGGTCGG CCAGGACGCC
ACCGGCAAGG TGACGATCGA GTTCAAGCCC TACGGCGTGG GGCTGGGCTA TACGCCCATC
GTCCTGTCAG GCGGACGCAT CTCGCTGAAG CTGTCGACCG AGGTCTCGGA GCTGAGCAGC
CTGGGCGCCT TCACCCTGAC CACCAGCACC GGCACCTCCA CCAGTTCGAA CCTGACGGTG
CCGGGCCTGA CCGTCCGGCG CGCCGAGACC ACGGTCGAGC TGCCGTCCGG TGGCTCGCTG
ATGATCGCCG GCCTGCTGCA GCAGCAGACC CGCCAGAACA TCGACGCCTT GCCGGGCATG
ACCAGCCTGC CGATCCTGGG CGCCCTGTTC CGCTCGCGGG ACTATCTGAA CGGCGAGACC
GAACTGGTGG TGATCATCAC CCCCTATATC GTCGACCCGA CCAAGCCCCA GAACCTGCAG
ACCCCGGCAG ACGGCCTGCA AACGGCCAGC GACATGAGCA CGGCCCTGCT GGGCCGGCTG
AACAAGGTGG TCAAGGCGCC GGTCGGCGCG AACAGCGGAC GCGCCTACCA GGGCCCCGTC
GGCTATGTGA TCGAGTGA
 
Protein sequence
MIRRLISASL AAVLVLAPST AVLADGPVGG THTYRPPARV APRIIDTPTL APPADQVLRI 
DLTASGAQSL TLARGKSAIV ELPVDVRDML VTNPAVADAV LRGPRRIYVL GMGQGVTDAV
FFDAAGRKIL SLAIRVDQDS SALQDTLRRV LPAANIEVQS IRDSVILTGM VANVGESTMA
SQIAARFVDK PDNVLNMLTI AGKDQVMLKV RIVEVQRNII KQLGVDSSAV LGQLGETQYA
FGLSPSYGIN GKLLGGLTGG YKADTTKQPT LQVPCNAALP DGAKCLQIVR DNSVYSNGDT
ATITDTAGSA GLNSAKGMIQ AFERVGLVRT LAEPNLTVVS GEAGKFLVGG EFPVPVGQDA
TGKVTIEFKP YGVGLGYTPI VLSGGRISLK LSTEVSELSS LGAFTLTTST GTSTSSNLTV
PGLTVRRAET TVELPSGGSL MIAGLLQQQT RQNIDALPGM TSLPILGALF RSRDYLNGET
ELVVIITPYI VDPTKPQNLQ TPADGLQTAS DMSTALLGRL NKVVKAPVGA NSGRAYQGPV
GYVIE
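
The gene and protein sequences above are printed in space-separated blocks of ten, so any script that reads them has to strip the whitespace first. The following is a minimal sketch in Python of how the two sequences relate, assuming only the standard library; `clean`, `gc_fraction`, and `translate` are hypothetical helper names. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGATCCGCC GGCTCATCTC CGCCTCGCTC"))  # MIRRLISASL
print(translate("GGCTATGTGA TCGAGTGA"))               # GYVIE
print(gc_fraction("ATGATCCGCC"))                      # 0.6
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.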