Gene Caul_4190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4190 
Symbol 
ID5901652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4554196 
End bp4555710 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content68% 
IMG OID641564712 
Producttype II secretion system protein E 
Protein accessionYP_001685812 
Protein GI167648149 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGCA AGCGCGACCA GGCCGCCGTT CGGACAGAGC AGGCGCCGCC GCCCGTCTCC 
AACGATGGGC CGGCCATCGC CACGCGCCCG CAGCGCGTCG ATCCCGCCGC CAACGAGCCG
GCCGCCACCG CCGCGCCCAG CCCCGCGCCC GGCAAGGCGG CGGGTCCCAG GGTCACCAAC
GGCCTGGAAC AACTGCGCGC CGCCCAGGGC GCGCCGCCGA CCACCAACGT GGTCCGCGAG
CAGAGCGACT ATTACCACGC CACCAAGACC ACGATCTTCA ACGCGCTGCT CAACACCATC
GACCTGAGCC AACTGGCCCA ACTGGACCTG AAGGCGGCGG CCGAAGAGAT CCGCGACATC
GTCGCCGAGC TGGTGGCGAT CAAGAACGTC TCGATGTCGG TCTCCGAGCA GGAGCACCTG
GTCCAGGACA TCATCAACGA CGTCCTCGGC TATGGCCCGC TCGAGCCCCT GCTGGCCCGC
GACGACATCG CCGACATCAT GGTCAACGGC GCACACCGGG TGTTCATCGA AGTCGGCGGC
AAGGTCCAGC TGACCAATGT CCGCTTCCGC GACAATCTGC AGCTGATGAA CATCTGCCAG
CGGATCGTCA GCCAGGTCGG CCGCCGGGTC GACGAAAGCA GCCCGATCTG CGACGCCCGC
CTGCCCGACG GCAGCCGCGT CAACGTCATC GCTCCGCCCT TGGCGCTGGA TGGTCCCACC
CTGACCATCC GCAAGTTCAA GAAGGACAAG CTGACGATGA AGAACCTGGT CGACTACGCG
TCGATCAGCC CGGAAGGGGC GCGGGTCCTG GGCGTGATCG GCGCCTGCCG CTGCAACATC
GTCATCTCGG GCGGCACCGG CTCGGGCAAG ACCACCCTAC TCAACACCAT GACCGCCTTC
ATCGACCCGA CCGAGCGGGT GGTGACCTGC GAGGACGCGG CCGAACTGCA GCTGCAGCAG
CCGCACGTGG TGCGCCTGGA AACCCGGCCG CCGAACCTGG AAGGCCAGGG CGCGGTGACC
ATGCGCGACC TGGTCAAGAA CTGCCTGCGG ATGCGTCCCG AACGGATCAT CGTCGGCGAA
GTCCGCGGCC CGGAGGCGTT CGACCTGTTG CAGGCCATGA ACACCGGCCA CGACGGATCG
ATGGGCACGC TGCACGCCAA CAGCCCGCGC GAGGCGATCA GCCGGATCGA GAGCATGATC
ACCATGGGCG GCTACGGCCT GCCCTCCAAG ACCATCAAGG AGATGATCGT CGGTTCGGTC
GACGTCATCA TCCAGGCCGC CCGCCTGCGC GACGGCACGC GCCGCATCAC CCACATCACC
GAGGTGGTGG GGCTGGAAGG CGACGTGATC GTCACCCAGG ATCTGTTCGT CTACGAGATC
AGCGGCGAGG ACGCCGCAGG TAACGTGGTC GGCAAGCACC GCTCGACAGG GATCGGCCGC
CCCCGGTTCT GGGACCGCGC CCGCTATTAC GGCCTGGAGC GCGAGCTGGC CGAAGCCCTC
GACGCGGCGG AGTAG
 
Protein sequence
MFGKRDQAAV RTEQAPPPVS NDGPAIATRP QRVDPAANEP AATAAPSPAP GKAAGPRVTN 
GLEQLRAAQG APPTTNVVRE QSDYYHATKT TIFNALLNTI DLSQLAQLDL KAAAEEIRDI
VAELVAIKNV SMSVSEQEHL VQDIINDVLG YGPLEPLLAR DDIADIMVNG AHRVFIEVGG
KVQLTNVRFR DNLQLMNICQ RIVSQVGRRV DESSPICDAR LPDGSRVNVI APPLALDGPT
LTIRKFKKDK LTMKNLVDYA SISPEGARVL GVIGACRCNI VISGGTGSGK TTLLNTMTAF
IDPTERVVTC EDAAELQLQQ PHVVRLETRP PNLEGQGAVT MRDLVKNCLR MRPERIIVGE
VRGPEAFDLL QAMNTGHDGS MGTLHANSPR EAISRIESMI TMGGYGLPSK TIKEMIVGSV
DVIIQAARLR DGTRRITHIT EVVGLEGDVI VTQDLFVYEI SGEDAAGNVV GKHRSTGIGR
PRFWDRARYY GLERELAEAL DAAE