Gene Caul_2890 details

Gene Information

Locus tag: Caul_2890
Symbol:
ID: 5900345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3138082
End bp: 3139881
Gene Length: 1800 bp
Protein Length: 599 aa
Translation table: 11
GC content: 68%
IMG OID: 641563387
Product: TPR repeat-containing protein
Protein accession: YP_001684515
Protein GI: 167646852
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
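
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon (the gene sequence in this record ends in TAG). The function names span_length and expected_protein_length are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
gene_length = span_length(3138082, 3139881)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1800 599"
```

Both results agree with the Gene Length (1800 bp) and Protein Length (599 aa) fields listed above.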


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0313082
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGCA GTCGCGACAT CCAGATCTCG GCCAGCGCGC TGGCCATGGC CCAGGAGGCC 
GTGGCCCAGG CGGTTAATCT GGGCTTCAAG GCCACGGCCG TCGGCGACGC CGCCTCGGCC
GACGCCTTGG CGCGGCTGGA TCGCCAGACC CACGAGGTCC ACAACCACCA GACCGCCAAG
CTGCTGGCCC AGTCAATCCA CGCCATGCAG AACCGCGACT TCGCCAAGGG CGAGAAGCTG
GCGCTCAAGG CGCTGGAGCG CGACGACAAG CTGGGCGTGG CCTGGCACGT GCTGGGGATC
GCCCGCGAGA AGATCGGCGA CTTCGCTGGC TCGATGCGCT GCTACGAGGC GGCCCTGAAG
CTGTTGCCCG ACCACGGCCC GGTGGCTGGC GACCTGGGGC GTCTGGCCTT CCGCATGGAC
ATGCCGGAGA TCGCCGCCAA GTTCTTCATG CACTATCTGA ACGCCCGGCC CGGCGACCTG
GAGGGCGTCA ACAACCTGGC CTGCGCCCTG CGCGACCTCA ATCGCTGCGA CGACGCCATC
GAGGTCCTGC GTCCGGCCAT CAACGAGCAC CCCGAGCAGC CTCTGCTGTG GAACACGCTG
GGCACCGTGA TGTGCAGCCT GGGCGACGGC AAGACCGCCG TGACCTTCTT CGACGAGACG
CTGCGACTGG CCCCGGAGTT CGGCAAGGCC TATCACAACC GCGCCTACGC CAAGCTCGAC
CTCGGCGATG TCGAGGGCGC CCTGGCCGAC TGTGATCTGG CCATCGCCGT GGCCGAGTCG
GCCGAGGACC TGGCGACCAT GCAGTTTGGC CGCGCCACGA TCCTGCTGGC CCTGGGCCGC
GTCGAGGAGG GTTGGAAGAC CTACGAGGCC CGGTTCTCCA AGGATCTGCT CGAGGCCCCG
CGCTTCACCA TCGACGAACC GCGCTGGCGC CCCGGCATGG ACCTGGCCGG CAAATCGCTG
ATGATCTGCG CCGAGCAGGG CCTGGGCGAC GAGGTGATGT TCGCCAACAT GCTGCCCGAC
GTGATCGAGG CGCTGGGTCC GGACGGCAAG CTGACCCTCG CGGTCGAGCG CCGGCTGATC
CCATTGTTCA AGCGCTCGTT CCCCGGCGTC ACCGTCGTTC CGCACCGCAC CGTCTCCTAC
GAGGGCCGGG TTTATCGCGG CGCGCCAGAG ATCGAGGACT GGAGCGGTTT CGACCTGTGG
ACCCCGATGG GCTCGCTGCT GGAAGTCTTC CGCCCCAGCG TCTCCGCGTT CCCCAACCGC
CCCAACTTCC TGAACGCCGA TCCGGCGCGG GTCGCGCACT GGCGGGGCGA ACTGGAGAAG
GCGCCCAAGG GTCCCAAGGT CGGCCTGCTG TGGAAGAGCC TGAAGCTGAA CGGCGAGCGC
GCCCGCCAGT TCTCGCCCTT CCTGCTGTGG CGGCCGGTGT TCGAAACCCC GGGCGTCACC
TTCGTCAACC TGCAGTACGG CGACTGCGGC GAGGAGATCG CCCTGGCCAA GTCGGAGTTC
GGCGTCGACA TCTGGCAGCC GCCGGGCATC GACCTGAAGC AGGACCTCGA CGACGTCGCC
GCCCTGTGCT GCGCCATGGA TCTGATCGTC GGCTTCTCCA ACGCGACGAC CAATCTGGGC
GGGGCCTGCG GGGCGCCGAT CTGGATGTTG ACGGGCGCCT CGTCCTGGAC CCGACTGGGC
GCCCAGTCAT GGCCCTGGTA CCCGCAAACC CGTTGCTTCA TCACGCCGGA CTACAACGAC
TGGAATCCGA CGATGCAGGA CGTTGGAACG GCGTTGCGCG AACGCTTCCC GGCGGGATAG
 
Protein sequence
MTRSRDIQIS ASALAMAQEA VAQAVNLGFK ATAVGDAASA DALARLDRQT HEVHNHQTAK 
LLAQSIHAMQ NRDFAKGEKL ALKALERDDK LGVAWHVLGI AREKIGDFAG SMRCYEAALK
LLPDHGPVAG DLGRLAFRMD MPEIAAKFFM HYLNARPGDL EGVNNLACAL RDLNRCDDAI
EVLRPAINEH PEQPLLWNTL GTVMCSLGDG KTAVTFFDET LRLAPEFGKA YHNRAYAKLD
LGDVEGALAD CDLAIAVAES AEDLATMQFG RATILLALGR VEEGWKTYEA RFSKDLLEAP
RFTIDEPRWR PGMDLAGKSL MICAEQGLGD EVMFANMLPD VIEALGPDGK LTLAVERRLI
PLFKRSFPGV TVVPHRTVSY EGRVYRGAPE IEDWSGFDLW TPMGSLLEVF RPSVSAFPNR
PNFLNADPAR VAHWRGELEK APKGPKVGLL WKSLKLNGER ARQFSPFLLW RPVFETPGVT
FVNLQYGDCG EEIALAKSEF GVDIWQPPGI DLKQDLDDVA ALCCAMDLIV GFSNATTNLG
GACGAPIWML TGASSWTRLG AQSWPWYPQT RCFITPDYND WNPTMQDVGT ALRERFPAG