Gene Caul_1008 details

Gene Information

Locus tag: Caul_1008
Symbol:
ID: 5898463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1067119
End bp: 1068795
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 76%
IMG OID: 641561490
Product: flagellar hook-length control protein
Protein accession: YP_001682636
Protein GI: 167644973
COG category:
COG ID:
TIGRFAM ID:
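
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1067119, 1068795)
print(length_bp)                  # 1677
print(protein_length(length_bp))  # 558
```

Under those assumptions both results agree with the Gene Length and Protein Length fields of this record.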


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.345301
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.688575
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGA TCGCCGCTCC CGCCTCCGTC CTTCCCGCAG CCCCGGCCCC GGCCGGCGGC 
TACGACAAGG ACGCGGCCGA AGGCTTCGGC GCGCTGCTGG CTCAGGCCGG AAGCGCCAAC
GCCTCCAACG ACAAGGTCTC CACACGGGAC GCCAAGGCGC AGGACGGCAA GGCGTCGAAC
GCCAAGGCCG CCGACGCCAA GGCGGCCGAC CGGGCCGACC GGGCTTCCGA CGCTAAGGCC
GACAAGGCTG ACAAGACGCA GAGCGCCGCC GTCGTCGCCG ACAAGCCGGC TGACGCCAAG
AGCGCCGACG CCGCCAGGGC CTCGGACAAG ACCGCCGACG CCAAGACCCA GGACGGCAAG
GACCAGGCCG CCGCCGAGAC GGGCGTCGCC AAGGACGCCC AGGCCGCCGA CACGGCCTCC
GGCCAGGCCG ACGCCGCTGC CGCCCAAGCC GCCGCTGCGG CCCTGATCGC CGCCATGCAG
GCCCCGGCCG CTCCGGCCGT CCAGCCCGTC GCCGCCGCCG AGACGGCGAC CGCCGCCCAG
GCCGCTCAGG CCGCCGCGAC CGACGCCGCC GCCGCCTTCG CCCCGCTTGT CGCTCGGAAC
GGCGCGCCCA AGCAGGACAC CGCCGTTCCC GCCGCCGCCC CGGCGACCGC CGCGGACACG
GCCGGTCTCG ACCTGCCCGA AACGCCGGCG CTCTCGGCCG CCGACTTGGC CGCCGCCGCA
GCAGCGGCTC AGAAGGCCGC CGCCCAGCCC GCTCAGGCCG CCAAGACCAA CGTCGCCGCC
GCGACCCCCG TCGCCGCCCC GCAAGCCGCG CCGCCCACCG CCGAGGCCGT CAACGCCGCC
CTGGCCGCCG CCGCGCCGAC CTCGGACGTC GCCGAGGCCC CCGTCGCCGC CGAACCCGCG
ACCGCCGCCC AGGTCATCGC GTCGCAAGCC GCCTCGGTGA CGGGCAAGAT CGTCAAGGCC
GCCGCGGCCG CCAACGTCGC CCCGGCCCAG CAGGCCGACG CCGAGCCCGT CCACGGCCAG
GCGGTGGTGT CGGCCCACGC GGCCACCACC GGCGAAGGCG ACGCCCAGCA CAATGACAAG
TCCGGCTCGC AGGGGGCCGC CCTGGCGGAC GCCGCGCCGA TCGTCGTCGC CGACGCCCCG
GCCGCCCAGA CCTTCGTCGC CCCGGGCACC ACGGCCGCCC CGACCCCGAT CGTCGCCGCG
CAACTGCCGC CCAAGGCCGG CCCCGAGACG GTCAGCCACC TGACGACCGA AATCGCCAGC
AAGGCTGTGG TCGGCAAGAC CAGCCGCTTC GACGTCGTCC TGCAGCCGGA AGGCCTGGGC
CGCGTCGACG TGCGCATCGA GATCGCCAAG GACGGCAAGC TGACCGCCGC CCTGAACTTC
GACAACCCGG CCGCCGCCGC CGACATGCGC GGCAAGTCGG GCGAGTTGCG CCAGGCCCTG
GCCCAGGCCG GCTTCAACGT CGCCGACAAC GCGCTCAGCT TCGACAGCCA GGGCCAGAAC
GGCGGCCAGA ACCAGAACGC CTTCTTCAAC TTCCAGGGCG GCGAGAACGG CCGTCAGGCC
TTCCAGGGCC GCGCCTTCCA GTCGGCGCTG GCCGAAGACC TCCCCACCCT GTCCCCGTCC
GCCCTGCTGC CCGGCCTGCG TGTCGCCGAG CGCAGCGGCG TCGACGTGAA AATCTAG
 
Protein sequence
MTAIAAPASV LPAAPAPAGG YDKDAAEGFG ALLAQAGSAN ASNDKVSTRD AKAQDGKASN 
AKAADAKAAD RADRASDAKA DKADKTQSAA VVADKPADAK SADAARASDK TADAKTQDGK
DQAAAETGVA KDAQAADTAS GQADAAAAQA AAAALIAAMQ APAAPAVQPV AAAETATAAQ
AAQAAATDAA AAFAPLVARN GAPKQDTAVP AAAPATAADT AGLDLPETPA LSAADLAAAA
AAAQKAAAQP AQAAKTNVAA ATPVAAPQAA PPTAEAVNAA LAAAAPTSDV AEAPVAAEPA
TAAQVIASQA ASVTGKIVKA AAAANVAPAQ QADAEPVHGQ AVVSAHAATT GEGDAQHNDK
SGSQGAALAD AAPIVVADAP AAQTFVAPGT TAAPTPIVAA QLPPKAGPET VSHLTTEIAS
KAVVGKTSRF DVVLQPEGLG RVDVRIEIAK DGKLTAALNF DNPAAAADMR GKSGELRQAL
AQAGFNVADN ALSFDSQGQN GGQNQNAFFN FQGGENGRQA FQGRAFQSAL AEDLPTLSPS
ALLPGLRVAE RSGVDVKI
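
The sequences above are printed in blocks of 10 characters separated by spaces, so they must be joined before any calculation such as the GC content reported for this gene. The following is a minimal sketch of that step, not the method the database itself uses. The function names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a space- and newline-separated sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGACCGCGA TCGCCGCTCC CGCCTCCGTC CTTCCCGCAG CCCCGGCCCC GGCCGGCGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases in the sample line
print(f"{gc_fraction(seq):.0%}")   # GC content of the sample line only
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.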