Gene Caul_3297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3297 
Symbol 
ID5900752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3569529 
End bp3570611 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content69% 
IMG OID641563803 
Producthypothetical protein 
Protein accessionYP_001684922 
Protein GI167647259 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.747205 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.456251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CGGTTCAGAC GGCGACAACC CCCCATGGCG ACCCACCCGC CGCGTCCACG 
GCGCGGGGGC GCGCCGATCG CAAGGAGCTG GCGTCCTTCG CGGTCGAGCG CACGCGCATG
CCGATGGTCA TCGCCGACGC GCGGCACGGG GACCATCCCA TCGTCCTGGC CAACCAGGCC
TTCCTGGATC TGACGGGCTA TGGCGCGGAG GAAGTGGTCG GACGCAATTG CCGGTTTCTC
CAAGGGGCGG GAACCTCGGA CGCGGCCATA GCCAAGATCC GCGCTGCGGT GGCGGCGGGC
CAGGAGTGCG ACGTCGAAAT CCTCAACTAT CGAAAGGACG GATCAGACTT CTGGAACCAG
CTGCATCTGA GCCCTGTCCA CGACGAGGCC GGTCAGCTTC TCTACATCTT CGCCTCCCAG
CGCGATGTCA GCGACTTCCG CAAGGTCCGG GACCTCGAGG CCGCAGAGCG CCGTCTGCTG
AGGGAGGTCG ACCATCGCGC GATGAACGCC TTGGCCATCG TCGAAGGCAT CGTCCGACTC
AGCTGCGCCG ACGACCCCTC ACAGTACGCC GCCGCCATCC AGCGCCGGGT GCAGGCCCTG
GCCAGCGCCC ATGCCCTGCT TGGCCGCCAG GCTTGGCGCG ATGTTCAGCT TGAGGAGTTG
CTGCGCACAC AGGTCGAGGG CTACGCGGGC AGGCGCATCG CGTTCGAGGG TCCGCCCATC
GAAATCGGCG CCGCCCTGGT TCAGCCCCTG GCGCTCGTCC TGCACGAGAT GGCGGCCAAT
GCGAGCCGTC ACGGGGCGCT GTCGGCGCCG GACGGCGAGA TCCGCCTGGG ATGGTCGCGA
GGTCCTGGCG AAGGCCTTGT TCTGACCTGG ACGGAGATCG GCGGCCCCCC GCCCGCCGCG
ATCCGCCCGC GCGGCTTTGG GGCGACGATG ATCTCGGCGA TCGTCGAACG GCAACTTGGG
GGCCAGGCGT TGCTGGCGTG GCGACCCGAA GGCCTGGCCG CGCGTTTTGT GTTGCCGCGC
CGTGATCGCA TTGAGAACTT CCGGCTGTCG GCGGCCACCG AGGACGCCGC GTCTCAAGCC
TGA
 
Protein sequence
MSDAVQTATT PHGDPPAAST ARGRADRKEL ASFAVERTRM PMVIADARHG DHPIVLANQA 
FLDLTGYGAE EVVGRNCRFL QGAGTSDAAI AKIRAAVAAG QECDVEILNY RKDGSDFWNQ
LHLSPVHDEA GQLLYIFASQ RDVSDFRKVR DLEAAERRLL REVDHRAMNA LAIVEGIVRL
SCADDPSQYA AAIQRRVQAL ASAHALLGRQ AWRDVQLEEL LRTQVEGYAG RRIAFEGPPI
EIGAALVQPL ALVLHEMAAN ASRHGALSAP DGEIRLGWSR GPGEGLVLTW TEIGGPPPAA
IRPRGFGATM ISAIVERQLG GQALLAWRPE GLAARFVLPR RDRIENFRLS AATEDAASQA