Gene Caul_0211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0211 
Symbol 
ID5897485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp223940 
End bp226045 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content66% 
IMG OID641560695 
Productcatalase 
Protein accessionYP_001681846 
Protein GI167644183 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0753] Catalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.247073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAATC AGGACGACTT CACGACACAA GCCGGCCACG GCGGCGAAAC GCACCAGCGT 
GTGCCGGAAA AAGGTCCGCA TACGGCGGAA GGTCACCTGA CCACCAACCA GGGCATCCGA
GTTTCGGACA ACCAGAACCA GCTGAAGAGC GGCGAGCGCG GCCCGGTGCT GCTGGAGGAT
TTCGTCCTCC GCGAAAAGAT CTTCCATTTC GACCACGAGC GGATCCCCGA GCGGATCGTC
CATGCCCGCG GTTCGGCCGC CCACGGTTTC TTCGAGCTCT ACGAGTCGCT GTCGGACCTG
ACCAAGGCTG ACATCTTCCA GCGGGCCGGC GAAAAGACGC CGTTGTTCAC GCGGTTCTCG
ACCGTGGCCG GCGGGGCGGG CAGCGTCGAT ACGCCTCGCG ACGTGCGCGG TTTCGCGGTG
AAGTTCTACA CCCAGGAGGG CAACTGGGAC CTCGTCGGCA ACAATATCCC GGTGTTCTTC
ATCCAGGACG CCATGAAGTT CCCCGATCTG GTCCACTCGG TGAAGATGGA GGCCGATCGC
GGCTATCCGC AGGCCGCCAG CGCCCATGAC ACCTTCTGGG ACTTCATCAG CCTGATGCCC
GAGAGCACCC ACATGATCAT GTGGGCGATG AGCGACCGCA CCATTCCCCG AAGCCTGCGC
ACCATGGAAG GCTTCGGCGT CCACACATTC CGGCTGGTCA ACGCGGCCGG CAAGTCGACC
TTCGTCAAGT TCCACTGGAA GCCCAAGCAG GGCGTGGCCT CGACGATCTG GGACGAGGCG
GTGAAGATCG CGGGCGCCGA TCCGGACTTC CAGCGCCGCG ACCTGTTCGA GGCCATCGCG
CGCGGCGACT TCCCCGAATG GGAACTGGGC ATCCAGGCCT TCGACCAGGC CTTCGCCGAC
AGCCTGCCCT TCGACGTGCT GGATCCCACC AAGATCATTC CGGAAGAGGT GCTGCCGGTC
CGCATCGTCG GCCGGATGGT GCTGGACCGT TATCCGGACA ACTTCTTCGC CGAGACCGAA
CAGGTCGCCT TCTGCCCGGC CAATGTGCCG CCGGGCATCG ACTTCACCAA TGACCCGCTG
CTGCAGGGTC GGCTGTTTTC CTATCTCGAC ACCCAGAAGA GCCGCCTGGG CACGGCCAAT
TTCCACCAGC TGCCGATCAA TGCGCCCAAA TGTCCGGTGA TGAACTTCCA GCGCGACGGC
CAGATGCAGA TGGCCATCCC CAAGGGCCGG GCCAATTACG AGCCCAACAG CCTGGCCGAG
ACGGCGGGCG AGATTGGCGG GCCGCGCGAA TGCCCAATGA CCGGCTTCAC GACCTTCCCG
TCAGCCGAAG TGGCCAATGA GCAGGGCGAC AAGCTGCGGA TCCGTCCAGA GAGCTTCGCC
GACCATTACA GCCAGGCGCG CCTGTTCTTC CGCTCGCTGG ACCCCCACGA GCAGGCCCAC
CTGGCTTCGG CCATCGTCTT CGAACTGTCG AAGGTGGGGA TTGAGGCGGT CCGGACACGG
ATGATGGGGA ACCTTGTCAA TGTCGATCCG GACCTGGCCA AGCGGGTCGG GGCGGGCCTG
AACATGCCCG TGCCCAAGGC CTCGAAATCG GCCGTCCCGG TGCAGGATCT GGAGCCGTCG
CCGGCCCTGC GGATCGTCAA CGGTCCTCGC GCGCCCAAGG ACATCAAGGG CCACGTCATC
GGCATACTGG TGGCCGACGG CTCCGACGCG GCGGCGGTCG ACACGCTGAA GGCGGCGATC
GGCAAGGCCG GCGCTGTCGC CAAGGTCATC GCGCCGAAGA TCGGCGGGGC CAAGGGCGCG
GACGGGACGC TGATCCCGGC CGACGGCCAG TTGGCCGGCA CGCCGTCAGT GACGGTCGAC
GCCATCGCGC TCGTCTTGTC CGACACCGGC TGCGCGGCGT TGCTGAAGGA GGCCGCCGCC
GTCCAGTTCG TCATGGACGC CTTCGGCCAC CTGAAGGCCA TCGGGACCTC GGACGCCGCC
AAGCCGCTGC TCGACAAGGC CGGCGTCGAG CCGGACGAGG GCGTGGTGGG CCTGGGCGCC
GACTTCATCG CGGCGGCGGC CAAGCGGTTC TGGGATCGCG AGCCGAAGGT TCGGATGCTG
GCTTAA
 
Protein sequence
MANQDDFTTQ AGHGGETHQR VPEKGPHTAE GHLTTNQGIR VSDNQNQLKS GERGPVLLED 
FVLREKIFHF DHERIPERIV HARGSAAHGF FELYESLSDL TKADIFQRAG EKTPLFTRFS
TVAGGAGSVD TPRDVRGFAV KFYTQEGNWD LVGNNIPVFF IQDAMKFPDL VHSVKMEADR
GYPQAASAHD TFWDFISLMP ESTHMIMWAM SDRTIPRSLR TMEGFGVHTF RLVNAAGKST
FVKFHWKPKQ GVASTIWDEA VKIAGADPDF QRRDLFEAIA RGDFPEWELG IQAFDQAFAD
SLPFDVLDPT KIIPEEVLPV RIVGRMVLDR YPDNFFAETE QVAFCPANVP PGIDFTNDPL
LQGRLFSYLD TQKSRLGTAN FHQLPINAPK CPVMNFQRDG QMQMAIPKGR ANYEPNSLAE
TAGEIGGPRE CPMTGFTTFP SAEVANEQGD KLRIRPESFA DHYSQARLFF RSLDPHEQAH
LASAIVFELS KVGIEAVRTR MMGNLVNVDP DLAKRVGAGL NMPVPKASKS AVPVQDLEPS
PALRIVNGPR APKDIKGHVI GILVADGSDA AAVDTLKAAI GKAGAVAKVI APKIGGAKGA
DGTLIPADGQ LAGTPSVTVD AIALVLSDTG CAALLKEAAA VQFVMDAFGH LKAIGTSDAA
KPLLDKAGVE PDEGVVGLGA DFIAAAAKRF WDREPKVRML A