Gene Caul_2642 details


Gene Information

Locus tag: Caul_2642
Symbol:
ID: 5900097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2867577
End bp: 2868806
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 67%
IMG OID: 641563133
Product: di-haem cytochrome c peroxidase
Protein accession: YP_001684267
Protein GI: 167646604
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
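
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes 1-based, inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2867577, 2868806)
residues = protein_length_aa(length)
```

With the record's values, `length` agrees with the listed gene length of 1230 bp and `residues` agrees with the listed protein length of 409 aa.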


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGGT CCTGGCCCCT CATCGGCGCG ACCCTGGTGA CGGCGTTGGC GCTTGTCGCG 
GCGGCGGAGC CGACTTGGTC CGCCGCGGAT CTGGCGGTGC TGCGCGGCCT CTGGATCGGG
GCGTTGGGGC CGCCTCCCCT CGATCCTTCG AACAAGGTCG CCGACAATGC CGCCGCGGCG
GACCTTGGTC GGGCGCTCTT TTCCGACTCG CGCCTGAGCG CCAACGGCCA GGTCTCGTGC
GCCAGTTGCC ATCAGCCCGA TCACGCCTTC ACCGACGCCC TGCCTACCGG ACATGGCGTC
GGAACGGGAA ACCGCAGGAC CATGCCGATC GCCCCGGCCG TCTATTCGCC ATGGCAATTC
TGGGATGGCC GCGCCGACAG CCTGTGGTCC CAGGCCCTGG GGCCAATGGA GAATCCGGTC
GAGCATGGCT TTACCCGTAC GGAGGTCGCG CGGGTCCTCG CCGCGCACTA TCGCGACCCC
TACGAACAAC TCTTCGGGTC GATGCCCGAC ATGGCGGACC ACGACCGATT TCCGATCCGA
GCCGCTCCGG GCGGTGATCC GGCGGCCAGG GCGGCTTGGG CGACCATGAC GGACGCTGAC
CGGATCCTGA TCAACCGCGT CTACGCCAAT TTCAGCAAGG CCGTCGCCGC CTATGAGCGA
ACCTTGAAGG TTCGTCCCAG CCGGTTCGAC GACTATCTGG CTGGCGTCTT CGGCGCGCCG
GGCCGGCATG CCCGACTGTC GCCCGATGAA GTCGCCGGGC TGCGGCTGTT CATCGGCAAG
GGCCAATGCT CGAACTGCCA TAATGGCCCC CTGCTCTCCA ATCATGGCTT CGCCAATACC
GGCGTTCCTG CTCGTAAGGA CTTGCCGCGT GATCTTGGAC GAGCCGCGGG TGTACGCGCG
GCGACCGACG ATCCTTTCAA TTGCAGAGGC GTCTACAGCG ACGCCCCCAA GGGCTGCGAG
GAACTGGCGT TCGCCGTCGT CGATAGTCCC GCTCAGGTTC GGGCCTACAA GGTCCCCTCG
CTGCGCGGAG TCGGCCAGCG GGCGCCCTAT ATGCACGCTG GACAGTTCTC ATCCCTGGAG
CAGGTCGTCG ACCACTACAG CCGCGCGCCG CCCGCGCCCA GCGGGACGTC CGAGATCAAA
TCGCTCCGGC TTGGCGCCGA CGAACGACGA CAGATCGTCG CCTTCCTTCG CACGCTCGAT
GAACAACCCC CTCCCTCTCC CAAACCCTGA
 
Protein sequence
MTRSWPLIGA TLVTALALVA AAEPTWSAAD LAVLRGLWIG ALGPPPLDPS NKVADNAAAA 
DLGRALFSDS RLSANGQVSC ASCHQPDHAF TDALPTGHGV GTGNRRTMPI APAVYSPWQF
WDGRADSLWS QALGPMENPV EHGFTRTEVA RVLAAHYRDP YEQLFGSMPD MADHDRFPIR
AAPGGDPAAR AAWATMTDAD RILINRVYAN FSKAVAAYER TLKVRPSRFD DYLAGVFGAP
GRHARLSPDE VAGLRLFIGK GQCSNCHNGP LLSNHGFANT GVPARKDLPR DLGRAAGVRA
ATDDPFNCRG VYSDAPKGCE ELAFAVVDSP AQVRAYKVPS LRGVGQRAPY MHAGQFSSLE
QVVDHYSRAP PAPSGTSEIK SLRLGADERR QIVAFLRTLD EQPPPSPKP