Gene TM1040_3354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3354 
Symbol 
ID4075253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp366235 
End bp367584 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content59% 
IMG OID638004862 
Productdi-haem cytochrome c peroxidase 
Protein accessionYP_611588 
Protein GI99078330 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.475738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.135039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTTGCA AACTTCATCA TCTTGTTTCA CTCCTGTTGA CGGTATTTAT CGCCGTCAGC 
GCTCCTCATT GGGCCGCAGC TGCACCTGAT GTCCCACCGG CTTTGACACC CGATGATTTC
ATGGCGACCG ATCCCGCCAA AGCCGAACTC GGTCGCCTGC TATTTTATGA CAAAATACTG
TCAGGGAACA GAAATATCAG CTGCGGCACC TGTCATCATC CGCGTCACGG AACTTCTGAT
GGGCTCTCTT TGGGGCTGGG CGAGGGGGGG CATGGATTGG GGTCGGAACG CCAAAGCGAT
ACGGGACAAA ACCGGGTGCA AAAACGTGTG CCCCGGAACG CGCCAGCGCT TTGGAACCTC
GGGGCGCACG CGCTTGAGCA TGTGATGCAT GACGGCCGCG TGAGCAGGGA TCCGATCTAT
GCAAATGGGT TTAATACGCC CGCAGAGGAG TGGTTGCCAC AGGGTCTTCA GTCCTTGCTT
GCCGCCCAAG CGCTGTTCCC GATGACCTCG AGCATCGAGA TGGCGGGCAA CGTAGGCGAA
AACGAGGTCA TCGGCGCCGC CCGCGATCGG ATCGATGCGG CCTGGCCCAT TCTGGCCAAA
CGGGTGCGGG TCATCCCGGA ATACAGCGAG CGTTTTATCA GCGCGTTTGA GGACATCCGC
ACTGCTCCGG ACGTCAATAT CACACATGTT GCCGAGGCTT TGGCGCATTT CATGGTGCAG
GATTTCACCT CTTATGACAG CCCGTTCGAT GCCTATCTTT CCGGCGACAA TACCGCCTTG
TCCGCAAGTC AGAAGCGTGG CGCAGATCTG TTTTTCGGGA CCGCTGGCTG CAGCGGTTGC
CACGCCGGAT CCTTGCTGAC GGATCAGGGG TTTCATGCAC TGGGGCTCCC AGCCTTTGGG
CCGGGGCGCA CGCGAAAGTT TGATCCCTAT GCACGCGATG TGGGGCGCGC GGGCGAGAGC
GATGCTCTTG AAGATTTCTA CCGCTTTCGT ACGCCGATGT TGCGCAATGT CGCGCTGACG
GCCCCTTACG GGCATAACGG TGCCTTTCCG ACGCTGGAGT TGATGGTCCG GCATCATCTG
GATCCCGTCA GATCGCGTGA GGCTTGGTCG CCTGAGCTTC TGGTGTTGCC TGATGTGACG
TGGCTGCGCG AGATCGATTT TGTGATCCGA CAAGATAGAC TCGAAATGGC GCGACAGGCC
GCAGCGCGCG ATATAGATAT CCCACCGCGC ACCGATGCGG AAGTCGCTGA TCTGGTTGCA
TTTTTGCACA GTTTGACCGG TGCGCGGGCC GAGGCACAGA CATCTGAAAT TCCCAAGACC
GTTCCAAGCG GCCTTCCAGT AGACAGGTAA
 
Protein sequence
MRCKLHHLVS LLLTVFIAVS APHWAAAAPD VPPALTPDDF MATDPAKAEL GRLLFYDKIL 
SGNRNISCGT CHHPRHGTSD GLSLGLGEGG HGLGSERQSD TGQNRVQKRV PRNAPALWNL
GAHALEHVMH DGRVSRDPIY ANGFNTPAEE WLPQGLQSLL AAQALFPMTS SIEMAGNVGE
NEVIGAARDR IDAAWPILAK RVRVIPEYSE RFISAFEDIR TAPDVNITHV AEALAHFMVQ
DFTSYDSPFD AYLSGDNTAL SASQKRGADL FFGTAGCSGC HAGSLLTDQG FHALGLPAFG
PGRTRKFDPY ARDVGRAGES DALEDFYRFR TPMLRNVALT APYGHNGAFP TLELMVRHHL
DPVRSREAWS PELLVLPDVT WLREIDFVIR QDRLEMARQA AARDIDIPPR TDAEVADLVA
FLHSLTGARA EAQTSEIPKT VPSGLPVDR