Gene Information |
Locus tag | TM1040_3354 |
Symbol | |
ID | 4075253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 366235 |
End bp | 367584 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638004862 |
Product | di-haem cytochrome c peroxidase |
Protein accession | YP_611588 |
Protein GI | 99078330 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
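
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(366235, 367584)
print(length, protein_length_aa(length))  # prints: 1350 449
```

Under those assumptions the result agrees with the Gene Length (1350 bp) and Protein Length (449 aa) fields.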
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.475738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.135039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCGTTGCA AACTTCATCA TCTTGTTTCA CTCCTGTTGA CGGTATTTAT CGCCGTCAGC GCTCCTCATT GGGCCGCAGC TGCACCTGAT GTCCCACCGG CTTTGACACC CGATGATTTC ATGGCGACCG ATCCCGCCAA AGCCGAACTC GGTCGCCTGC TATTTTATGA CAAAATACTG TCAGGGAACA GAAATATCAG CTGCGGCACC TGTCATCATC CGCGTCACGG AACTTCTGAT GGGCTCTCTT TGGGGCTGGG CGAGGGGGGG CATGGATTGG GGTCGGAACG CCAAAGCGAT ACGGGACAAA ACCGGGTGCA AAAACGTGTG CCCCGGAACG CGCCAGCGCT TTGGAACCTC GGGGCGCACG CGCTTGAGCA TGTGATGCAT GACGGCCGCG TGAGCAGGGA TCCGATCTAT GCAAATGGGT TTAATACGCC CGCAGAGGAG TGGTTGCCAC AGGGTCTTCA GTCCTTGCTT GCCGCCCAAG CGCTGTTCCC GATGACCTCG AGCATCGAGA TGGCGGGCAA CGTAGGCGAA AACGAGGTCA TCGGCGCCGC CCGCGATCGG ATCGATGCGG CCTGGCCCAT TCTGGCCAAA CGGGTGCGGG TCATCCCGGA ATACAGCGAG CGTTTTATCA GCGCGTTTGA GGACATCCGC ACTGCTCCGG ACGTCAATAT CACACATGTT GCCGAGGCTT TGGCGCATTT CATGGTGCAG GATTTCACCT CTTATGACAG CCCGTTCGAT GCCTATCTTT CCGGCGACAA TACCGCCTTG TCCGCAAGTC AGAAGCGTGG CGCAGATCTG TTTTTCGGGA CCGCTGGCTG CAGCGGTTGC CACGCCGGAT CCTTGCTGAC GGATCAGGGG TTTCATGCAC TGGGGCTCCC AGCCTTTGGG CCGGGGCGCA CGCGAAAGTT TGATCCCTAT GCACGCGATG TGGGGCGCGC GGGCGAGAGC GATGCTCTTG AAGATTTCTA CCGCTTTCGT ACGCCGATGT TGCGCAATGT CGCGCTGACG GCCCCTTACG GGCATAACGG TGCCTTTCCG ACGCTGGAGT TGATGGTCCG GCATCATCTG GATCCCGTCA GATCGCGTGA GGCTTGGTCG CCTGAGCTTC TGGTGTTGCC TGATGTGACG TGGCTGCGCG AGATCGATTT TGTGATCCGA CAAGATAGAC TCGAAATGGC GCGACAGGCC GCAGCGCGCG ATATAGATAT CCCACCGCGC ACCGATGCGG AAGTCGCTGA TCTGGTTGCA TTTTTGCACA GTTTGACCGG TGCGCGGGCC GAGGCACAGA CATCTGAAAT TCCCAAGACC GTTCCAAGCG GCCTTCCAGT AGACAGGTAA
Protein sequence | MRCKLHHLVS LLLTVFIAVS APHWAAAAPD VPPALTPDDF MATDPAKAEL GRLLFYDKIL SGNRNISCGT CHHPRHGTSD GLSLGLGEGG HGLGSERQSD TGQNRVQKRV PRNAPALWNL GAHALEHVMH DGRVSRDPIY ANGFNTPAEE WLPQGLQSLL AAQALFPMTS SIEMAGNVGE NEVIGAARDR IDAAWPILAK RVRVIPEYSE RFISAFEDIR TAPDVNITHV AEALAHFMVQ DFTSYDSPFD AYLSGDNTAL SASQKRGADL FFGTAGCSGC HAGSLLTDQG FHALGLPAFG PGRTRKFDPY ARDVGRAGES DALEDFYRFR TPMLRNVALT APYGHNGAFP TLELMVRHHL DPVRSREAWS PELLVLPDVT WLREIDFVIR QDRLEMARQA AARDIDIPPR TDAEVADLVA FLHSLTGARA EAQTSEIPKT VPSGLPVDR
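
The two sequence fields can be related to each other with a minimal sketch. It assumes the gene sequence is given in coding orientation (it begins with GTG and ends with TAA, although the gene lies on the minus strand) and contains only A, C, G and T. The helper names are hypothetical. The codon mapping is the one shared by NCBI translation tables 1 and 11; the initial codon is reported as M, as is conventional for bacterial start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS in coding orientation up to the first stop codon.

    The first codon is emitted as M without checking that it is a valid
    start codon; ambiguous bases (e.g. N) raise KeyError in this sketch.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate_cds("GTGCGTTGCA AACTTCATCA"))  # prints: MRCKLH
```

The six residues produced from the first two blocks match the start of the protein sequence field. Passing the full gene sequence to `gc_fraction` and `translate_cds` would allow the GC content and protein sequence fields to be checked in the same way.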