Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1422 |
Symbol | |
ID | 4078052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1518520 |
End bp | 1519656 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006732 |
Product | di-haem cytochrome c peroxidase |
Protein accession | YP_613417 |
Protein GI | 99081263 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0511364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0579144 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAAAC TCTTACGCCA TGCTTCCTGC CTTATACTAT GTGCCGCGCC ATTGGTTGCA GCCCCCTATC AGACAGCTGA GGATCTTGGG GAGGCGCTGT TCTTTGAGAC GGATCTGTCT TTGAACCGCA GTCAGTCCTG TGCCACATGC CATGATCCCG CTGCGGGCTT CGTCGATCCC AGAGGTGACG CCGAAGGCGC GTTTTCCCGT GGAGACGATG GCGTGTCGCT TGGGGGGCGT CACGCACCCA CCGCAGCCTA TGCGTCTTTG TCGCCTGCCT TTCATCAGAA TGACGCGGGC GAGTGGGTGG GGGGCCAGTT CTGGGATGGC CGTGCCGCTG ATTTGGCAGA GCAGGCCGGA GGGCCGATCC TGAACCCCAT CGAACTGGGG CTTTCAAGCC AAGCGGAGGC CATCGCTCGA CTGGCCGCAC ATCCAGAGTA TGTCGAGAGT TTTCAGACGC TCTATAGTGT CGACATCACC ACAGATACGG AGGCGGGGTT CAAGGCGCTG ACCCAAGCGC TTGCCGCCTT TGAGAGCAGT GACGTCTTTC AGCCTTTCGA CAGCAAGTAC GACCGCTTCT TGCGCGGGGA TGTCGAGTTG AGCAGCGAAG AGGAGTTAGG CCGTCTCTTG TTCTTTTCGG AGCAGTTCAC CAACTGCAAC CAATGCCACC AGTTGCGCCG CAGCGCGATT GATCCCGCAG AGCCGTTTAG CGATTTTCGC TATCACAACA TCGGTGTTCC CGCGAATGTG GCGGGTCGCC TTGAAAACGG GGTGGCAGAG GATTGGGTGG ACGCCGGGCT TTATGAGAAC CCCAGCGTTC TGGATCGCGC GGAGCGCGGC AAGTTTAAGA CACCCACGCT GCGAAATGTG GCCGTGACGG GGCCCTATAT GCACAATGGG GTGTTTCAGG ATCTGCGCAC CGTTGTGCTG TTTTACAATC GCTACAACAG CAAGGCCGCA TCTGCACAGA TCAACCCCGA GACCGGTGCG CCCTGGGGAG AGATCCCGGT GCCGGACACA TTGGCGCAAA AGGAGCTGAC CCATGGGCCG GCACTGGATG ATCGTCGGGT GGATGCGCTG GTTGCTTTTC TCAAAACGCT GACCGATTCC CGTTACGAGC CCCTCCTGCA GGAGTAG
|
Protein sequence | MFKLLRHASC LILCAAPLVA APYQTAEDLG EALFFETDLS LNRSQSCATC HDPAAGFVDP RGDAEGAFSR GDDGVSLGGR HAPTAAYASL SPAFHQNDAG EWVGGQFWDG RAADLAEQAG GPILNPIELG LSSQAEAIAR LAAHPEYVES FQTLYSVDIT TDTEAGFKAL TQALAAFESS DVFQPFDSKY DRFLRGDVEL SSEEELGRLL FFSEQFTNCN QCHQLRRSAI DPAEPFSDFR YHNIGVPANV AGRLENGVAE DWVDAGLYEN PSVLDRAERG KFKTPTLRNV AVTGPYMHNG VFQDLRTVVL FYNRYNSKAA SAQINPETGA PWGEIPVPDT LAQKELTHGP ALDDRRVDAL VAFLKTLTDS RYEPLLQE
|
| |