Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0431 |
Symbol | |
ID | 6132101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 514406 |
End bp | 515644 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641640756 |
Product | Dyp-type peroxidase family protein |
Protein accession | YP_001767432 |
Protein GI | 170738777 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01412] Tat-translocated enzyme [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00775198 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGACG CTCCCTCCCG CCGCAGCCTC CTGCGCGCCC TCGCCGCGGG CACCGGCGCC GCCGCCCTGG CGCCGGTCCC CGGCCGGGCC GCCTCCCTGG AGGAGGCGGA CGGCACCGCC GACCGCCAGC CCTTCCGGGG CCTCCACCAG GCCGGCATCC TGACGCCCCA GCCAGCCTCC GCGCTCTTCG TGGCCTTCGA CGTCCTGGCG CAGGATCGCG GCGAGCTCGC GCGCCTGTTG CGCATCCTGA CCGAGCGCTT CGCCTACCTG ACCGAGGGCG GGCCGGTGCC GGGCCTCGAT CCGCGCCTTC CGCCGGCGGA TTCGGGCATC CTCGGCCCGG TCGTGCGCCC CGACAACCTC ACGGCGACGC TCGCGGTCGG TGCCAGCCTG TTCGACGAGC GCTACGGCCT CGCCGCGGCG AGGCCGCGCC GCCTGCGCCC GATGGACCGC TTCCCGAACG ATTCCCTCGA CGCGAAGCTC TGCCACGGCG ACCTGCTGCT GCAGCTCTGC GCCAACACGG CCGACACCGC CATCCACGCC CTCCGGGACG TCATCCGCAC GACCCCGGGC CTCCTGTCCC TGCGCTGGAA GCGCGAGGGC TTCCTGCCGC CCCAGGTCGT CAAGCGCGCC GGGCAGGACA CGGTGCGCAA CATGCTGGGC TTCAAGGACG GCACCGCCAA CCTCGACACC CGCGACGCGG AGCTGATGCG TCGCCAGGTC TGGGTCGGGC CGGAGGACGG CGAGCCGGCC TGGGCGCGCA ACGGCAGCTA CCAGGTGATC CGCCTCGTGC GGAACCTCGT CGAGCGCTGG GACCGCACGC CGCTCGGCGA GCAGGAGCGC ATCATGGGGC GCCACAAGGA CACCGGCGCT CCGCTCGGCG AGACCCGCGA GCACGCGCTG CCGGACTACG CCTCGGACCC GAAGGGCGAG CGCACCCCCC TCGACGCGCA TATCCGGCTC GCCAACCCGC GCAAGGCCGA GACGGCGCGC AACCTGATCC TGCGCCGCGG CTACAATTAC TCGGCGGGGA TCACGGCCTC GGGCCAGCTC GACATGGGGC TGATCTTCGT GTCGTTCCAG CGCGACCTCG ATGCCGGCTT CGTGGCCGTG CAGAACCGCC TCAACGGCGA GCCGCTGGAG GAGTACATCC GGCCGGTCGG GGGCGGCTAC TTCTTCGCCC TGCCGGGCGT GACGGCGCCG GACGGGTACC TGGGCGAGGA ACTGCTGGTC GGGGCGTGA
|
Protein sequence | MTDAPSRRSL LRALAAGTGA AALAPVPGRA ASLEEADGTA DRQPFRGLHQ AGILTPQPAS ALFVAFDVLA QDRGELARLL RILTERFAYL TEGGPVPGLD PRLPPADSGI LGPVVRPDNL TATLAVGASL FDERYGLAAA RPRRLRPMDR FPNDSLDAKL CHGDLLLQLC ANTADTAIHA LRDVIRTTPG LLSLRWKREG FLPPQVVKRA GQDTVRNMLG FKDGTANLDT RDAELMRRQV WVGPEDGEPA WARNGSYQVI RLVRNLVERW DRTPLGEQER IMGRHKDTGA PLGETREHAL PDYASDPKGE RTPLDAHIRL ANPRKAETAR NLILRRGYNY SAGITASGQL DMGLIFVSFQ RDLDAGFVAV QNRLNGEPLE EYIRPVGGGY FFALPGVTAP DGYLGEELLV GA
|
| |