Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0143 |
Symbol | |
ID | 4069728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 152107 |
End bp | 153009 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637982143 |
Product | Dyp-type peroxidase |
Protein accession | YP_589222 |
Protein GI | 94967174 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.641407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00262856 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTACACGC CGCAAGCTGG CATCTTCGCG CTCGGAACGT CGTCCCACGC CTACCTCGAG TTCGACGCTC TCGACGCAAC CAAGCATGAC GAATTTGCCG CGAAGCTCGC CGCCATCCAC GAACCACGCA CCACCACCGG TGGCGTGAAC TTCGTCATAG GCTTCCGCCC AGAGCTGTGG CGCAAGTTGA CTCCCGGTGA TATCCCCACC GATGTGAAGG GCTTCAACGC TCCCATCAAC GGCATCGAAG GCTTCGCAAT GCCGGCCACC CAACACGACG CCGTGGTCTG GCTCTCCGGC AGTGCCTACG ACGTCCTCTT CGACATGGCG CGGGACGTCA TCCGCGACCT CGACGGCCTC GCCAAGCTCG CCGACGAAAC CGCAAGCTGG CCCTATCGCC ACGTCCGCGA CCTCACCGGC TTCATCGACG GCTCCGAGAA TCCTTCGCTC CTCGACGCAC CCGCCGCCGC TCTCATCCCC GAAGGCACTC CCGGCGCCGC CGGATCCATC CTGTTGCTGC AAAAATGGGT GCACAAGTCC GCCGAATGGG AAGCCCTGCC CGTCGAGCGC CAGGAAAAGA TCATGGGCCG CCTCAAGCTC GACAGCACCG AAATTGAAGA CAAGCCCGAA GACTCGCACG TCGCCCGCAC CGATCAGGAC GACTTTGGCA AGGTCTTCCG CCGCAACATG CCCTACGGAG GCGTCCAGGA TCACGGCACC ATGTTTGTCG GCTTTACCTG CGAGCAACAG CGCCTCGCAA AAATGCTCGA CAGCATGGCC GGCCTCGTCA ACGGCACCCG CGACGCCCTC ACCCGCTTCA CCACGCCACT CACCGGCTCG TACTACTTCG TGCCTTCCGT CGAGAGTCTC CGCCGCTTGC GGCCGGACGA AGCTGCGAGC TGA
|
Protein sequence | MYTPQAGIFA LGTSSHAYLE FDALDATKHD EFAAKLAAIH EPRTTTGGVN FVIGFRPELW RKLTPGDIPT DVKGFNAPIN GIEGFAMPAT QHDAVVWLSG SAYDVLFDMA RDVIRDLDGL AKLADETASW PYRHVRDLTG FIDGSENPSL LDAPAAALIP EGTPGAAGSI LLLQKWVHKS AEWEALPVER QEKIMGRLKL DSTEIEDKPE DSHVARTDQD DFGKVFRRNM PYGGVQDHGT MFVGFTCEQQ RLAKMLDSMA GLVNGTRDAL TRFTTPLTGS YYFVPSVESL RRLRPDEAAS
|
| |