Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0415 |
Symbol | |
ID | 4068733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 476046 |
End bp | 477974 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982418 |
Product | HAD family phosphatase |
Protein accession | YP_589494 |
Protein GI | 94967446 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3882] Predicted enzyme involved in methoxymalonyl-ACP biosynthesis |
TIGRFAM ID | [TIGR01681] HAD-superfamily phosphatase, subfamily IIIC [TIGR01686] FkbH-like domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.4216 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0403594 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGCAC TGTATTCGGA GTTGCGATGG CTACCGCGGG GGCCGGAAGA CTTTACGCGG CAGCTTAAGA GCGTTGGGGA ATCGGAAGAG ACGCTCGGCA GTGCGTTGCA GAGGCTCGCG AGTTTTGCGC TGGATCTCAA TCAACTGACG AAGCTCGCGA AGGTGGTCGC GAAAGCCCAG AGCGAGAAGC GGTCGCTCGC ACCGTTGATC CCATTCCGCC TGGCTGTGCT CAGCAACTCG ACAACAGATC TGCTTGTTCC GGCGTTGATT GCAAGTGCTG CGCGGCACGG CATCTCGCTT GAGGTGGCAC GGTCTTCCTA CGATCAAGCC GCGCAAGAAG CGTTGAATCC TGATTCGGAG ACGAACCGGT CGAAGCCCGA TGCGGTGCTG TTTGCAATGG ACTATCGCGC GCTGCCTTTG AAGTTGAGCG CCGGAAATGC ACAAGCGGCA GCGGAGACCG TCAACGATGC GATCGGCTAC TTGCAGTCCC TACGCGACGG TGTGAAGGAA CATTCCGGTG GGTTGTGTAT TTTTCAGACT TTTGCACCAC CGGTAGAGTT GCTTTTTGGC AGCATGGATA GATCGGTGCC GGGGACGATG CGGAGCCTGA TCGATGCGAT CAATCGTGAG TTGGCGGATT TCGTCAGAAG CTCCGGCGAT GTTCTGCTGG ATGTTGCTGG ACTTGCGGAG ACCGTTGGTC TCGCTGATTG GCACGATCCG CAGCTTTGGA ACCTTGGCAA GTTCGCGTTC TCTGACCGTT TCATTCCGAT TTACGCGGAT CGTGTGGCGC TGACGATCGC GGCGATTCGG GGGAAGAGCA GAAAAGTACT CGTACTCGAT CTCGATAACA CCATCTGGGG CGGAGTTCTT GGGGACGATG GGATCGAGGG GATCAAGTTG GCGCAAGGCG ACGCCGAGGG CGAAGCATAC CTTGCGGTGC AACGGATGGC TCTGGACTTG CGGAGCCGAG GAATCGTGCT GGCAGTTTCG TCGAAGAACG ATGATGCGAT CGCGCGAGGG CCGTTCGAAC GGCATCCGGA GATGCTGTTG AAGCTGGAGC ATGTCGCGGT ATTCCAGGCG AACTGGAGTG ACAAAGCGAC GAACTTGCAG GCGATTGCGG ATGAGTTGTC GTTGGGGCTG GATGGGTTGG TGCTTCTCGA CGACAATCCG GTGGAACGCG GCATGGTGCG ACAACTACTG CCCCAGGTTG CGGTTCCGGA GCTACCGGAG GAGGCGGCGT TTTACGCGCG CACACTTGCG GCTGCGGGCT ACTTCGAGAC GGTTACGTTT GCGGCCGAGG ATTTGAAGCG GGCGGCGTTT TATCAGGACA ATGCGAAGAG AGCGGAACTC CAAAAGCAGG TCCGGGGCGT CGACGCCTAT CTCGCTTCGC TCGATATGAC GATCACGTTC CAGCCCTTCG ACGCGGCTGG ACGTTCGCGG ATCGTGCAGT TGATCAACAA GTCGAACCAG TACAACGTGA CGACGCGGCG ATACACCGAG CCGGAAGTAA TCGAAGCTGA GGAAGATGCA AAGGTCTTCA CCCTGCAGGT ACGACTCGCG GACAGGTTTG GCGACAACGG CATGATCAGC GTGGTGATTT GCCGGCCAGC GGAGGCGGGG ACGTGGGAGA TCGATACCTG GCTGATGAGT TGCCGTGTGC TCGGTCGTCG GGTGGAGCAC ATGGTGTTGC GCGAGGTGCT GCGTCATGCA AGGGCAACGG GAATCCAGAA GCTGCGTGGC ACCTACCTAG CGACGGAGAA AAACGGCCTG GTTGCAGATC ATTACTCCAA GCTAGGGTTC AATCAGATCG CTAAACACTT GGAGGCGGTC ACGCAATGGG AGTTGTTATT GGAAAAAGCT GATGTGTCTG AAGCACCGAT GAAAGTGATC TCAAGGGGAT TTCCTGCTGC GCAGGAAGTG CTCCCGTAG
|
Protein sequence | MNALYSELRW LPRGPEDFTR QLKSVGESEE TLGSALQRLA SFALDLNQLT KLAKVVAKAQ SEKRSLAPLI PFRLAVLSNS TTDLLVPALI ASAARHGISL EVARSSYDQA AQEALNPDSE TNRSKPDAVL FAMDYRALPL KLSAGNAQAA AETVNDAIGY LQSLRDGVKE HSGGLCIFQT FAPPVELLFG SMDRSVPGTM RSLIDAINRE LADFVRSSGD VLLDVAGLAE TVGLADWHDP QLWNLGKFAF SDRFIPIYAD RVALTIAAIR GKSRKVLVLD LDNTIWGGVL GDDGIEGIKL AQGDAEGEAY LAVQRMALDL RSRGIVLAVS SKNDDAIARG PFERHPEMLL KLEHVAVFQA NWSDKATNLQ AIADELSLGL DGLVLLDDNP VERGMVRQLL PQVAVPELPE EAAFYARTLA AAGYFETVTF AAEDLKRAAF YQDNAKRAEL QKQVRGVDAY LASLDMTITF QPFDAAGRSR IVQLINKSNQ YNVTTRRYTE PEVIEAEEDA KVFTLQVRLA DRFGDNGMIS VVICRPAEAG TWEIDTWLMS CRVLGRRVEH MVLREVLRHA RATGIQKLRG TYLATEKNGL VADHYSKLGF NQIAKHLEAV TQWELLLEKA DVSEAPMKVI SRGFPAAQEV LP
|
| |