Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1697 |
Symbol | |
ID | 4070480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2057264 |
End bp | 2058775 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983705 |
Product | sialate O-acetylesterase |
Protein accession | YP_590772 |
Protein GI | 94968724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.240139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0645618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTTTT CGTTGAAACT TATCGCCATT GTCCTATGCC TTGCCTCGTG CGTCTTCGCA GAGGTCTCGC TACCAGCCGT ACTCTCCGAC GGTGCAGTTT TACAGCGCGG GATGCCTATC CATTTCTTCG GAAGAGCGGC ACCGGGCGAG GCGATCACCA TCTCGTTGAA CAAGCAATCG AAGAGCACGA CGGCGGACTA CGTCGGCCGC TGGCACCTCT ATCTCGCGCC GGAAGCCGCC GGCGGCCCAT ATGACGCTAC GGTGAAGGGA AGCAACACGA TCACCGTGCA CGACGTTCTG ATCGGCGATG TTTGGGTCGC ATCCGGACAG TCCAACATGG AGTATCCGAT GGAAGGCTGG GGAGGAACTC CAAAGCAGAA TCTCGATGAG TTTCCCAAAG CCAACTTTCC GACGTTGCGC TTCTTCCAGA CGCAACATGC TTACTCCGAT CACCCGTTGA TGGACATCCC GAAGCCTGCC AAGTGGGTCG CGTGCACTCC GGAAACCGCG AAGAAGTTCT CGGCGGTCGC GTATTACTTC GCCAAGAACC TCATAGAGAA GGAAAAGGTG CCCGTCGGCA TCATGGAAGC TGATTGGGGT GGCAGCGTCG CCGAAGCTTG GACGAGTCTG GACGGTCTAT CAAGCAAAGC CGGTCTGATG CCGATCTTCG CGAATCGCGC CACGATGATG GACAAATACG TGGATGAGGC CGAGATCATC GGCCCGCAAG AACAGCGCTT AAAAGATGAG GCTAAGGCAA AAGGCCAGCC CGAACCGTCG TTCCCGTGGC ATCCGGATCC GCATAGCTGG GCTCCATCTG AGCTCTACAA TGCAATGATC TCGCCGCTCA CGCCCTACCC CATCCGCGGA GTCATCTGGT ATCAGGGTGA GAGCAACTCC GCCTACGATC GCGCCCCGCA TTATGCGGAA CTGTTCCAGA CTATGATTCG CGACTGGCGC AATCATTGGG GGGTCGGGGA CTTCCCATTC CTCTTCGTGC AGATCTCCGC GTACAAATCT AGCGAAGCAG AGCACTGGGG ATCGCTACGG CAGACACAAT TGGAGAGTCT GGCACTGCGC AACACGGGCA TGGCCGTGAC GATCGATGTT GGGAATCCCG ACGATGTGCA CCCAACGGAC AAGGTGACCG TCGGGTCGCG TCTTGCGCTT GCCGCTCGTG CCCTCAGTTA CGGCGAGAAG ATCGAGTACT CCGGCCCTCT CCCGCGCCAG GTCACGCGCG AAGAAAAAGC CCTCCGCATC TCATTCGATC ACGCGGAGAG CTTGCAGGCA GGGAAGAATG GCTGGTGTGG ATTCGAGGTT GCAGGAACCG ACGGCAAGTT CTCGCCGGCT ACCGCGAAGA TCGAAGCTAC GCAGATTGTT GTCTCGAGCC CGGCAGTCAG TGAGCCAGTT TCCGTGCGCT ATGACTGGAC GAATGCGCCT GATTGCTTCT TCTACAACCA GATGGGTTTG CCCGCTTCTT CCTTCGAAGC AAGTTTGCCA CTGTTTCACT GA
|
Protein sequence | MRFSLKLIAI VLCLASCVFA EVSLPAVLSD GAVLQRGMPI HFFGRAAPGE AITISLNKQS KSTTADYVGR WHLYLAPEAA GGPYDATVKG SNTITVHDVL IGDVWVASGQ SNMEYPMEGW GGTPKQNLDE FPKANFPTLR FFQTQHAYSD HPLMDIPKPA KWVACTPETA KKFSAVAYYF AKNLIEKEKV PVGIMEADWG GSVAEAWTSL DGLSSKAGLM PIFANRATMM DKYVDEAEII GPQEQRLKDE AKAKGQPEPS FPWHPDPHSW APSELYNAMI SPLTPYPIRG VIWYQGESNS AYDRAPHYAE LFQTMIRDWR NHWGVGDFPF LFVQISAYKS SEAEHWGSLR QTQLESLALR NTGMAVTIDV GNPDDVHPTD KVTVGSRLAL AARALSYGEK IEYSGPLPRQ VTREEKALRI SFDHAESLQA GKNGWCGFEV AGTDGKFSPA TAKIEATQIV VSSPAVSEPV SVRYDWTNAP DCFFYNQMGL PASSFEASLP LFH
|
| |