Gene Acid345_1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1697 
Symbol 
ID4070480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2057264 
End bp2058775 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content58% 
IMG OID637983705 
Productsialate O-acetylesterase 
Protein accessionYP_590772 
Protein GI94968724 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.240139 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0645618 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTTT CGTTGAAACT TATCGCCATT GTCCTATGCC TTGCCTCGTG CGTCTTCGCA 
GAGGTCTCGC TACCAGCCGT ACTCTCCGAC GGTGCAGTTT TACAGCGCGG GATGCCTATC
CATTTCTTCG GAAGAGCGGC ACCGGGCGAG GCGATCACCA TCTCGTTGAA CAAGCAATCG
AAGAGCACGA CGGCGGACTA CGTCGGCCGC TGGCACCTCT ATCTCGCGCC GGAAGCCGCC
GGCGGCCCAT ATGACGCTAC GGTGAAGGGA AGCAACACGA TCACCGTGCA CGACGTTCTG
ATCGGCGATG TTTGGGTCGC ATCCGGACAG TCCAACATGG AGTATCCGAT GGAAGGCTGG
GGAGGAACTC CAAAGCAGAA TCTCGATGAG TTTCCCAAAG CCAACTTTCC GACGTTGCGC
TTCTTCCAGA CGCAACATGC TTACTCCGAT CACCCGTTGA TGGACATCCC GAAGCCTGCC
AAGTGGGTCG CGTGCACTCC GGAAACCGCG AAGAAGTTCT CGGCGGTCGC GTATTACTTC
GCCAAGAACC TCATAGAGAA GGAAAAGGTG CCCGTCGGCA TCATGGAAGC TGATTGGGGT
GGCAGCGTCG CCGAAGCTTG GACGAGTCTG GACGGTCTAT CAAGCAAAGC CGGTCTGATG
CCGATCTTCG CGAATCGCGC CACGATGATG GACAAATACG TGGATGAGGC CGAGATCATC
GGCCCGCAAG AACAGCGCTT AAAAGATGAG GCTAAGGCAA AAGGCCAGCC CGAACCGTCG
TTCCCGTGGC ATCCGGATCC GCATAGCTGG GCTCCATCTG AGCTCTACAA TGCAATGATC
TCGCCGCTCA CGCCCTACCC CATCCGCGGA GTCATCTGGT ATCAGGGTGA GAGCAACTCC
GCCTACGATC GCGCCCCGCA TTATGCGGAA CTGTTCCAGA CTATGATTCG CGACTGGCGC
AATCATTGGG GGGTCGGGGA CTTCCCATTC CTCTTCGTGC AGATCTCCGC GTACAAATCT
AGCGAAGCAG AGCACTGGGG ATCGCTACGG CAGACACAAT TGGAGAGTCT GGCACTGCGC
AACACGGGCA TGGCCGTGAC GATCGATGTT GGGAATCCCG ACGATGTGCA CCCAACGGAC
AAGGTGACCG TCGGGTCGCG TCTTGCGCTT GCCGCTCGTG CCCTCAGTTA CGGCGAGAAG
ATCGAGTACT CCGGCCCTCT CCCGCGCCAG GTCACGCGCG AAGAAAAAGC CCTCCGCATC
TCATTCGATC ACGCGGAGAG CTTGCAGGCA GGGAAGAATG GCTGGTGTGG ATTCGAGGTT
GCAGGAACCG ACGGCAAGTT CTCGCCGGCT ACCGCGAAGA TCGAAGCTAC GCAGATTGTT
GTCTCGAGCC CGGCAGTCAG TGAGCCAGTT TCCGTGCGCT ATGACTGGAC GAATGCGCCT
GATTGCTTCT TCTACAACCA GATGGGTTTG CCCGCTTCTT CCTTCGAAGC AAGTTTGCCA
CTGTTTCACT GA
 
Protein sequence
MRFSLKLIAI VLCLASCVFA EVSLPAVLSD GAVLQRGMPI HFFGRAAPGE AITISLNKQS 
KSTTADYVGR WHLYLAPEAA GGPYDATVKG SNTITVHDVL IGDVWVASGQ SNMEYPMEGW
GGTPKQNLDE FPKANFPTLR FFQTQHAYSD HPLMDIPKPA KWVACTPETA KKFSAVAYYF
AKNLIEKEKV PVGIMEADWG GSVAEAWTSL DGLSSKAGLM PIFANRATMM DKYVDEAEII
GPQEQRLKDE AKAKGQPEPS FPWHPDPHSW APSELYNAMI SPLTPYPIRG VIWYQGESNS
AYDRAPHYAE LFQTMIRDWR NHWGVGDFPF LFVQISAYKS SEAEHWGSLR QTQLESLALR
NTGMAVTIDV GNPDDVHPTD KVTVGSRLAL AARALSYGEK IEYSGPLPRQ VTREEKALRI
SFDHAESLQA GKNGWCGFEV AGTDGKFSPA TAKIEATQIV VSSPAVSEPV SVRYDWTNAP
DCFFYNQMGL PASSFEASLP LFH