Gene Acid345_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0890 
Symbol 
ID4069140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1109595 
End bp1110935 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content57% 
IMG OID637982897 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_589967 
Protein GI94967919 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.267642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACGA GCTCTGAATC GCTGACGGTT GGAATTCTCC AGCCGCGCAA ACGAATTGAA 
GGCGCGCCCC TCGCACGGAA CGTCGTGTTC AGCGTAGTCG ACTACCTGAC GCAACCGCTC
CTGATGTTGC TTACAGCGCG CTACTTCGTC AGAGCGCTGG GGTTACCGCT TTTCGGGATT
TGGATTTTGG TCCTCGCCAT CATTGGGAGC AGTGGCAGCA TCTGCACGGG TTTTGGAGAC
GCGGCATTGA AGTATGTCGC CGCTATGCGT GGCCGCATGG ACGATGACGG TGTTTCGCGA
GTTATCGGCT TATCTGCAAT GCTGAACCTT TCCATGGGAA TTGCGTTAGC GCTCGCGTTT
TATGCGCTCG CACCCTGGTC CGCCACGCAC ATGTTCCATC TCAGCGGACA ACTAGCTACC
GAGTTCGTTA CGGCTCTTCG CATTGGCGGG GGCGTCCTCG CCGTTCGGTC GCTATCCTTC
GTTTTCATTG GGGCATTCCG CGCATTTGAA CTATATGAGC GGGCAACTCA GGTTGTCGTG
AGTACGAGAC TTGCAACAGC TCTCGCGGCG CTCGTCTTGG TCTGGAAAGG GTTCGGAGTC
GTCGCAATTC TTTGGATCAC TTTGATTTGC GAATTCGGAG CACTCTTAAC GTTGGTGCAC
CGCGGCGCCG GGGTTTGGCG AGAGGTTCGT GTCCCGCGGC TTAAAGAAGA TGACTGGCGA
TCGCTTACAT CGTTTGGCTT CTTCGGATGG GTCCAGGCGC TCTCCGGAAC ACTCTTCAGC
CAAGCCGACC GGCTCGTCGT CGCCGCTTTG CTCGGACCTT CAGCTTTGAC CTATTACGGC
GTGTGTGTGC AATGCACGCA GCCGATTCAC GGACTGACCG CAGCTGGATG CAATGTCTTG
TTCCCGCATC TGAGCACGAA GGTCGAGACC GCCGGCACAT CGTATTTGCG GAAGTTTCTG
GCTCGCGCAT TTCGTCTCAA CCTACTCACC GTTCTTGGGT TGGCGATCGT GCCCCTCCTG
TTGAGCAGAC CTCTCCTCAC ACTCTGGATG GGAAAATCAT TCTCCGACCA TGCAGCTGTC
ACACTTTCCC TCGTGGCGGC GAGCTTTGCT CTCCTTGCTC TGAATGTCCC AGGTCACTAC
GCTCTCATGG CTCTCGGAGA GGTGCGGTAC CTGACAATCT TGAATGTAGC CGGGTGCGTC
CTGTCTCTCT TACTCGCTTG GTTCTTTATC CCGAAGATCG GAATCGCGGG AGCAGCGGCT
GCAAGGCTGG CATACGGCCC GTTGACCTGG TTGTTGTATG CGAGGCTGCA GCGGCTAACA
AGCCGGGAAG AGGCGAGATA A
 
Protein sequence
MATSSESLTV GILQPRKRIE GAPLARNVVF SVVDYLTQPL LMLLTARYFV RALGLPLFGI 
WILVLAIIGS SGSICTGFGD AALKYVAAMR GRMDDDGVSR VIGLSAMLNL SMGIALALAF
YALAPWSATH MFHLSGQLAT EFVTALRIGG GVLAVRSLSF VFIGAFRAFE LYERATQVVV
STRLATALAA LVLVWKGFGV VAILWITLIC EFGALLTLVH RGAGVWREVR VPRLKEDDWR
SLTSFGFFGW VQALSGTLFS QADRLVVAAL LGPSALTYYG VCVQCTQPIH GLTAAGCNVL
FPHLSTKVET AGTSYLRKFL ARAFRLNLLT VLGLAIVPLL LSRPLLTLWM GKSFSDHAAV
TLSLVAASFA LLALNVPGHY ALMALGEVRY LTILNVAGCV LSLLLAWFFI PKIGIAGAAA
ARLAYGPLTW LLYARLQRLT SREEAR