Gene Acid345_0837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0837 
Symbol 
ID4072363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1039966 
End bp1041348 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content58% 
IMG OID637982846 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_589916 
Protein GI94967868 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.920136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA AAGAACGTAC TGCATCCACC GACGACCCGC GGGTGAATTG GCATTTAGAG 
GACGATTCAC TTCCATCCGT CGAAATTCCT GTTCAGGTCG GCAAGCCACG TTCTCTCAAG
GCGAACGTCA TTTGGACCCT ATGCGGTAAT TTCATCTACG CTTTCTCGCA GTGGGCCATG
CTCGTTTGTA TCGCTAAATT AGGGGATCCG ACGATGGTCG GACAATTCGC ATTCGGATTG
GCGGTGAGTG CTCCCATTTA CATGTTCACC AACATGCAAC TCCGGTCCGT GCAAGCCACC
GATGCGAAAA GCGAGTACCG CTTCTCAGAG TATTTCGGGC TCCGCATGCT GGCCAGCGTC
GCCGGTCTTC TTGCCGTCTG TGTTGTCTCG GCGCGCAGTT CTTCAATGCG TACTACCGCG
CTCGTGGTGT TCGGCGTCGG CCTTGCTAAG TTCATGGAAA GCGTGAGCGA CGTAATCTAC
GGGCTCTGCC AGAAACACGA GCGCATGGAC AGCATCGCGA TCAGCATGTC CATAAAAGGG
CTTGGATCTG TTGCCGCACT TGTGGGCGTC CTTCGCTACA CCCACAACTT GGTTTATGCG
GTGCTCGCCA TGGCCGGGTG GTGGGCTCTA CTGCTGCTGT TTGTCGACCT TCGTTGGGCA
CATAAATTCG CACAGATCGA CCCCGCGGAC CAGGGCACGA TTATTCCTTC GTTCGAACGG
AAAATACTCT TCTCGCTTGG CGTCCTGGCG CTCCCCATGG GCATCCAGAC CATGCTCGCC
AGCCTGACAA CCAACATTCC GCGATATGTC ATTCAGCACG ACATGGGCGC CGCGGCATTA
GGTCTCTATG CCGCCATGGC TTACTTCATG CTCGCGGGAC ACACCGTCAT CGCTGCGGTT
GGCAATTCCG TCCAAGCCAG ACTGGCGCGG CATTGGCAGC AATCCCTGCC ACTCTTTCGG
CGCTTGCTGG TTCGCTGCGC GGTCTTTGCC TTCGGCATGG GAGCGCTCGC AGGAGTGATT
GCGCTTGGGG CCGGCAAACC GCTTCTCACC CTCTTCTACC GGCCAGAGTA TGCGAAGAAC
CACAACGCAT TCACGGTACT CATGTTCGCC ACCGGCTTCT ATTATGTCGG ATCGATGCTC
GGCGCCGGCG TGGCAGTGGT GCGGCGCTTC TGGCTCTTTA CGGTGCTCTA CGCCAGCGTT
CCGCTCGTCG CATTAACGTC CTCGATCGTG CTTGTCCCGC GCTCAGGCTT GATGGGAGCG
GCCATCGCAA CCCTCATCTT TTGCGTGGCA AACGCCGTGG TTCCGATGAT CGTTATCGCG
CAGGCGTATA GACAACGCGT CGGTGCGCTT CCGGGGGTCG CCCCTTTAAG CGAACCTGCA
TGA
 
Protein sequence
MSQKERTAST DDPRVNWHLE DDSLPSVEIP VQVGKPRSLK ANVIWTLCGN FIYAFSQWAM 
LVCIAKLGDP TMVGQFAFGL AVSAPIYMFT NMQLRSVQAT DAKSEYRFSE YFGLRMLASV
AGLLAVCVVS ARSSSMRTTA LVVFGVGLAK FMESVSDVIY GLCQKHERMD SIAISMSIKG
LGSVAALVGV LRYTHNLVYA VLAMAGWWAL LLLFVDLRWA HKFAQIDPAD QGTIIPSFER
KILFSLGVLA LPMGIQTMLA SLTTNIPRYV IQHDMGAAAL GLYAAMAYFM LAGHTVIAAV
GNSVQARLAR HWQQSLPLFR RLLVRCAVFA FGMGALAGVI ALGAGKPLLT LFYRPEYAKN
HNAFTVLMFA TGFYYVGSML GAGVAVVRRF WLFTVLYASV PLVALTSSIV LVPRSGLMGA
AIATLIFCVA NAVVPMIVIA QAYRQRVGAL PGVAPLSEPA