Gene Acid345_0889 details


Gene Information

Locus tag: Acid345_0889
Symbol:
ID: 4069139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1108325
End bp: 1109611
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 54%
IMG OID: 637982896
Product: lipopolysaccharide biosynthesis
Protein accession: YP_589966
Protein GI: 94967918
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3524] Capsule polysaccharide export protein
TIGRFAM ID:
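
The coordinate and length fields in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and that the gene length includes a terminal stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1108325
end_bp = 1109611
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)       # True
print(gene_length_bp % 3 == 0)         # True
print(residues == protein_length_aa)   # True
```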


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.6181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATCCG ACGACAGGCT TGCTGCTTCA GTCGATGTGA CGAATGCGGT CCGCGTTAGT 
TCGCGCTTGG ATCGACTCGC CATCGGATGG GAGCGACGAG GCGCGATTGT CAAAGCCGCG
GTGATAGGTG TGATCGTCAG CACGACTCTC GCTTTCGTTA TTCCAAAGCA GTACGAGTCA
ACAGCACGCA TCATGCCACC CGAAGGGGGA ATGAGTTCGG CGATTATGGC GGCGCTGGCA
AGTCGGGCTC TTCCGGGAAA TCTTGGAGCA ATTGCGGGCA GCCTTTTCGG CTTCCAAAGT
ACCAGCACGG TCTTCGTTAA CTTGCTACAG AGTCGGAGCG TGACCGAACG GGTCGTTGAT
CGCTTTGATC TTCAGAAAGT CTATCGAAGC CGCTATAAAC AGGACGCCCT AAAAAAGCTC
CATCGGAGAA CCGAGATTGC AGAAGACCGT AAGACGGGCA TCATTACGAT TACGGTCGCA
GACACTGACC GTCGTCGCGC TCGCGATATG GCCCAAACTT ATCTCGATGA ACTGAACTCT
CTGGTAACCC GTGTGAACAG TTCGGCCGCC GGACGAGAAC GCGAGTTCAT CGCACAACGC
CTTGTCACAG TGAAGCGTGA TCTCGATGAT GCCGAACGCC AGTTGAGTGT GTTTTCGACG
AAGAACGCCA CGCTCGACGT TAAAGAACAG ACTCGCGCAA TGGTCGAGGC AACAGCAAAA
CTAGAGGGAG AACTCATTAT TGCGCGTTCG GAATTGAGTT CGCTAGATCA GATATATGGG
CCCGAGAATG TGAGAGTGCG GGCGGGTCGC GCTAGAGTCG GCCAATTGGA ACATGAACTC
AAGAATGCCA CCGGCTCTGG TGTGCCGAGC GACATTACCG AATCTACTCC ATATCCTCCT
TTAAGAGCTC TGCCAACGTT AGGCGTGCAA TGGGCCGATC TCTACCGACG CGTGAAGCTG
CAGGAAACGG TATTCGAGCT ATTGACGCAA GAGTACGAAC TCGCCCGCAT CGAAGAGGCG
AAAGCAATTC CAAGCATCAG CGTAATTGAT CCGCCGAATT GGCCGGAACG CAAGTCCTTC
CCGCCGCGAT TGGTGATCAT GCTCGTAGGG ACCTTACTGA GTGTATTGGG AACCTTCTTC
GTCATCGTGA GGAAGGCTGA GTGGCGCGCG GTTCCAGAAG AAGATCCGAA AAAGTTACTG
TTCCGTGCCG TCATGCTCGA CTTGAAAGAG GATAGCCCTC AATGGCTATC GAAGAAGACG
GTTCACCACA ATGGCCACGA GCTCTGA
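
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how a GC percentage like the one in the Gene Information fields could be computed. It assumes the simple definition (G + C) / length; the record does not say how its own figure was derived. `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as a small demonstration.
first_line = "ATGCCATCCG ACGACAGGCT TGCTGCTTCA GTCGATGTGA CGAATGCGGT CCGCGTTAGT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full 1287 bp sequence through the same two functions gives a value that can be compared with the reported GC content.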
 
Protein sequence
MPSDDRLAAS VDVTNAVRVS SRLDRLAIGW ERRGAIVKAA VIGVIVSTTL AFVIPKQYES 
TARIMPPEGG MSSAIMAALA SRALPGNLGA IAGSLFGFQS TSTVFVNLLQ SRSVTERVVD
RFDLQKVYRS RYKQDALKKL HRRTEIAEDR KTGIITITVA DTDRRRARDM AQTYLDELNS
LVTRVNSSAA GREREFIAQR LVTVKRDLDD AERQLSVFST KNATLDVKEQ TRAMVEATAK
LEGELIIARS ELSSLDQIYG PENVRVRAGR ARVGQLEHEL KNATGSGVPS DITESTPYPP
LRALPTLGVQ WADLYRRVKL QETVFELLTQ EYELARIEEA KAIPSISVID PPNWPERKSF
PPRLVIMLVG TLLSVLGTFF VIVRKAEWRA VPEEDPKKLL FRAVMLDLKE DSPQWLSKKT
VHHNGHEL
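
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration rather than the database's own pipeline: it builds the codon table from the compact 64-letter amino acid string of the standard genetic code, whose codon assignments translation table 11 shares (table 11 differs in which codons may act as starts). It stops at the first stop codon. `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence.
print(translate("ATGCCATCCG ACGACAGGCT TGCTGCTTCA"))  # MPSDDRLAAS
print(translate("GTTCACCACA ATGGCCACGA GCTCTGA"))     # VHHNGHEL
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed in the record.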