Gene Acid345_1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1702 
Symbol 
ID4070485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2066511 
End bp2067506 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content59% 
IMG OID637983710 
ProductUDP-galactose 4-epimerase 
Protein accessionYP_590777 
Protein GI94968729 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.189185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGTAC TTGTCACGGG AGGCGCAGGA TATATCGGCA GCGTCGTTGC CGCCGCGCTC 
GTCGAGCGAG GACACAGCGT TGTTGTCTAC GACAACCTCA GCAATGGGCA TCGCGCCGCA
GTTCCCAGCG CTGCGCAAGT CGTTGCCGGC GATATCGGTG ATCGAGCGAT GCTCGATGCC
ACGCTTCGCA ATGGCGCATT CGATGGAGTC ATGCACTTCG CGGCATTCAT CGAAGCCGGC
GAGTCGATGC GTTTTCCCGA AAGGTTCTTT CGCAACAACA CCGCGAACAC GCTGACGTTA
CTCGAACTCA TGCTTGAGCA TCGCGTCTCG CGCTTCGTTT TCTCCTCGAC GGCAGCGCTT
TACGGGAACC CTGAGCGCAC GCCCATCGAA GAATCTGACC CACTTAAGCC CACCAACGCC
TATGGCGAAT CGAAACTGCT CGTCGAGCGG ATGCTCGAGT GGTTCCACTC GATCCACGGC
CTGTGCTACG CGAGCCTCCG GTACTTCAAT GCTGCCGGAG CTACCGCCAC GCTCGCGGAA
GATCATCATC CGGAATCGCA CCTGATCCCC ATCGTTCTGG AAGCGGCGGC AGGGAAGCGC
GATTCGATTG CGATCCATGG CACAGACTAT CCAACTCCCG ACGGCACTTG TGTTCGCGAC
TACATCCATG TCTCCGATCT TGCAGATGCG CACCTTCTTG CGTTGGAACG CCTCGGTCGA
GACGAGCAAC CGGAGCGATT GATCTACAAT CTCGGCAACG GGCACGGCTC CAGTGTTCTG
GAAGTGATCG AGGCCGCGAA GCGCGTGACG GGCAATCCCA TTCAGGTGAA AGAGGGTCCC
CGCCGCGCCG GCGATCCCGA AATCCTTGTT GCCAGTTCGC AAAAGATCCG CAAAGAACTC
GGGTGGAGCC CTAAGTACAC TGACATCGAC ACCATTATTG AGAGCGCGTG GAGATGGCGC
AACTCGCACC CGAAGGGCTA CGGAGACGAA CAGTGA
 
Protein sequence
MKVLVTGGAG YIGSVVAAAL VERGHSVVVY DNLSNGHRAA VPSAAQVVAG DIGDRAMLDA 
TLRNGAFDGV MHFAAFIEAG ESMRFPERFF RNNTANTLTL LELMLEHRVS RFVFSSTAAL
YGNPERTPIE ESDPLKPTNA YGESKLLVER MLEWFHSIHG LCYASLRYFN AAGATATLAE
DHHPESHLIP IVLEAAAGKR DSIAIHGTDY PTPDGTCVRD YIHVSDLADA HLLALERLGR
DEQPERLIYN LGNGHGSSVL EVIEAAKRVT GNPIQVKEGP RRAGDPEILV ASSQKIRKEL
GWSPKYTDID TIIESAWRWR NSHPKGYGDE Q