Gene Acid345_3806 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3806 
Symbol 
ID4071090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4498657 
End bp4500093 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content56% 
IMG OID637985829 
ProductUDP-glucose 6-dehydrogenase 
Protein accessionYP_592880 
Protein GI94970832 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0278162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.412621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG CAGTGATTGG TTCGGGGTAC GTTGGGCTTG TCTCGGCCGC GTGCTTTGCG 
GAAATCGGCC ATGAGGTAAT CTCCGTCGAT AACGATCACG CCAAAGTTAA TGCGCTGCGC
AATGGAGAGG TGCCAATCCA TGAGCAGTTC CTGCCGGAAT TGCTTGCGAA GCACCGGGGT
AAGGGCTTGA AATTTTCGAC TTCGGTTGGA GATGCGACTG CATGGGCAGA CGCCGTCTTC
ATCACGGTCG GTACGCCCCA ATCGGCCACT GGAGAAGCGG ATCTTTCCTA CGTCGAGGCG
GTAGCGCACG AGATCGCAAC TGCGATTCAT GGCTCGAAGC TGGTAGTGGA GAAGAGTACA
GTTCCGGTCA GAACCTGCGA AGCAATCCGA AAGGTTTTGC AATTGTGCGG GGCGCCAGCG
GATCTCTTCT CGGTTGCGTC TAATCCAGAA TTCTTGCGCG AGGGATCTGC TGTCCTCGAC
TTCCTTCACC CCGACCGGAT CGTAATCGGC GTTGATACCG AGTTCAGCCG TGGGTTGATG
GAGCAAATTT ATTGGCCGCT GACGAGCGGC GAATATTACA AGCGCTCGGA CGCTCTTGGG
GCCGGCCCTC GTTTTTCGGA GTGTGCCCCG CTGATCGTGA CGAGCCCGAA GAGCGCCGAA
CTGATCAAGC ACGCATCGAA TGCTTTCCTG GCTATGAAGA TTTCGTTCAT CAATGCGGTC
GCAAATATTG CAGAATCGGT TGGCGCTGAC ATTGACGAAA TTCGAGCGGG CATAGGAGCC
GATTCACGCA TTGGCAATCG TTTCCTGAAC GCCGGCGTTG GATATGGCGG ATCGTGTTTT
CCGAAGGATG TGCAGGCTTT TCATGCGGTC GCCCAGGAGT GCGGATATCG GTTTGGCTTG
CTCAATGAAG TGATTGAGAT AAATGCCGAG CAGCGTCGCC GCTTCATCTT GAAGGTTCGC
TCAGCGGTAT GGACGCTGCG CGGCAAGACT CTTGCGGTCT TGGGCGCAGC TTTTAAGGGC
GGCACTGACG ACATTCGTGA GTCGCCTGCG ATCGCGATTG TTGATGAGCT GCTCGCTCAG
GGGTCGTCGG TGCGGTTGTT CGATCCAGCG GCGCTGCCGA AAGCCAAAGC GGTGCTTGGT
GACAGTGTTC AGTATGCGAG TGATGCCTAC GATGCAGCGA CGGGCACGGA TGCGTTGCTC
ATTCTGACGG AGTGGCCGGA GTTCGCTCAA CTCGATTTAG AACGTCTACG CAAAGCGATG
AAATTCCCAA TCATCGTGGA CGGCAGAAAC CTGTATCGGC CGTCGTTTAT GGCCAAGGCG
GGCTTCGCGT ACCACAGCAT TGGACGGCCC GAACTTGCAG CTGAGAAATC GGCTCAAGGC
GTGCGACGTG ACGATTGGAT CTACGTTAAC AAGTCGGGAG CAGCATCTGC TGATTAG
 
Protein sequence
MKIAVIGSGY VGLVSAACFA EIGHEVISVD NDHAKVNALR NGEVPIHEQF LPELLAKHRG 
KGLKFSTSVG DATAWADAVF ITVGTPQSAT GEADLSYVEA VAHEIATAIH GSKLVVEKST
VPVRTCEAIR KVLQLCGAPA DLFSVASNPE FLREGSAVLD FLHPDRIVIG VDTEFSRGLM
EQIYWPLTSG EYYKRSDALG AGPRFSECAP LIVTSPKSAE LIKHASNAFL AMKISFINAV
ANIAESVGAD IDEIRAGIGA DSRIGNRFLN AGVGYGGSCF PKDVQAFHAV AQECGYRFGL
LNEVIEINAE QRRRFILKVR SAVWTLRGKT LAVLGAAFKG GTDDIRESPA IAIVDELLAQ
GSSVRLFDPA ALPKAKAVLG DSVQYASDAY DAATGTDALL ILTEWPEFAQ LDLERLRKAM
KFPIIVDGRN LYRPSFMAKA GFAYHSIGRP ELAAEKSAQG VRRDDWIYVN KSGAASAD