Gene Acid345_3495 details

Gene Information

Locus tag: Acid345_3495
Symbol:
ID: 4072753
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4122496
End bp: 4123605
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 57%
IMG OID: 637985517
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_592570
Protein GI: 94970522
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID:
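
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above. The 1-based inclusive coordinate
# convention is an assumption, not something the record states.
START_BP = 4122496
END_BP = 4123605
GENE_LENGTH_BP = 1110
PROTEIN_LENGTH_AA = 369


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive start and end positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # compare with Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.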


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.437184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATGA ATAAAGGAAG TTTCCGTTCG GCACTCATTC TGGGCGGAGC TGGCTTTATC 
GGCTCCAATC TGGCAAGTTG GCTCCTCCAA AATACCTCGG CTAAAGTACA CATTTTCGAC
AACTTATCGC GTTTCGGCGT GCGCAACAAT TTGGATTGGC TGCAGGGAAT GGCGGCCACT
TCTGGGCGGC TGCAGATCAC CGTTGGGGAC GTTCGCGACG CCGCCCACGT GGAGCGAGTG
GTTCGGCACG CGACGGAGAT CTACCACTTC GCGGCGCAGG TTGCAGTAAC CACATCAATC
TCCGATCCGA GGCACGATTT CGAGGTAAAT CTCGGGGGGA CGGTCAACGT ACTCGAAGCC
GCGCGGAAAA GCGACAATCA GCCCTTCATC TTCTTTACTT CGACCAACAA GGTATACGGC
GATTTCGGTG CGGAAGACCT TTATTTAGAC GGCAAGCGGT ATCGCAGCAA AAACGCAGCC
GGAACTTCAG AAACGCAGCC GCTGGATTTC CACTCGCCCT ATGGATGCTC GAAGGGGGCG
GCCGACCAAT ACGTTCGTGA CTATGCCCGG ATTTATGGAC TGAACACGGT AGTGTTCCGG
ATGTCGTGTA TCGCGGGCCA GCAGCAGTTC GGCAATGAAG ACCAGGGATG GGTGGCACAT
TTTCTGTACT CTGCTCTGCG CGGCGCGCCG ATCACGATCT ACGGCAATGG CAAACAAGTC
CGCGATGTGC TCTGCGTGGA TGACCTGGTA CGCGCAATCG ATCTGGCGCG GCAGTTGCCG
GCCTCATCCG AGGGGCGGAT TTACAACATC GGTGGCGGCG CGGAGAATGC GCTCTCGCTT
CTCGAATTAA TGGACCTTGT AAAGAGCGTG ACGGGGCACG GCTGCGATGT GACCTACGAT
GCCGCTCGGC CCGGTGACCA GCTTTACTAC GTCACCGACT TCGCGAAGTT CAAGCGGGAT
TCAGGGTGGC AACCGGAGAT CAGCCCCGAA GGCACGCTCA AAAAGATCTA CGACTTCTAC
AAGAAGAACC GCGATCTGTT CGCGCTGACT GCTGCAAGAC CTTCGATCTT GCCTGCGGCT
TCCGCTTCGG AGCTGGCGCA ACCGGCATGA
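
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the sequence; the helper names are hypothetical, and only the first sequence line is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGATGA ATAAAGGAAG TTTCCGTTCG GCACTCATTC TGGGCGGAGC TGGCTTTATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of the first line gives a value that can be compared with the GC content field of the record.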
 
Protein sequence
MTMNKGSFRS ALILGGAGFI GSNLASWLLQ NTSAKVHIFD NLSRFGVRNN LDWLQGMAAT 
SGRLQITVGD VRDAAHVERV VRHATEIYHF AAQVAVTTSI SDPRHDFEVN LGGTVNVLEA
ARKSDNQPFI FFTSTNKVYG DFGAEDLYLD GKRYRSKNAA GTSETQPLDF HSPYGCSKGA
ADQYVRDYAR IYGLNTVVFR MSCIAGQQQF GNEDQGWVAH FLYSALRGAP ITIYGNGKQV
RDVLCVDDLV RAIDLARQLP ASSEGRIYNI GGGAENALSL LELMDLVKSV TGHGCDVTYD
AARPGDQLYY VTDFAKFKRD SGWQPEISPE GTLKKIYDFY KKNRDLFALT AARPSILPAA
SASELAQPA
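
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own method. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons, so the standard assignments are used here. The `translate` helper is hypothetical, and it assumes the input starts in frame and contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGACGATGA ATAAAGGAAG TTTCCGTTCG"))
```

For the first 30 bases this yields the first 10 residues of the protein sequence listed above, MTMNKGSFRS.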