Gene Acid345_3265 details

Gene Information

Locus tag: Acid345_3265
Symbol:
ID: 4072677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3867701
End bp: 3868819
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 59%
IMG OID: 637985286
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_592340
Protein GI: 94970292
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
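
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3867701
end_bp = 3868819

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1119 bp) and Protein Length (372 aa).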


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.791109
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAC GAATCCTGGT CACGGGCGGC GCGGGGTTTG TGGGATCGCA CTTGGTCGAC 
GCCTTGCTCC GTGCAGGCCA CAGCGTTCGC GTCTTTGACA ACCTCTCGCC CCAGGTGCAT
CCACACGGCT TGCCAAGCTA TCTCGCCACC GACATCGAGT TCATTCAAGG CGACATGCGC
GACCTTGACG CCGTTCGCCG CTCGCTCGAA AACATCGACG TCATCTTTCA CAAAGCCGCT
GCTGTCGGCG TTGGACAGTC CATGTATGAG ATTTCCCATT ACATGAGCGC TAACACGCAG
GGCACCGCCA ACTTGCTGCA AGCGATGCTC GACAGCCGCC GCGATTTCGA GAAGCTGGTT
GTCGCCTCCT CAATGTCCAT CTACGGCGAA GGGAAGTACC GGTGCGCCGA GCACGGCGAC
ATTGCTCCCG AGCCTCGGCC GATCGATCAA TTGCGGAAGA AGGAGTGGGA AGCGCTTTGC
CCCGTCTGTA ACGCGAAGCT CGCGCCTATT CCCACCGACG AATCCAAGCG GTTGCAGTGC
ACTTCCATCT ACGCCCTCTC GAAAAAAGAC CAGGAAGAGA TGTGTCTTCT CTATGGCCGC
ACCTATGGAG CGCCGGTGGT CGCATTGCGA TACTTCAACA TCTACGGCAC GCGGCAGGCG
CTTTCGAACC CTTACACCGG TGTGGCAGCG ATCTTTGCCT CGCGCTTGCT GAATCACCGC
TCTCCCATGA TTTTCGAAGA TGGCGAGCAG CAGCGGGATT TCGTCAGTGT GCACGACATC
GTGCAGGCGA ATTTGCTGGC TATGGACCGC GAAGAGGCGA ACGGCCTGGC CATCAACATC
GGCTCAGGCG CGCCAATCTC GATTTCTCAA GTAGCTGACA TTCTCGGCGC CGCCCTCGGC
CTTCACGTCG AGCCTGAAAT CACCGGCAAG TATCGCGCCG GAGACATTCG CCACTGTTTC
GCTGACATCG GTCTGGCGCA GAAGGTCCTC GGCTACCGGC CCAAGCATCG CTTCGCGGAT
GGCATCGGTG AGCTCGTTGC CTGGCTGCGC AATCAGTCCG CTACCGACAA GGTCTCGGAT
GCCACGCAGC AGCTCACGGC TTACGGATTG ACGGCTTAG
 
Protein sequence
MRKRILVTGG AGFVGSHLVD ALLRAGHSVR VFDNLSPQVH PHGLPSYLAT DIEFIQGDMR 
DLDAVRRSLE NIDVIFHKAA AVGVGQSMYE ISHYMSANTQ GTANLLQAML DSRRDFEKLV
VASSMSIYGE GKYRCAEHGD IAPEPRPIDQ LRKKEWEALC PVCNAKLAPI PTDESKRLQC
TSIYALSKKD QEEMCLLYGR TYGAPVVALR YFNIYGTRQA LSNPYTGVAA IFASRLLNHR
SPMIFEDGEQ QRDFVSVHDI VQANLLAMDR EEANGLAINI GSGAPISISQ VADILGAALG
LHVEPEITGK YRAGDIRHCF ADIGLAQKVL GYRPKHRFAD GIGELVAWLR NQSATDKVSD
ATQQLTAYGL TA
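
As a spot check that the protein sequence is the translation of the gene sequence, the first and last rows of the gene sequence can be translated by hand. This is a minimal sketch: `CODON_TABLE` and `translate` are illustrative names, not part of any tool behind this record, and it relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two tables differ in which codons may act as start codons).

```python
# Build the codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)

# First 30 bases of the gene sequence (start of the first row).
first_block = translate("ATGAGAAAAC GAATCCTGGT CACGGGCGGC")

# Final row of the gene sequence, which begins in frame after 1080 bases.
last_block = translate("GCCACGCAGC AGCTCACGGC TTACGGATTG ACGGCTTAG")

print(first_block)  # MRKRILVTGG
print(last_block)   # ATQQLTAYGLTA*
```

Both fragments match the corresponding ends of the protein sequence, with the trailing `*` standing for the TAG stop codon.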