Gene Acid345_0895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0895 
Symbol 
ID4069145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1115049 
End bp1116065 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content54% 
IMG OID637982902 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_589972 
Protein GI94967924 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000813821 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAAAGC GTGTCCTTGT GACGGGCGCT GGTGGGTTTA TCGGGCACCA TCTCATGAAC 
GCCTTGGTCG ACCTGGGATA TTGGGTTCGA GGTGCGGACA TAAAGAGCCC GGAGTTTCAG
CCCAGCCGCG CGGATGAATT TCATCTCCTT GATCTTCGCG AGGTACAAAA CTGCGAGCAA
ATGACAGACG GAGTGGATAT GGTCTTCGCG CTTGCTGCAG ATATGGGGGG CATGGGCTAC
ATTTCAAGCC ATCATGCGGC CATTCTGCAC ACGAATACAT TGATCAACTT CAATACGCTG
GAAGCGGCAA GGCGCAGCGG GGTGCGGCGC TATCTGTTCA CCTCGTCGGC TTGCGTCTAT
CCCGAGTACC GTCAACTTGC TACTGACGTA CCGGCCCTAC GCGAGGAGGA TGCTTACCCG
GCTGCTCCGC AGGATGCATA TGGCTGGGAA AAGTTGATCA CGGAGCGCCT ATGCACTCAC
TATCGCGAAG ACTATGGGAT GGAAATGCGA ATAATTCGCT TCCATAATAT CTTTGGACCG
CTGGGGACGT GGGAAGGAGG ACGCGAGAAA GCTCCTGCCG CGATGTGCCG CAAAGTTGCG
ATCGCTAAAC TCACAGGTAA TCACGAAATC GAAATCTGGG GCGATGGCAA ACAGACTCGT
TCCTTCTGCT ATATCGACGA TTGCGTCACC GGTATCCATA AGCTCATGGT GTCCGATTTT
GCGTATCCGT TGAATCTCGG GCAGGATCGC ATGGTAAGCA TCAATGAACT CGCGGATTTA
GTTGCGGATA TCGCAGGTAT TCGCGTCAAC AAGCGTCACG TTTCTGGGCC GATGGGAGTA
CGCGGTCGTA ATTCCGATAA CACACTCTTG CGACAGGTTC TCGGCTGGAC CCCTGTGATC
TCTTTGGAAG ATGGCCTGCG TCGTACTTAC CGTTGGATCG AGGCTCAGGT GGCCGCCAAA
CTTTCGGAGA AATGCTCGAG TTCGTTCACT TCGAAGGTCG CGGCTACTAC GCCATGA
 
Protein sequence
MLKRVLVTGA GGFIGHHLMN ALVDLGYWVR GADIKSPEFQ PSRADEFHLL DLREVQNCEQ 
MTDGVDMVFA LAADMGGMGY ISSHHAAILH TNTLINFNTL EAARRSGVRR YLFTSSACVY
PEYRQLATDV PALREEDAYP AAPQDAYGWE KLITERLCTH YREDYGMEMR IIRFHNIFGP
LGTWEGGREK APAAMCRKVA IAKLTGNHEI EIWGDGKQTR SFCYIDDCVT GIHKLMVSDF
AYPLNLGQDR MVSINELADL VADIAGIRVN KRHVSGPMGV RGRNSDNTLL RQVLGWTPVI
SLEDGLRRTY RWIEAQVAAK LSEKCSSSFT SKVAATTP