Gene Acid345_1785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1785 
Symbol 
ID4072845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2164291 
End bp2165301 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content62% 
IMG OID637983793 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_590860 
Protein GI94968812 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.250908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCC TCATCATCGG CGGCACTCGC AACCTCGGGC CCTCCATCAT CTCTGCCCTC 
GTCACCGCGG GCCACCAGGT CACCATCTTC CATCGCGGCC GAACTCTTTA CGACCTCCCT
CGCGAAGTCG AAGTCCTGAA CGGCGACCGC GCCCAGCGGG CCGATTGCGA GCGCAGTTTC
GGAGGCCGCG ACTTCGACGC CGTCATCGAC ACCACGCTTT ACAACGGCCG CGACGCCGCG
ATCGCCACTG AAATCTTCGA AGGCCACGTC TGCCAATACA TTTTCATATC AACAGGACAG
GTCTATCTCG TCCGCACCGG CCCGCAGCGT CCATTCCGCG AAACCGACTA CGACGGCCCG
CTCATGCCGG AGCCGCCGAA AGACCATCAT CAAGATCACG ACAACTGGGT CTACGGCATC
GAGAAGCGAG AGGCCGAAGA CATCCTCGCC GAGGTCCACG CGAAGCACGC TTTCCCATAT
GTCTCGCTCC GCCTGCCGAT GGTCAACAGC GAGCGCGACC ACTACCATCG CCTGCAGAAC
TACCTCCTTC GCATGTGGGA TGGCAGCCCG CTGCTCATTC CCGACGAGCC CGGCCTTCCG
GTTCGACACG TTTACGGCCA GGACGTTGTT CGCGCCATCG AACTCTGTTT GGCGAATCGC
GAAACCATCG GTCGCGCCTA CAACATCGGC CAGGACGAAA CGCTTTCCCT CCGCGAGTTC
CTCGATCTCA CAGCCGAGAT CGCACATTCC AAGCCCCAGA TCGCCGCCTT CCCGCGCCCG
TTGCTCGATT CCGCGCGCCT GCTGCCGGCA TGTTCGCCCT TCAGCGGTCC TTGGATGTCG
AGTCTCGACA ACGCGCACAG CAAGCAGGAA CTCGGGATGA CGTACACCCC GCTTCGTGCC
TATCTCGCCA AGCTGGTCGA GTATTTCCGC GAGCACCGCG AGCCGGCGCC GCCTGGCTTC
GAAGAGCATC GCAACCGGGA ACTTGCTTTT GCACAGCATC ATGGAGCCTA G
 
Protein sequence
MRVLIIGGTR NLGPSIISAL VTAGHQVTIF HRGRTLYDLP REVEVLNGDR AQRADCERSF 
GGRDFDAVID TTLYNGRDAA IATEIFEGHV CQYIFISTGQ VYLVRTGPQR PFRETDYDGP
LMPEPPKDHH QDHDNWVYGI EKREAEDILA EVHAKHAFPY VSLRLPMVNS ERDHYHRLQN
YLLRMWDGSP LLIPDEPGLP VRHVYGQDVV RAIELCLANR ETIGRAYNIG QDETLSLREF
LDLTAEIAHS KPQIAAFPRP LLDSARLLPA CSPFSGPWMS SLDNAHSKQE LGMTYTPLRA
YLAKLVEYFR EHREPAPPGF EEHRNRELAF AQHHGA