Gene Acid345_3813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3813 
Symbol 
ID4071097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4507820 
End bp4508815 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content54% 
IMG OID637985836 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_592887 
Protein GI94970839 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.168438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.238091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCG CGATCTCAAA GTCGTCTCCC ATTTTTGTGG CTGGCCACCG CGGGCTTGCG 
GGGTCTGCAA TCGTGCGGCG CCTGCAGAGG GCGGGTTACG AGCGCCTATT TCTTAAAACG
CACTCGGAGT TGGATCTCTC AGACGAGATC GAAGTTCGAA AATTCTTCGA CCGTTACCGC
CCGGAATGCG TGTTTTTAGC CGCTGCAAAG GTGGGTGGAA TCCTTGCTAA CCGGGATTAT
CCAGCGGATT TCTTCATTCA AAATGCGCGT ATCCAAAACA ATGTCATCAG CACGTCTTTT
CAATTCGGCG TGAAGCGGAT GGTATTTCTC GGCTCCAGTT GCATTTACCC GAAACTTGCA
CCGCAGCCTC TCAAGGAAGA ATACCTTCTT ACGGGACCGC TCGAGTTTAC AAATCGTTCA
TACGCGGTGG CTAAGATCGC CGGTATCGAA TTGTGCTGGG CGCTGAATCG GCAGCACGGT
ACAAAGTTCC TGGCTGCGAT GCCGACCAAC CTCTATGGGC CCGGCGACAA TTACGATCGG
AACGGATCCC ACGTACTTCC AGCGTTGATT CGAAAAGTTC ATGAGGCGAT CGAAGGACGT
CAGGAAACTG TCACAGTTTG GGGAAGTGGC GAGCCGCGCC GTGAATTCTT GTATAGCGAC
GACATGGCAG ATGCCTGCGT CTTTCTCATG GAATTGGCGG AAGAAACCTA CGATGCGTTC
GTCTCCGATC CCGAGCGACC GCCCTTGTTG AATATTGGAT GTGGAGAAGA TCTCACCATT
TCTGCTTTGG CCCATCTAGT GGCAAAGGAA CTTGGCTACG AGGGCGAGAT CGTATTTGAT
CCCTCCAAGC CGGACGGAAC GCCACGAAAG CTTCTCGATG TGTCCCGCTT GTTCCAAATG
GGTTGGCGTC CGAAAATGTC GTTGGCTGCC GGAATCCGGG AAGCTTACGC CGATTTCAAG
GTCCGGTATT CGTCGATCGC AGCTGCGTCT CGATAG
 
Protein sequence
MSSAISKSSP IFVAGHRGLA GSAIVRRLQR AGYERLFLKT HSELDLSDEI EVRKFFDRYR 
PECVFLAAAK VGGILANRDY PADFFIQNAR IQNNVISTSF QFGVKRMVFL GSSCIYPKLA
PQPLKEEYLL TGPLEFTNRS YAVAKIAGIE LCWALNRQHG TKFLAAMPTN LYGPGDNYDR
NGSHVLPALI RKVHEAIEGR QETVTVWGSG EPRREFLYSD DMADACVFLM ELAEETYDAF
VSDPERPPLL NIGCGEDLTI SALAHLVAKE LGYEGEIVFD PSKPDGTPRK LLDVSRLFQM
GWRPKMSLAA GIREAYADFK VRYSSIAAAS R