Gene Acid345_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1147 
Symbol 
ID4069956 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1429465 
End bp1430571 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content59% 
IMG OID637983157 
Productaldose 1-epimerase 
Protein accessionYP_590224 
Protein GI94968176 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2017] Galactose mutarotase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.934595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCCATA ACCGCATCTT GGCAGTCTTC TTCACGCTTT CTGTTCTCGC CTCACTTGCC 
CACGCCGCAA CCACCGTCAG CAAATCTGAG TTCGGCAAAA TGCCCGACGG CCGCTCCGTC
GACATCTATA CCCTCAAAGA CGGCGCCATC GAAGCCCGTA TCACCACCTA CGGTGCGCGC
ATCGTCTCCT TGCTTGCCCC CGACAAGAAC GGCAAAACCG CCGACATCAC CCTCGGCTAC
GACAACGTTG ACGGCTACGT CAAAGACGGT GCATCCTTCG GTTCGCTCGT CGGCCGCTAC
GCCGGTCGCA TCGGCAACGC AACCTTCAAC CTCGATGGCA AAGACTTCCA TACCCCCAAG
AACGACGGCC CCAACACCTT GCACGGCGGA CCCGAAAATT TTGGCAAGCA GCTTTGGACA
GGCAAGCAGA TTGCCAATGG CGTTGAACTG ACTTACGTCA GCAAAGATGG CGAAGCCGGT
TTCCCCGGCA CCCTGACCAC AGTCGTCCGC TACACGCTGA TCGGCAAAGA CCTCAAGCTC
GACATCTCCG CTGCCACCGA CAAGGACACC GTCCTCAACC TGACCAACCA CGCCTACTGG
AACCTGGCTG GTGAAGGTAG CGGCGACGTC GCCAAGCAGG AAGTGCAGAT CAACGCCGCG
AAAGTTGTCC CCGTAAACGA TGGCCTGATT CCCACCGGCA AACTCGCTGA TGTCGCCGGC
ACGCCCCTCG ATCTTCGCAA GCTCACTCCC ATCGGTGCGC ACGTTGACGA CAAGTCGAAC
GACCAACTCA AGTACGGCAT CGGCTACGAC ATCACCTACG TTCTCGACAA CAACGGTAAG
CTCGTGCCCG CCTCCGAAGC CTACGATCCT GCCAGCGGAC GCGTTCTCAC CGTGCTCACC
GACCAGCCCG GCCTGCATTT CTACAGCGGC AATCACATGG ACGGCGTAGC CGGCAAAGGT
GGACACAAAT ACGCCTTCCG CAATGCCTAT GCCTTCGAAG CCCAGAATTT CTCCGACGCT
CCGAACCAGC CCAACTTCCC CAGCGCCGTG CTGAAGCCCG GCCAGAAATT CCACCACATC
ATCATCTTCC GTTTCTCGAC GAAGTAA
 
Protein sequence
MLHNRILAVF FTLSVLASLA HAATTVSKSE FGKMPDGRSV DIYTLKDGAI EARITTYGAR 
IVSLLAPDKN GKTADITLGY DNVDGYVKDG ASFGSLVGRY AGRIGNATFN LDGKDFHTPK
NDGPNTLHGG PENFGKQLWT GKQIANGVEL TYVSKDGEAG FPGTLTTVVR YTLIGKDLKL
DISAATDKDT VLNLTNHAYW NLAGEGSGDV AKQEVQINAA KVVPVNDGLI PTGKLADVAG
TPLDLRKLTP IGAHVDDKSN DQLKYGIGYD ITYVLDNNGK LVPASEAYDP ASGRVLTVLT
DQPGLHFYSG NHMDGVAGKG GHKYAFRNAY AFEAQNFSDA PNQPNFPSAV LKPGQKFHHI
IIFRFSTK