Gene Acid345_0321 details

Gene Information

Locus tag: Acid345_0321
Symbol:
ID: 4068598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 347992
End bp: 349119
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 58%
IMG OID: 637982324
Product: aldose 1-epimerase
Protein accession: YP_589400
Protein GI: 94967352
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2017] Galactose mutarotase and related enzymes
TIGRFAM ID:
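
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions, although they agree with the values reported here. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 347992
end_bp = 349119
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp == gene_length_bp)
print(expected_protein_aa, expected_protein_aa == protein_length_aa)
```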


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.520613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCGAA TCCCAAAAGT TCTCGTGCTG ACGCTCCTTG CTATTCTTAC CGTCGGCATG 
GCGGAAGCGA AAACCAACGT GACTAAGCAA ACATTCGGCA AAGTTCAGGA CGGCACTGCC
GTCGACCTCT ACACCTTGAG CGACGGCCCG TACGAAGCCC GCATCATGAC CTACGGCGGC
GTGCTTGTTT CCTTCAAAGC GCCCGATAAA GCCGGCAAGA CTGCCGACGT GATCCTCGGC
TTCGACGATG CTGCCGGTTT CTATGACAAC TTCAACGGCG CGCACAATGC ATTTTTCGAC
GCCATCATCG GTCGCTACGC CAATCGCATT GGCAAAGGTG CATTCACTCT CGACGGGAAG
AAATACGACC TGCCGAAGAA CGATGGTCCG AACACGCTGC ATGGTGGCCC GCACGGTTTT
AACAACGTGG TGTGGCAAGG CAAGCAACTC CCGAACGGCG TGGAACTCAC CTACGTGAGC
AAAGACGGCG AGATGGGCTT CCCCGGGAAC ATGACCGCCA CCGTGAAGTA CACGCTCACC
AAGGGCGATT TGCGGATCGA GTACTCGGCG ACGACCGACA AGGCCACTGT CGTGAACCTG
ACCAATCACT CCTACTTCAA CCTGGCGGGC GAAGGGTCAG GCGACATTCT GAAACATCAG
CTCATGATCA ACGCCTCGAA AATCACGCCC GTGGACGCGA CTTTGATTCC GACTGGCGAG
CTGACTTCAG TCGACGGCAC GCCCTTCGAC TTCCGCAAAT CCACCGAGAT CGGCGCACGC
ATCAACAACG ACGATGAGCA ACTCAAGCGC GGCCACGGCT ACGATCACAA CTGGGTGCTC
GACTCAACAG GCGGTAAGCT TGCCGAGGCT GCAGAAGTGT ACGAGCCAAC TTCCGGCCGC
GTGCTGAAAG TACTCACCGA TCAGCCCGGC ATCCAGTTCT ACTCCGGCAA CTTCCTCGAT
GGCGCCGTAA AAGGCAAAGG CGGCAAGCCC TACACCCATC GCTCGGGATT GTGCCTGGAG
ACGCAGCATT TCCCCGACAC ACCCAACCAC GCGAACTTCC CGTCCGCCGA ACTGAAGCCG
GGACAGAAGT ACCACACCGT CACGGTCTTC AGTTTCTCGA CTCGCTAG
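
The GC content reported for this gene can in principle be recomputed from the sequence itself. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base block layout used here; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tooling. For brevity it runs on the first sequence line only; pasting the full sequence gives the gene-wide figure.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used as a small demonstration input.
first_line = "ATGCAGCGAA TCCCAAAAGT TCTCGTGCTG ACGCTCCTTG CTATTCTTAC CGTCGGCATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```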
 
Protein sequence
MQRIPKVLVL TLLAILTVGM AEAKTNVTKQ TFGKVQDGTA VDLYTLSDGP YEARIMTYGG 
VLVSFKAPDK AGKTADVILG FDDAAGFYDN FNGAHNAFFD AIIGRYANRI GKGAFTLDGK
KYDLPKNDGP NTLHGGPHGF NNVVWQGKQL PNGVELTYVS KDGEMGFPGN MTATVKYTLT
KGDLRIEYSA TTDKATVVNL TNHSYFNLAG EGSGDILKHQ LMINASKITP VDATLIPTGE
LTSVDGTPFD FRKSTEIGAR INNDDEQLKR GHGYDHNWVL DSTGGKLAEA AEVYEPTSGR
VLKVLTDQPG IQFYSGNFLD GAVKGKGGKP YTHRSGLCLE TQHFPDTPNH ANFPSAELKP
GQKYHTVTVF SFSTR
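
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table from the compact NCBI layout (bases in the order T, C, A, G). Translation table 11 uses the same amino acid assignments as the standard code and differs in which codons may act as start codons; this sketch ignores alternative start codons, which is sufficient here because the gene begins with ATG. The function name `translate` is illustrative, and unrecognized codons are mapped to "X" as an assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for codons with unexpected letters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence.
print(translate("ATGCAGCGAA TCCCAAAAGT TCTCGTGCTG"))  # MQRIPKVLVL
print(translate("ACTCGCTAG"))  # TR
```

Both outputs match the start (MQRIPKVLVL) and end (TR) of the protein sequence listed in this record.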