Gene Acid345_2247 details


Gene Information

Locus tag: Acid345_2247
Symbol:
ID: 4072992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2668149
End bp: 2669222
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 62%
IMG OID: 637984263
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_591322
Protein GI: 94969274
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
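
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2668149
end_bp = 2669222
gene_length_bp = 1074
protein_length_aa = 357

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_aa = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks print True for this record.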


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACTC GACCGGCAGT GATCGTCACC GGAGTTGCAG GAAACCTCGG GCGACGCTTG 
CTGCTGCAGC TTGGCGACTT CGACGTGACC GGAGTGGATG TTCGCGCTCC GGAAGGCGCG
TCTTTAGGCC GCTTCGAACA AATGGACCTG GGGACTGAAT CCGCCTCCAC TCAGCTCATT
GATCTTCTCC GCGCGACCGG GGCCACTACG GTTGTGCATC TGGCGTTCAT CGTGGATCCG
CTACGCTCCG GGGTGCTCGA CGAGCAGCAC ATGTGGCAAG TCAACGTGGC GGGCACGGCA
CGCGTGATGG AGGCCATCAG CGTGGTGAAC CGCTACGGTG GCGCGGTGTC GAAGTTCATC
TATCCCAGCA GCGTTGCCGT TTATGGGCCG GAGACTGACG ACCTCGTTGA CGAGAACAGT
CCGCTGAAGG CGCGGAGCCT GCCGTATGCA CTCCACAAGC AGGAATGTGA GGAGGTCGTG
CGTTACCGCC AGGAGTGGAT GACCGGCTGT CGCACGTACA TGTTGCGGCC GCACATCTTC
GCAGGCGCAA CCGTCGAGAA CTACATGATC GGCACGCTGC GCGGCACGTT CTTCGGCAAC
GGCAAACGCG CTGCCCGCAT GAAGGACGAA GGCAAGCGGC TCCCAGCATT ATTGCCGTGG
GGCAAGCAGT ATCTCGAGAA GAAGATTCAA TTCGCGCACG TGGACGACGT CGCGCGCCTC
ATCGCCCACC TGCTGCGAAG GCCTCCGGAT AACGATCCGC AATTGACGGT GTTGAATGTC
GCCGGACGCG GAGAGCCGCT CACGATTCAA CAGTGCACGC AGATTGCCGG CACGAAAATC
CGTCGCGTCC CGAGCCAGCG CATTGCTCGC ATCATTGCGC AGAAGATGTG GGATTGGGGC
ATCTCGGGCG CCCCGCCGGA GGCGCTGCCC TACATGATGG GCTCGTACAC GATGAACACC
TCGCGCCTGA AGGCTTTCCT CGGCGCCGAG TACGAGAACG TGATCCAGTT CACCGTGGAA
GGCGCGCTAC GCGACAGCGT GCAGGCAAAC GCCGAATCCG CATCCGCGAG CTAG
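
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage, which could then be compared with the GC content field in the record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = "ATGAGCACTC GACCGGCAGT"
cleaned = clean_sequence(sample)

print(len(cleaned))
print(gc_percent(cleaned))
```

To check the full gene, paste all lines of the gene sequence into `sample`; `clean_sequence` handles the line breaks as well as the spaces.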
 
Protein sequence
MSTRPAVIVT GVAGNLGRRL LLQLGDFDVT GVDVRAPEGA SLGRFEQMDL GTESASTQLI 
DLLRATGATT VVHLAFIVDP LRSGVLDEQH MWQVNVAGTA RVMEAISVVN RYGGAVSKFI
YPSSVAVYGP ETDDLVDENS PLKARSLPYA LHKQECEEVV RYRQEWMTGC RTYMLRPHIF
AGATVENYMI GTLRGTFFGN GKRAARMKDE GKRLPALLPW GKQYLEKKIQ FAHVDDVARL
IAHLLRRPPD NDPQLTVLNV AGRGEPLTIQ QCTQIAGTKI RRVPSQRIAR IIAQKMWDWG
ISGAPPEALP YMMGSYTMNT SRLKAFLGAE YENVIQFTVE GALRDSVQAN AESASAS