Gene Acid345_2838 details


Gene Information

Locus tag: Acid345_2838
Symbol:
ID: 4070357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3375890
End bp: 3376864
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 57%
IMG OID: 637984856
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_591913
Protein GI: 94969865
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
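
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated in the record), and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record: Start bp 3375890, End bp 3376864.
length = gene_length_bp(3375890, 3376864)
print(length, protein_length_aa(length))  # prints "975 324"
```

Under these assumptions the result agrees with the listed Gene Length (975 bp) and Protein Length (324 aa).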


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.904735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAACA CTTTCTGGCG CGATCGCTCG GTCTTGGTTA CGGGCGCCAC CGGCCTTCTC 
GGCGGATGGC TCACGCGACA CCTGCTCGAG CAAGGTGCTT CAGTCACGGC GCTGGTGCGC
GATTCCGTGC CGCAGTCGGA GTTTGAACGC TGCCTGATGC GCCAGCGCGT GAACGTGGTG
CAGGGAGATT TGAGCAAGCC ACAGTTGCTG GAGCGGGTGC TTGGCGAATA CGAGGTCGAG
ACCGTTTTCC ATCTCGCGGC ACAGACGATT GTTGGGATCG CGAACCGCAA TCCGGTCTCG
ACGTTTGAGA GCAATATTCG CGGGACGTGG AATTTGCTGG AAGCGTGTCG TCGTTCGCCG
AACGTCAGCG CGATTGTGCT GGCGTCGTCC GACAAAGCCT ATGGCGATCA GACGGTACTT
CCGTACACCG AAGATATGCC ACTGCAAGGC CGCCATCCCT ACGACGTAAG CAAGTCGTGC
GCCGACCTTA TTGCGCAGTC GTACGCGCAT ACGTTTCGTG TGCCGGTGGC GATAACGCGT
TGCGGGAATT TTTATGGTGG TGGGGACCTC AACTGGAACC GCGTGGTCCC GGGCACGATT
CGTTCCGTAT TTCGTGGAGA GCGTCCGATT ATTCGCAGCG ATGGAAAGTT TGTGCGCGAC
TACTTCTATA TAGAGGATGG CGCGGCGGCT TACATGCTGC TTGCCGAGCG ACTCACGGTC
GATAAAAAGT TGATTGGCTC GGCGTTTAAT TTTTCGAACG AAGCGCAAAT CAACGTACTC
GACCTGGTGA ACACGATCCT TCAGAAGATG AACTCGAACC TGAAACCGGA GATCCAGAAC
CAGGCGAATA ACGAGATCCG GCACCAATTC CTGAGCGCCG AACGCGCGCG CAAGCAGCTC
AACTGGCGGG CGCAGTACAC GCTTGACGAA GGCCTGGAGC GCACGATCGC CTGGTACAAA
GAGGTATTGC AGTAG
 
Protein sequence
MTNTFWRDRS VLVTGATGLL GGWLTRHLLE QGASVTALVR DSVPQSEFER CLMRQRVNVV 
QGDLSKPQLL ERVLGEYEVE TVFHLAAQTI VGIANRNPVS TFESNIRGTW NLLEACRRSP
NVSAIVLASS DKAYGDQTVL PYTEDMPLQG RHPYDVSKSC ADLIAQSYAH TFRVPVAITR
CGNFYGGGDL NWNRVVPGTI RSVFRGERPI IRSDGKFVRD YFYIEDGAAA YMLLAERLTV
DKKLIGSAFN FSNEAQINVL DLVNTILQKM NSNLKPEIQN QANNEIRHQF LSAERARKQL
NWRAQYTLDE GLERTIAWYK EVLQ
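
The gene and protein sequences above are printed in space-separated blocks of ten. As a rough sketch of how they could be compared, the code below strips the whitespace, computes a GC fraction, and translates codons with the codon assignments of the standard genetic code, which translation table 11 shares. The helper names are hypothetical, the sketch does not handle alternative start codons (this gene starts with ATG), and it is not the method the database itself uses.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
print(translate("ATGACAAACA CTTTCTGGCG"))  # prints "MTNTFW"
```

Passing the full gene sequence to `translate` and the protein listing to `clean` would allow a direct string comparison, and `gc_fraction` applied to the full gene sequence can be set against the GC content field.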