Gene Sare_2173 details


Gene Information

Locus tag: Sare_2173
Symbol:
ID: 5704957
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2497669
End bp: 2498667
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 72%
IMG OID: 641271655
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001537026
Protein GI: 159037773
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
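
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2497669
end_bp = 2498667
gene_length_bp = 999
protein_length_aa = 332


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: 2498667 - 2497669 + 1 = 999 bp, and 999 / 3 - 1 = 332 aa.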


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0967474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0101731
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACAG TGCGCAGATG CCTGGTCACC GGCGGCTTCG GCTTCCTTGG TAGCCATGTG 
GTCGAACGGC TGCTGCACCG GGGGGACGAA GTCGTGGTCT ACGACCCGGC CGGACCACCA
CCGGACCTGC GCGCTCCCGC CGGCCGTCTT CGGCACGTGC CCGGCGACGT CCGGGACGCC
GAGCGCCTGA TCACCGCCGC CGAGGGCGTG GACGAGGTCT ACCACCTGGC AGCGGTCGTC
GGTGTCGACC GGTACCTGCG GCGGCCGCTC GACGTGGTGG AGGTCAACGT GGGCGGAACC
CACAACGCGC TGCGGGCGGC CCGGCGCGCC GGCGCGCGGA TCGTGGTGTC CAGCACCAGC
GAGGTCTACG GGCGCAATCC CCGCGTGCCG TGGCGGGAGG ACGACGACCG GGTGCTCGGC
AGTACCGCGA CCGACCGGTG GTCGTACTCG ACGAGCAAGG CAGCGGCCGA ACACCTCGCC
TTCGCGTACC ACCGACAGGA GGGGCTGCCG GTGACGGTGC TCCGGTACTT CAATGTCTAC
GGTCCACGCC AGCGTCCGGC GTACGTGTTG AGCCGTAGCA TCGTCCGCAT GCTGCGGGGC
GAACCGGCCG TGGTGTACGA CGACGGCCGG CAGACCCGGT GTTTCACCTG GGTGGACGAG
GCGGTGGAGG CGACGCTGTC GGCTGCGGGA CTGCCTCGGG CGGTCGGCGA GTGTTTCAAC
ATCGGCAGCA GCGTGGAGAC GACCATCGGC GAGGCGATCC GCATGGTCGG CAGCGTCGCC
GGCGCGCCCG GGCCGGCCCT GACCGTACCT ACCGGGGCCG GCCCGGGCGC TCACTACCAG
GACATTCCCC GCCGGCTCCC GGACTGCGGC AAGGCCGCGG CGCTGTTGGG GTGGCGGGCT
CGAATGCCGC TGCTGGAGGG CCTGGGCCGG ACCGTCGAGT GGGCCCGCCG AAATCCGTGG
TGGACGGCGC AGGCCGACGA CGGGCTGGGG GTCCGTTAG
 
Protein sequence
MTTVRRCLVT GGFGFLGSHV VERLLHRGDE VVVYDPAGPP PDLRAPAGRL RHVPGDVRDA 
ERLITAAEGV DEVYHLAAVV GVDRYLRRPL DVVEVNVGGT HNALRAARRA GARIVVSSTS
EVYGRNPRVP WREDDDRVLG STATDRWSYS TSKAAAEHLA FAYHRQEGLP VTVLRYFNVY
GPRQRPAYVL SRSIVRMLRG EPAVVYDDGR QTRCFTWVDE AVEATLSAAG LPRAVGECFN
IGSSVETTIG EAIRMVGSVA GAPGPALTVP TGAGPGAHYQ DIPRRLPDCG KAAALLGWRA
RMPLLEGLGR TVEWARRNPW WTAQADDGLG VR
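
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below builds the standard codon-to-amino-acid mapping (which translation table 11 shares for elongation codons) and treats a recognized initiator codon at the first position as methionine. The `START_CODONS` set is a deliberately small subset chosen for this illustration, and `translate_cds` is a hypothetical helper, not a function from any library.

```python
from itertools import product

# Standard codon table in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of initiator codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        # An initiator codon at position 0 is read as methionine.
        if i == 0 and starts_at_initiator and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
first_block = "GTGACGACAG TGCGCAGATG CCTGGTCACC"
last_codons = "GTCCGTTAG"

print(translate_cds(first_block))
print(translate_cds(last_codons, starts_at_initiator=False))
```

The first call yields MTTVRRCLVT, matching the first ten residues of the protein sequence, and the second yields VR, matching its last two residues before the TAG stop codon.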