Gene Sare_2116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2116 
Symbol 
ID5704970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2437857 
End bp2438846 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content71% 
IMG OID641271601 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_001536972 
Protein GI159037719 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.463092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00350753 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGAGCTCAT CCAGGGTGGT GGTCACGGGC GGGTGTGGGT TCATCGGCAG TCACCTGGTG 
GACCAGTTGG TCAGGCGGGG CGACGACGTG GTGACCTTCG ACGGCGTAGC ACCCAGCACG
GGTGAGCGGC GTCCCGGTAC CACGGCGCGG CACATCGTCG GTGACGTCCG CGACCCCTCG
GGGCTCGCCC AGGCGATACA GCCCGGCGTC GACGTGGTCT ACCACATGGC CGCCGTGGTT
GGGGTGGACC AGTACCTGGC CCGGCCACTG GACGTCATCG ACATCAACCT CAACGGCACC
CGCAACGTCC TCGAACTGGC CGCCAGGGCC GGTGCACGGG TGATCGTGGC CAGCACCAGC
GAGGTGTTCG GCAAGAACCC GGCGGTGCCC TGGAAGGAGG ACGGCGACCG CGTCCTCGGC
CCGACCACGG CCGACCGGTG GGCGTACTCC TCCAGCAAGG CACTCGCGGA GCACCTGACG
TTCGCGTTCG CCCGCCAGCA CAGCCTGGCG GCCACCGTGG TGCGCTACTT CAACGTGTAC
GGGCCACGCC AGCGTCCCGC CTACGTCGTG TCCCGCAGCA TCCACCGAGC CCTCAACGGG
CTCGCCCCGG TGGTGTACGA CCAGGGCCGG CAGTCCCGCT GTTTCACGTA CGTGGCCGAC
GCGGTGGACG GGACCATGCT GGCCGCCGCT GCGCCGTCCG CCGTCGGTGA GGCGTTCAAC
CTCGGCAGCA TGCGAGAGAG CATGATCAGC GAGGTCGTCG AGCTGGTCGC CAAGTTGGCG
GGCGGCACCT CTACCACGTC GGTGGACACC GCGGCACGGC TCGGCGCCGC GTACCAGGAC
CTACCCCGGC GCGTGCCGGA CAACACCAAG GCCCGCACGA CTCTCGGCTG GGACTGTGCC
ACACTGCTGG AGGACGGCCT GGCGCGGACG ATCGAGTGGG CCCGCGCCAA CGCCTGGTGG
CTGGCACGGG CCGACACCGG CGCCGCGTGA
 
Protein sequence
MSSSRVVVTG GCGFIGSHLV DQLVRRGDDV VTFDGVAPST GERRPGTTAR HIVGDVRDPS 
GLAQAIQPGV DVVYHMAAVV GVDQYLARPL DVIDINLNGT RNVLELAARA GARVIVASTS
EVFGKNPAVP WKEDGDRVLG PTTADRWAYS SSKALAEHLT FAFARQHSLA ATVVRYFNVY
GPRQRPAYVV SRSIHRALNG LAPVVYDQGR QSRCFTYVAD AVDGTMLAAA APSAVGEAFN
LGSMRESMIS EVVELVAKLA GGTSTTSVDT AARLGAAYQD LPRRVPDNTK ARTTLGWDCA
TLLEDGLART IEWARANAWW LARADTGAA