Gene Sare_0415 details

Gene Information

Locus tag: Sare_0415
Symbol:
ID: 5708229
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 475061
End bp: 476137
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 72%
IMG OID: 641269940
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001535335
Protein GI: 159036082
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
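
The start, end, gene length, and protein length fields above can be cross-checked against one another. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and does not encode a residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(475061, 476137)
print(length, protein_length_aa(length))
```

Under these assumptions, the coordinates listed for Sare_0415 agree with the reported gene length of 1077 bp and protein length of 358 aa.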


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00302013
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCCCCG GTGGCCCCTC CGGTGCCCCG GGGGTCGTCG TCGTGACTGG GGTCAGCCGC 
TACCTGGGCG CCCACGTCGC CGCGCGGCTC GCCGCCGACC CGCGTATCGG GCGGGTTATC
GGCGTCGATC CGCCCGAGTC GGGCGGGGAA CTCACCGGTC TGCTCGACCG GGTCGAGCGG
GTACGGGCGG ATGCTGGTTC CATCGGTGGC CTGCTCGCCG ACCTCGACGT GGACGCGGTC
GTGCACCTGG CCCTAGTCAG TGCCCCCGAT CCGCAGCACG GGGGCCGTGC GGCGATGAAG
GAGCAGAACG TCATCGGCAC GATGCAACTG CTCGCTGCCG CGCAGCATGC CCCCCGGCTG
AACAAGCTCG TGGTCCGTTC CTCGACCGCG GCATACGGGG CGTCGTTCCG CGACCCGGCC
GTCTTCACCG AGGAGACCGA GCCGCGCGAG GTGCCGCGTG GTGGCTTCGG CCGGGACATC
CTGGATATCG AAGGGTATGT GCGAGGTTTC CGTCGCCGTC GGCCCGACGT CACCGCCACG
GTGCTGCGGT TCGCGCCGTT CCTCGGCTCG ACCGCCGACA CCACGCTCAC CCGCTATTTC
GCCCAACCGC TGATCCCCAC CGTGTTCGGC CGTGACCCCC GGCTGCAGTT CCTGCACTTC
GAGGATGCGT TGGAGGTGCT GCACCAGTCG ATCGTCATGG CCCATCCCGG CACCTACAAC
GTGGCCGGTC CCGGAGTGCT CGCCCTCTCC CAGGCCATTC GGCGGGCCGG CCGGGTAGGG
GTGCCGGTGC TGGAACCGGG CCTGTCCGGG GCGGCTGCGC TGGCCCGCGC TCTCGGCTTC
GGCCGCTACG GGCTGGACCA GGTCGACCTG TTCGTGCACG GCCGGGTCGT GGACACGACC
CGGCTCGAGC GGGAGTTCGG CTTCACACCA CGCTCGACGG CCACGGCGTT CGACGACTTC
ATCCGCGCCC ACCGCGGTGG CGTCGTGCTG ACCCGGGAGC GGCTTGCCGC CGCCGAGCGG
CTGGTGCTCG ACGGGGTCCG GCAGGTCCGC TCCGCGGCCG CTCGGGAGCG GCCGTGA
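
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before a figure such as the GC content can be recomputed. A minimal sketch follows, assuming GC content is defined as the fraction of G and C bases over the full sequence length; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example on the first two blocks of the sequence only.
first_blocks = clean_sequence("ATGACCCCCG GTGGCCCCTC")
print(first_blocks, gc_fraction(first_blocks))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information table.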
 
Protein sequence
MTPGGPSGAP GVVVVTGVSR YLGAHVAARL AADPRIGRVI GVDPPESGGE LTGLLDRVER 
VRADAGSIGG LLADLDVDAV VHLALVSAPD PQHGGRAAMK EQNVIGTMQL LAAAQHAPRL
NKLVVRSSTA AYGASFRDPA VFTEETEPRE VPRGGFGRDI LDIEGYVRGF RRRRPDVTAT
VLRFAPFLGS TADTTLTRYF AQPLIPTVFG RDPRLQFLHF EDALEVLHQS IVMAHPGTYN
VAGPGVLALS QAIRRAGRVG VPVLEPGLSG AAALARALGF GRYGLDQVDL FVHGRVVDTT
RLEREFGFTP RSTATAFDDF IRAHRGGVVL TRERLAAAER LVLDGVRQVR SAAARERP