Gene Sare_3491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3491 
Symbol 
ID5704762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4026497 
End bp4027774 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content69% 
IMG OID641272918 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_001538284 
Protein GI159039031 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0507934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC ACCTCCCCAC CCGGCCGGCG AACTCACCCG ACCAGATCAC ATCCACCCCC 
CATCAGCGTC TCCTCGTCAC CCAGGCCGAC GCACTGCTTC GCGCCGCCCG TGCCTCGGTG
CGGCCCGAGG GAGGGTTCTG GTGGGTGACC GAACGTGGCG AACCCGATCC CCGTGAGCCG
CTCCACACCT GGATCGCCTG TCGGATGACT CATGTCTTTG CCCTCGCCCA CCTCGGACAC
GCCCCCGACA CCGCCCATGG GGTCGACCAC GGTGTCGCCA CGCTGCGCGG CACGCTGCGT
GACGCGCGGC ACGGCGGCTG GTTCAGCGCG GTGGATCTGA GTGGTGAACC GGTCACCGAC
CGGAAATCCG CGTACGAGCA CGCTTTTGTC CTGCTGGCGG CGAGCAGCGC CACCCGTGCC
GGTAGGCCGG GCGCCGAACA GCTACTGGAC GAGGCGCTCA ACGTGGTACA CGACCGGTTC
TGGGACGAGG CCGCCGGCCG GACCCGGGAG TCCTGGAACC GTGACTGGTC CGAGTCGGAG
CCCTACCGGG GCGCGAACAG CAGTATGCAC ATGGTCGAGG CGTTCCTGGC CGCCGGCGAC
GTCACTGGGG ACCGCCGCTG GGCGCAACGC GCGCTGGCGA TCTGCGATCA CCTGGTGCAC
GACGTGGCCG CCCGACACCA CTGGCGGCTG CCCGAGCACT TCACCACCGA CTGGGAACCG
CAGTTGGACT ACAACCTCGC ACAGCCCGCG GACCCCTTCC GGCCATACGG ATCCACCGTG
GGCCACTGGC TGGAGTGGGC CCGACTGCTG CTGCATCTCG AAACCGCGCT CGCGGCGCCA
CCTGCCTGGC TACTCGATGA CGCGCGTGCG CTGTTCACCG CGGCGGTCAC CCGTGGCTGG
TCGGTCGACG GCGCGGACGG CTTCGTCTAC ACCCTCGACT GGACGGACCA GCCGGTCGTC
CGTTCCCGGA TGCACTGGGT GCTCGCCGAG GCGATCGGGG CCGCTGCCAC GCTCTGGCGT
CGCACCGGCG ACGAACACTA CGAGCACTGG TACCACGTCT TCTGGGACTA CGCTGGCCGC
CACCTCATCG ATGAGGACAC CGGACAGTGG CGCCACGAGC TGGACGAGAC GAACCAGCCG
GCGAGTCTGG TCTGGCACGG CCGTCCCGAC GTGTATCACG CCTACCAGGC CGTCCTTTTG
TCCCAGTCAC CGATCACGCC CAGCCTCGCG GGTCTGTTCG CTCCCGCGCC GACGAACCAC
GTGGAGGAGG ACCGGTGA
 
Protein sequence
MTEHLPTRPA NSPDQITSTP HQRLLVTQAD ALLRAARASV RPEGGFWWVT ERGEPDPREP 
LHTWIACRMT HVFALAHLGH APDTAHGVDH GVATLRGTLR DARHGGWFSA VDLSGEPVTD
RKSAYEHAFV LLAASSATRA GRPGAEQLLD EALNVVHDRF WDEAAGRTRE SWNRDWSESE
PYRGANSSMH MVEAFLAAGD VTGDRRWAQR ALAICDHLVH DVAARHHWRL PEHFTTDWEP
QLDYNLAQPA DPFRPYGSTV GHWLEWARLL LHLETALAAP PAWLLDDARA LFTAAVTRGW
SVDGADGFVY TLDWTDQPVV RSRMHWVLAE AIGAAATLWR RTGDEHYEHW YHVFWDYAGR
HLIDEDTGQW RHELDETNQP ASLVWHGRPD VYHAYQAVLL SQSPITPSLA GLFAPAPTNH
VEEDR