Gene Sare_0286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0286 
Symbol 
ID5705259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp319652 
End bp320764 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content72% 
IMG OID641269813 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_001535208 
Protein GI159035955 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.494242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTAC GGGTGACCGG CAAGGTGGTG ACCCCGGCCG GGGTGATTCG CCAGGGCTGC 
GTGGAGATCA ACGGGGAGCG GATCACCGCT GTCGCCGAGT ACCCGTCGGT GCGGGACGGG
TACTGGATCC TGCCCGGCTT CGTGGACATG CACACCCACG GTGGTGGCGG GCACACCTTC
ACCACCGGCG ACGCCGACCA GGCCCGTGCC GCCGCCGGCT TCCACCTCCG GCACGGCACC
ACCACCCTGC TGGCCAGCCT GGTGAGTTCG CCGTTCGAGC TGATGCGCGC CGCCACCACG
GCCTACCGGC CGCTGGTCAC CGAAGGGGTA CTCGCCGGGA TCCACTTCGA GGGCCCGTAC
CTGGCGGCGG CCCGCTGCGG GGCACAGAAC CCGGCGTACC TTCGCGACCC GTCAACCGAC
GAGTTGACCG AGCTACTGGG GTTGGGCCAC GGCACGATCC GCATGGTCAC CCTGGCCCCG
GAGCGGGACG GAGCAACGGC GGCAATCAAG CTGCTCGCCG CACACGGCGT GGTCTCGGCG
ATCGGCCACA CCGACGCCAC GTACGAGCAG ACCCAGGCCG CCATCGCGGC CGGCGCGAGC
GTCGCCACCC ACCTGTTCAA CGGGATGCGC CCAGTGCACC ACCGCGAGCC GGGCCCGGTG
GTGGCCCTGC TGGAAGCCCC ATCCGTGGTC TGCGAGTTGG TCGCCGACGG GGTGCACCTG
CACGACGGCA TGCTCGGGTA CGTAACCACC ACGGCCGGCG TGGACCGGGC CGCCCTGATC
ACCGACGCGA TGGCCGCCGC CGGCATGCCC GACGGCGAGT ACGAGCTGGG CGGCCAGACC
GTCACGGTGA CCACCGGTGT GGCCCGGCTG GCCAACGATG GAGCGATCGC CGGCAGCACG
CTGACGATGG ATGCCGCGCT ACGGCACGCG GTCGCCACCG GGATCGCCGT CGCGGAGGCC
GCCCGGATGG TGTCCACCAC GCCCGCCCGT GCGATCGGTC TCGGCGATCG GGTGGGCGCA
CTTGCGCCCG GTCTCCGGGC CGACCTGGTG GTGCTCGACG ACGACCTGAA CGTGGTCCGG
GTCATGCGCG CCGGCTCCTG GTTGGACCAG TGA
 
Protein sequence
MTVRVTGKVV TPAGVIRQGC VEINGERITA VAEYPSVRDG YWILPGFVDM HTHGGGGHTF 
TTGDADQARA AAGFHLRHGT TTLLASLVSS PFELMRAATT AYRPLVTEGV LAGIHFEGPY
LAAARCGAQN PAYLRDPSTD ELTELLGLGH GTIRMVTLAP ERDGATAAIK LLAAHGVVSA
IGHTDATYEQ TQAAIAAGAS VATHLFNGMR PVHHREPGPV VALLEAPSVV CELVADGVHL
HDGMLGYVTT TAGVDRAALI TDAMAAAGMP DGEYELGGQT VTVTTGVARL ANDGAIAGST
LTMDAALRHA VATGIAVAEA ARMVSTTPAR AIGLGDRVGA LAPGLRADLV VLDDDLNVVR
VMRAGSWLDQ