Gene Smed_3410 details

Gene Information

Locus tag: Smed_3410
Symbol:
ID: 5324294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3617383
End bp: 3618543
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 66%
IMG OID: 640792361
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_001329066
Protein GI: 150398599
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
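
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are assumed 1-based and inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(3617383, 3618543)
print(gene_length)                  # 1161
print(protein_length(gene_length))  # 386
```

Both results agree with the Gene Length and Protein Length fields of the record.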


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGGA ACAAGACGAT CACCGGAGCA CGGGTTTTCG ACGGCATCGA CTGGCACGAC 
GGCGCCGCCC TCGTGGTCGA GTCAGGGCAC GTGAAGTCGA TTGTGCCGGC GGGGAGCGTA
GCCGTCGGTG GCGAGACCGT CGACGCCCAT GGCCTGCTTC TCGTACCCGG CTTCATCGAT
CTTCAGGTGA ATGGCGGCGG CGGCGCACTC CTGAACGAAG AACCTACCCT CGCAGGCATC
CGGCAGATCT GCTCGGCGCA TGCGACATTC GGTACGACGG CGCTGCTGCC GACGCTGATC
ACCGATACCC GCGCCGTCAG GACCGCGGCG ATAGCGGCAG GCCTCGAGGC TAAAGGAGCC
GCGGTGCCGG GCTTCCTCGG CCTGCATCTC GAAGGCCCTC ATCTCTCGGT CGCGCGTAAG
GGGGCGCACG ATCCCGCGCT GATCCGCCGG ATGGAGGACG ACGATCTCGC CGAGATACTT
GGCTGCGCAA AGGCGCTCGG CCGCCTGATG CTGACCGTGG CGCCGGAAAA TGCCACAAAG
GAGCAGGTTC GGGCGCTGGC CGATGCCGGG GTCGTGGTGA GCCTTGGCCA TACCGATGTG
GATTACGATA CCGCCCGCGC CTATGCCAAA GCGGGAGCGA GAACCGTCAC GCACCTCTTC
AACGCCATGA GCGGGCCTGG TCACCGTGAG CCGGGCGTTG TCGGTGCCGC TCTGGCGACG
GGCGCTCTCC ATGCCGGCAT GATCGCCGAC GGCTATCATG TCCACCCGGC TTCCATGGGC
ATAGCATTGC GCGGCAAGAA GGGACCGGGG CAGATCTTTC TGGTCACCGA CGCCATGTCG
CCCCTCGGCA CTGACCAGAC GAGCTTCTTC CTCAACGGAC GAAAAATCCT GCGGCAGGAC
GGCCGCCTGA CTCTCGCCGA CGGCACCCTC GCCGGCGCCG ATATCGATAT GTTGTCTTCT
GTTCGTTTCG TCCACCAGAG GCTCGGCCTT CCGGTCGAGG AGGCGATCCG CATGGCGTCC
GCCTATCCCG CCGACGCCAT GGGAATAGCC TCGCACAAGG GCCGGCTCCT GCCGGGTGCG
GATGCCGATT TCGTGCTGCT CACGCCGGAG CTCGGCATCA GATCGACCTG GATCGGCGGA
GAAAGAGTCT TTGCCGCTTG A
 
Protein sequence
MNGNKTITGA RVFDGIDWHD GAALVVESGH VKSIVPAGSV AVGGETVDAH GLLLVPGFID 
LQVNGGGGAL LNEEPTLAGI RQICSAHATF GTTALLPTLI TDTRAVRTAA IAAGLEAKGA
AVPGFLGLHL EGPHLSVARK GAHDPALIRR MEDDDLAEIL GCAKALGRLM LTVAPENATK
EQVRALADAG VVVSLGHTDV DYDTARAYAK AGARTVTHLF NAMSGPGHRE PGVVGAALAT
GALHAGMIAD GYHVHPASMG IALRGKKGPG QIFLVTDAMS PLGTDQTSFF LNGRKILRQD
GRLTLADGTL AGADIDMLSS VRFVHQRLGL PVEEAIRMAS AYPADAMGIA SHKGRLLPGA
DADFVLLTPE LGIRSTWIGG ERVFAA
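
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, assuming NCBI translation table 11 (the table named in the record), whose codon-to-amino-acid assignments match the standard genetic code. The special handling of alternative start codons in table 11 is not modelled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGGGA ACAAGACGAT CACCGGAGCA"))  # MNGNKTITGA
# Last 21 bases of the gene sequence above (in frame, ending in the TGA stop).
print(translate("GAAAGAGTCT TTGCCGCTTG A"))           # ERVFAA
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence.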