Gene Shew_1117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_1117 
Symbol 
ID4921047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp1291843 
End bp1292973 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content59% 
IMG OID640162650 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_001093247 
Protein GI127512050 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0425405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACAGA CACTGATTGC CGAGCGAGTA TTTGACGGCG AACACTTCCA CCATAACCAG 
GCCATCACCA TAGAGGACGG GCGTATCGTC TCTTTCGATA GGGCTAGCGA TGTCGAGCCG
ATTCTGCTGG CCGGCACCCT GGTGCCTGGC TTTATCGATG TTCAGGTCAA TGGCGGCGGC
GGTGCCCTGT TTAACGACGC ACCCAGCGTC GAGAAGATAA AGACCATAGG CCAGGCCCAT
GCCAGATTTG GCACCACGGG CTTCCTGCCG ACCCTGATCA CCGACGAAAT TGAGGTGATG
CGCGCCGCCG CCGATGCGGT GGCCGAGGCG CTGGCGCTGA ATACGCCAGG CGTGCTGGGG
ATCCATTTCG AGGGGCCGCA TCTCAGCGTG CCCAAGAAGG GTGTCCATCC GGCCAATTAT
ATTCGTCGTA TCTCTGATGA AGAGCTGGCG GTTTTTGCCC GTAATGATCT GGGGACTAAG
GTGGTGACAC TGGCGCCTGA GAACGTCGCA CCAGAGGTGA TTCATGCCCT GGTCGAGTGC
GGCGTCAGGG TTTGTCTCGG TCACTCTAAT GCCGACTATG ACACTGTTAT TAAGGCGCTT
GAGGCCGGGG CAACCGGTTT TACCCACCTG TTTAACGCCA TGTCGCCCAT GGATTCCAGG
GCGCCGGGCA TGGTGGGCGC GGCGCTGGAG AGCCAGGATG CCTGGTGCGG CCTGATTGTC
GATGGTCACC ACGTGCATCC GGCCTCGGCT AAGGTAGCCA TCGCCGCTAA GCCAAGGGGC
AAGGTGATGC TGGTGACAGA CGCCATGCCG CCTGTGGGCA TGGATGATAA CGCCAGCTTC
GAGCTGTTTG GCACCCAGGT GGTGCGCCGC GGTGACAGGT TAAATGCCGT AACCGGCGAG
CTTGCCGGAT GCGTGCTGGA TATGATTGGT GCGGTCAACA ACAGCGTTAG CATGCTAGGC
GTGGCCCACG AAGAGGCGCT GCGGATGGCG GCTAGATATC CTGCCGAGTT TATCGGTCAT
CGTCAGCGCG GAGTGTTTAC CATAGGCGCC AGGGCGGATA TGGTGTTGCT GGGCAGCGAT
AATCAAGTGG CGCGCACCTA CATAGATGGC CAGTTGGTCT ATCAGGCATA G
 
Protein sequence
MKQTLIAERV FDGEHFHHNQ AITIEDGRIV SFDRASDVEP ILLAGTLVPG FIDVQVNGGG 
GALFNDAPSV EKIKTIGQAH ARFGTTGFLP TLITDEIEVM RAAADAVAEA LALNTPGVLG
IHFEGPHLSV PKKGVHPANY IRRISDEELA VFARNDLGTK VVTLAPENVA PEVIHALVEC
GVRVCLGHSN ADYDTVIKAL EAGATGFTHL FNAMSPMDSR APGMVGAALE SQDAWCGLIV
DGHHVHPASA KVAIAAKPRG KVMLVTDAMP PVGMDDNASF ELFGTQVVRR GDRLNAVTGE
LAGCVLDMIG AVNNSVSMLG VAHEEALRMA ARYPAEFIGH RQRGVFTIGA RADMVLLGSD
NQVARTYIDG QLVYQA