Gene Sde_3007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3007 
Symbol 
ID3967729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3835207 
End bp3836481 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content44% 
IMG OID637922104 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_528476 
Protein GI90022649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0014113 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTAACA AGATACTCGT TGCAGTAGGA TTACTTGCGG CTAGCCTTTC TGTGCACGCC 
GCAACAAACC GCCCAAGTGG TTATACAACA ATTTGTAAGG TTGGTGAAAC ATGCTCGGTA
AGTCAGTCTA CGAATGTAGC CTTTGGCGCG TCTGGGCAGT TTGTGTATAA AGTATTAAAC
GGTAGCTTTT CTTGTAGTGT TTCTACGTTT GGTAGTGACC CTATTCCTTC TAAATCTGTA
AAAGAATGTT CAATCCCATC AAACGGCTCT AGCTCTTCTG GCTCGTCTTC ATCTTCGTCT
AGCAGCTCTT CCGGTAGCTC TTCTGGTGGT GGCTGTGGCA GCGGTGGTGG TTCTACGGTG
TGCTTATCGG CCTCGGGTTC TAGCAATGGT ATCAATTTAA GTTGGTCTGT ATCTGGTTCT
ATATCTTCCG TGCAGCTTTA TCGCGATACC GATTCAAACC CAAGCGGTCG CACGCGTATT
GCTAGTGTAT CTAGCTCTAC TACTAGCTTT AGTGATACCG GCGCGGCATC GGGCACCACT
TATTACTACT GGGTTAAATA TTATGTAAAT GGTACTGCTT ACAACTCGGG TGTTGCTTCT
GCGGTGCGCG GTTCTTCTAG CTCTAGTAGT TCAAGTTCTT CCAGCACTTC TAGCAGTTCT
GGTGGAAAAG GTTCTAGTTG TAGCTCTACT GGTAGCCAAT CTGTGTCTTC TACTATTAAG
GTAACCAGCG GTACTTACGA TGGTGGTTGT AAAACATTTA ACCCTACCAG TGCTTTGGGT
GATGGTAGCC AATCTGAAAG CCAAAAACCT GCTTTCCGTG TAGAAAACGG TGCAACGTTA
AAGAATGTAA TTATTGGCAA TAACGGTGTG GATGGTATTC ACGTTTACAA CGGCGGTACG
TTAAATAATA TTCTTTGGAC TAACGTAGGT GAAGATGCCA TGACCGTTAA GTCTGAAGGT
AACGTGACGG TAACCAATGT TGAAGGCTAT GACGGCGAAG ATAAGTTTAT TCAGGTAAAC
GCAGTGACTA ACTTAAAAGT TTCTAACTGT ATTGTGAATA AAATGGGTAA GTTTCTTCGT
CAGAATGGTG GTAAAACATT TGCCATGTCG GTAAGTGTAG ATAACTGCGA TATATCTAAT
ATGGGTGAAG GTATCTTCCG TTCAGACAGC CCGAACGCTA CAGCGGTTAT TACTAACAGC
CGTTTACGCA ACGCTGGGGA TATTTGTATT GGGGCTTGGA AAAGTTGTAA ATCTTCCAAT
ATCAGCAGCT TTTAA
 
Protein sequence
MFNKILVAVG LLAASLSVHA ATNRPSGYTT ICKVGETCSV SQSTNVAFGA SGQFVYKVLN 
GSFSCSVSTF GSDPIPSKSV KECSIPSNGS SSSGSSSSSS SSSSGSSSGG GCGSGGGSTV
CLSASGSSNG INLSWSVSGS ISSVQLYRDT DSNPSGRTRI ASVSSSTTSF SDTGAASGTT
YYYWVKYYVN GTAYNSGVAS AVRGSSSSSS SSSSSTSSSS GGKGSSCSST GSQSVSSTIK
VTSGTYDGGC KTFNPTSALG DGSQSESQKP AFRVENGATL KNVIIGNNGV DGIHVYNGGT
LNNILWTNVG EDAMTVKSEG NVTVTNVEGY DGEDKFIQVN AVTNLKVSNC IVNKMGKFLR
QNGGKTFAMS VSVDNCDISN MGEGIFRSDS PNATAVITNS RLRNAGDICI GAWKSCKSSN
ISSF