Gene Sterm_3701 details

Gene Information

Locus tag: Sterm_3701
Symbol:
ID: 8599147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3926402
End bp: 3927550
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: N-acetylglucosamine-6-phosphate deacetylase
Protein accession: YP_003310466
Protein GI: 269122289
COG category:
COG ID:
TIGRFAM ID:
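
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record.
start_bp, end_bp = 3926402, 3927550
gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1149 382
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.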


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0437627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCTAA AAAACGCCGA TGTTTTTCAG GAAAATGGTA ATTTTTTACA GCATGACATC 
TTCATTAACG GAGAATACAT AACAGATCAT AAACCGGAAC AAACCGCAGA AGACAGTACA
GTAATTGATG CTGACGGTTT ATATGCCATA CCCGGTTTGA TTGATATTCA CTTTCACGGA
TGCATGAACA GGGATTTCTG TGATGCAGAC CATGAATCTA TCAGAATAAT ATCCGAATAT
CAGTTGAATA ACGGTATCAC ATCTATACTT CCCGCCACGA TGACATTAGG CGAAGAAGAT
TTATGCAATA TCTGCCGCAC TGCATATACA TACAAAGGAA ACACAGGTTC CGAAATTCTG
GGAATTAATC TTGAAGGACC CTTTATCTCT GAATCCAAAA GAGGTGCACA GGATTCTTCA
TTCATACTAA AACCCGACAC TGCAGTATTC AAAAAATTCC AGGAAGCAGC AGGCGGAATG
ATAAAAATTG CCTGTATTGC CCCTGAAGAA GAGAACGGCA TGGAATTTAT CGAAGAACTA
AAAGATGAAG TAATATTATC AATAGCCCAT ACTGCCGCAG ATTATGAAAC AGCAGTAAAA
GCATTTGAAA AAGGCGCTGT CCATGTGACG CATCTTTACA ATGCAATGCC CGCGTTTCAC
CACCGTTTTC CCGGAGTAGT GGGGGCAGCC CGTCAGAATG AAAGCTGCTT TGTGGAACTC
ATATGCGACG GTGTTCTCCT GCATCCAAGC ACAATAAACA GTACATTCAA AATGTTCGGA
GACAACAGAG TCATAATGAT CAGCGACAGC GTCATGGCAG CAGGTATGCC GGAAGGAAGC
TACACACTCG GCGGACAGAA AATAACAGTA ACAGGAAAAA CTGCTACTGT AGATGCAACA
GGTGCCTTAG CCGGCTCAGT AAGCAATTTG ATGGAATGTA TGTGTTTATG TGTAAGAGAA
ATGGGAATTC CTCTTGGAAG TGCGGTAAAA GCCGCTTCAT CGAATCCGGC AAAAGCTTTG
AGAATATATG ATAAATATGG AAGTATTTCT CATGGAAAAT ATGCCGATAT TGTATTGTTA
GACAGAGATT TGAATATAAG AAAAATTATT TTCAGAGGAA AATTATTAGA CAGAAGGAGA
AATTTATGA
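
The gene sequence is printed in space-separated blocks of 10 bases, so whitespace has to be removed before any computation such as GC content. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first printed row is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed row of the gene sequence only; paste the full listing
# to compare against the GC content reported in the record.
first_row = "ATGATTCTAA AAAACGCCGA TGTTTTTCAG GAAAATGGTA ATTTTTTACA GCATGACATC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

A single row is not representative of the whole gene, so its value should not be expected to match the record's overall figure.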
 
Protein sequence
MILKNADVFQ ENGNFLQHDI FINGEYITDH KPEQTAEDST VIDADGLYAI PGLIDIHFHG 
CMNRDFCDAD HESIRIISEY QLNNGITSIL PATMTLGEED LCNICRTAYT YKGNTGSEIL
GINLEGPFIS ESKRGAQDSS FILKPDTAVF KKFQEAAGGM IKIACIAPEE ENGMEFIEEL
KDEVILSIAH TAADYETAVK AFEKGAVHVT HLYNAMPAFH HRFPGVVGAA RQNESCFVEL
ICDGVLLHPS TINSTFKMFG DNRVIMISDS VMAAGMPEGS YTLGGQKITV TGKTATVDAT
GALAGSVSNL MECMCLCVRE MGIPLGSAVK AASSNPAKAL RIYDKYGSIS HGKYADIVLL
DRDLNIRKII FRGKLLDRRR NL
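
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a minimal sketch, the Python code below builds the codon table from the conventional 64-letter amino acid string (tables 1 and 11 share these codon assignments and differ in which start codons are permitted) and translates up to the first stop codon. It does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; identical for NCBI tables 1 and 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGATTCTAA AAAACGCCGA TGTTTTTCAG"))  # MILKNADVFQ
print(translate("AATTTATGA"))  # NL
```

Both fragments reproduce the corresponding ends of the protein sequence: it begins with MILKNADVFQ and ends with NL, followed by the TGA stop codon.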