Gene Sterm_2017 details


Gene Information

Locus tag: Sterm_2017
Symbol:
ID: 8597483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 2147135
End bp: 2148790
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 30%
IMG OID:
Product: Uracil DNA glycosylase-like protein
Protein accession: YP_003308803
Protein GI: 269120626
COG category:
COG ID:
TIGRFAM ID:
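
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2147135
END_BP = 2148790


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1656 551
```

Under these assumptions the computed values agree with the Gene Length (1656 bp) and Protein Length (551 aa) fields.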


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.490669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTAA TATTGGATAA AAAGTTAATA AATGAAAAAG GATTAAAGGA TTTTGATCTG 
GATCCTGTTA ATAAAGACCT TGTTGCTGTG GGGAAGAAGC TTTATTTTGT ATCGCAGGAT
TTGGAAGGAA AAGTAATTAT TAAGGAACTC GGCGGAAAAT TGAAGAATAT AGAGGGAGTA
AAATTTATAA AGGAAGAAAA TCAGCTCTTT GTATCAAATT CTTTTCTTGT TCTGACTATG
TACGGGGAGA TTTTCAAATA TTATGACAGA AAACATAAAG CGTCCAAAAC AGTGTTTTCT
ATGGAAAGAA CACCTGATTA TATAAATTTT ACCACAAATG GGAAGATAAT ATATCTGATG
GACGATACTC TTTATTCATA TAATCCTAAT TCCGAAATGA CTATAAAAAA GCCTGTTATT
AATAAAAATA ATGAAAACAG AGGGAAATAC AAAATATATG TAAACGGTGA AAATATAGTA
TTAAAGCATC GTGCACTTCA TTCACAGGAA AATACCATAA GTATTTTTGA TGAGAAGCTG
GAAGAGATTT TTAATATAAA AACTGTAAAA AATCATATAT ACTCAAGTAT ATCAGAACTT
CAATATATTG CAGGCACAGA AGACGGAGAA GTGGAAATAT GGGATGTGAT CACAAAAGAG
CTTTATAATT CCGTGAAGAT AAGCGACTAT CGCATATCTT ACATTGAAAA GACAAAGGAA
AATTACCTTC TAGGTCTCTC TTCGGGAGAA TTAATCATAA CAGACGAAAA ATTCAGAATA
GAGAAAAAGC TGAATCTTCA TAAAGGCGAT ATTTTGAAAA TAAAGGCCAA TGATGAAAGA
ATATTCACAC TTGGAATGGA TTATAATATA TTAAGTCTGA AAATATTGAA AAATGAGGAA
ACTGATATTG AAAGACGCGG CTTTATGCAG GAATATAATA TAAATGACGA ATATTTTGAG
TTTTTTACTT ATGAAAGAAT AGAAGCTGTA AGAAATTTTA TAAGAGAATT AAAAATAAAA
AATATATCAT ATAATCCAAA GGAAAATCTT ATATTTAAAG TATTTTCAGA GCCGCTTTCG
GAACAAAAAA TATGTATACC GGTAAAAGAG CCGTATACTC AGGGAAATAC CGCAACAGGA
CTTGCATTGG AAATGGAAAA AAATTCATGG ACTGATCCCG AACTGAATAA TTCTTTGAGA
AACATACTGA AATTGCTTTA TAAAACATAT ATGGGCACTT CAAAAGATTT GAATTATATA
AGGGAAGATA TAGAAAAGCA TATATTTAAT ATTCTTCCTC CGGACAAAAT TTTTAAATAC
TGGCAGAAAA ACGGAGTACT TCTTTTGAAT ACGGTTTTGA CAATTGCGGA AACAAAAGCA
GCTGATCATA GTAAATTTTG GACACCTTTT ACACAGGAAC TGCTGGAATT TATCTCAGAA
AAAAATAAAA ATATCACATA TTTTTTATGG GGAAAAGATG TGCAGGCATT TGAGAAAAAT
ATAAAAAGCG GAGAAATAAT CAAACATAAT CATCCGTCTG TATGGGGAAA TCCTGAAAAT
GAGAAAGATT TTTTGAACAG CAGCTCATTT GAGAAAACAA AAGGAATTAT AAACTGGCTT
GGGTGTGAAA TGGAACGAAA GACGACATTA TTTTAA
 
Protein sequence
MKVILDKKLI NEKGLKDFDL DPVNKDLVAV GKKLYFVSQD LEGKVIIKEL GGKLKNIEGV 
KFIKEENQLF VSNSFLVLTM YGEIFKYYDR KHKASKTVFS MERTPDYINF TTNGKIIYLM
DDTLYSYNPN SEMTIKKPVI NKNNENRGKY KIYVNGENIV LKHRALHSQE NTISIFDEKL
EEIFNIKTVK NHIYSSISEL QYIAGTEDGE VEIWDVITKE LYNSVKISDY RISYIEKTKE
NYLLGLSSGE LIITDEKFRI EKKLNLHKGD ILKIKANDER IFTLGMDYNI LSLKILKNEE
TDIERRGFMQ EYNINDEYFE FFTYERIEAV RNFIRELKIK NISYNPKENL IFKVFSEPLS
EQKICIPVKE PYTQGNTATG LALEMEKNSW TDPELNNSLR NILKLLYKTY MGTSKDLNYI
REDIEKHIFN ILPPDKIFKY WQKNGVLLLN TVLTIAETKA ADHSKFWTPF TQELLEFISE
KNKNITYFLW GKDVQAFEKN IKSGEIIKHN HPSVWGNPEN EKDFLNSSSF EKTKGIINWL
GCEMERKTTL F
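
The sequences above are printed in space-separated blocks of ten characters, so they need whitespace removed before any analysis. The sketch below is one possible way to do that, then compute GC content and translate the gene sequence. It is not the database's own method, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns amino acids to codons the same way the standard code does (the tables differ in permitted start codons), and it simply stops at the first stop codon.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as "X" (an assumption).
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from this record.
    print(translate("ATGAAAGTAA TATTGGATAA AAAGTTAATA"))  # prints: MKVILDKKLI
```

Passing the full gene sequence to `translate` and the result of `clean` applied to the protein sequence to a string comparison is a quick consistency check between the two blocks.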