Gene Sterm_1031 details


Gene Information

Locus tag: Sterm_1031
Symbol:
ID: 8596510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 1115745
End bp: 1117406
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: Glycerol dehydratase
Protein accession: YP_003307830
Protein GI: 269119653
COG category:
COG ID:
TIGRFAM ID:
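
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1115745, 1117406)
print(length)                     # 1662
print(protein_length_aa(length))  # 553
```

Both results agree with the Gene Length and Protein Length fields of this record.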


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00671582
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTCAA AAAGATTTGA AGTCCTAAGA AACAGACCGG TAAATCAGGA TGGATTCGTT 
GCTGAATGGC CGGAGGTAGG ATTTATCGCT ATGAACGGAC CAAATGATCC CAAGCCCGGA
GTAAAGGTGC AAAATGGCGA AATAGTAGAG CTGGACGGTA AAAGGAAAGA AGATTTTGAT
TCGATAGATA TGTTTATAGC AAAATATTCG ATTAATATAG AAAAAGCAGA GGAAGTAATG
AAAATGGATT CACAGAAGCT GGCAAACATG CTTTGTGATC CTAATGTGCC CCGTACTAAG
CTGATAGAGA TAACTACTGC CATGACTCCG GGGAAAATAG TAGAAGTTTT GGGACATATG
AATGTACTGG AAATGATGAT GGCGCTTCAA AAAATGCGTG CCAGAAAAAC ACCGGCAAAT
CAGTGTCATG TTACCAGTGT AAAGGATAAT CCCGTTCAGA TTGCTGCAGA AGCAGCAGAA
GCAGCAGTAA GAGGATTTGC CGAGGAAGAA ACTACAGTGG GAATAGCAAG ATACGCTCCT
TTTAATGCTC TTGGACTGCT TATAGGTTCT CAGGTGGGAA GAGGAGGAAT TCTCACACAG
TGTGCCCTTG AAGAAGCAAC GGAACTTCTT CTGGGAATGA GAGGACTTAC TTCATATGCA
GAGACAATTT CTGTTTACGG AACAGAAGAT GTCTTTACAG ACGGTGACGA CACTCCGTGG
TCGAAGTCGT TTCTGGCATC TGCGTATGCG TCAAGAGGAT TAAAAATGAG ATTTACTTCC
GGGACAGGTT CAGAGGTTCA GATGGGATAT GCCGAAGGAA AGTCAATGCT TTATCTTGAG
GCAAGATGTA TATATATAAC AAAAGGTGCA GGGGTACAGG GACTGCAGAA CGGTTCCATA
AGCTGTATCG GGATACCCGG GGCAGTACCT TCGGGAATAC GTGCCGTGCT TGCGGAAAAC
CTTATAACAA CAATGCTTGA TCTGGAAGTA GCTTCCGGGA ATGACCAGAC TTTCTCGCAT
TCGGATATAA GAAGAACTGC GAGAATGCTT ATGCAGATGG TTCCGGGGAC AGATTTTATC
TTTTCGGGAT ACAGTGCGAC CCCTAATTAT GATAATATGT TTGCCGGGTC AAATTTTGAT
GCTGAGGATT TTGATGATTA TAATATTATA CAAAGAGATC TGAAAGTAGA CGGGGGTCTT
CGTCCTGTAG TGGAAGACGA AATTGTGGCA ATCAGAAATA AGGCTGCAAG AGTACTTCAG
GCTGTATTCA GAGAACTGGG TCTTCCGGAA ATTACTGATG AAGAGGTAAC AGCAGCTACC
TATGCACATG GAAGCAAGGA TATGCCTGAC AGAAATGTGG TGGAAGATCT GAAAGCAGCA
GGAGAAATGC TGACAAGAGG TATAACCGGA GTGGATGTAG TAAAAGCTCT TCATAAAAAT
GGATATCTGG ATGTAGCACA AAATGTGCTG AATATGCTGA AACAGAGAGT TTCCGGGGAT
TATCTTCATA CATCGGCAAT TATAAACAAG GATTTTGAAG TAATAAGTGC CGTGAATGAT
CTGAACGACT ATTCCGGACC GGGAACCGGG TACAGAATAA GCGAAGAGCG TTGGAATGAG
ATAAAAGATA TTCCAAATGC AATAAAACCT GATTCAATAT AG
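
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks must be removed before it is used. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the exact rounding the database applies is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as an illustration of the cleaning step.
first_line = "ATGAAGTCAA AAAGATTTGA AGTCCTAAGA AACAGACCGG TAAATCAGGA TGGATTCGTT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to gc_percent gives the value to compare with the GC content field.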
 
Protein sequence
MKSKRFEVLR NRPVNQDGFV AEWPEVGFIA MNGPNDPKPG VKVQNGEIVE LDGKRKEDFD 
SIDMFIAKYS INIEKAEEVM KMDSQKLANM LCDPNVPRTK LIEITTAMTP GKIVEVLGHM
NVLEMMMALQ KMRARKTPAN QCHVTSVKDN PVQIAAEAAE AAVRGFAEEE TTVGIARYAP
FNALGLLIGS QVGRGGILTQ CALEEATELL LGMRGLTSYA ETISVYGTED VFTDGDDTPW
SKSFLASAYA SRGLKMRFTS GTGSEVQMGY AEGKSMLYLE ARCIYITKGA GVQGLQNGSI
SCIGIPGAVP SGIRAVLAEN LITTMLDLEV ASGNDQTFSH SDIRRTARML MQMVPGTDFI
FSGYSATPNY DNMFAGSNFD AEDFDDYNII QRDLKVDGGL RPVVEDEIVA IRNKAARVLQ
AVFRELGLPE ITDEEVTAAT YAHGSKDMPD RNVVEDLKAA GEMLTRGITG VDVVKALHKN
GYLDVAQNVL NMLKQRVSGD YLHTSAIINK DFEVISAVND LNDYSGPGTG YRISEERWNE
IKDIPNAIKP DSI
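
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI amino acid string, which for translation table 11 assigns the same amino acids as the standard code (the two tables differ only in permitted start codons). The function name is hypothetical. Translation stops at the first stop codon and does not handle alternative start codons.

```python
from itertools import product

# Amino acids in NCBI order, with bases ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGTCAA AAAGATTTGA AGTCCTAAGA"))  # MKSKRFEVLR
```

The output matches the first ten residues of the protein sequence listed above.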