Gene Sterm_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_1996 
Symbol 
ID8597462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2126200 
End bp2127432 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content37% 
IMG OID 
ProductAlcohol dehydrogenase zinc-binding domain protein 
Protein accessionYP_003308782 
Protein GI269120605 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000031491 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAA AAGCGGTAAG ACTTTATGGC AAAAATGATT TGAGACTGGA AGAGTTTGAG 
CTGCCGGAAA TAAATGATGA CGAAATATTA ATGTCAGTAG TAACAGACAG TATATGTATG
TCTACCTGGA AGCTGGCAAA GCAGGGTGAG GATCATAAAA AAACACCGGA AAATATAAAA
GAAAAACCGA TTATAGTAGG ACATGAATTT TGCGGTGAAG TTATAAAAGT CGGAAAAAAC
TGGGGAAATA AATATAAGGA AAGCGAAAGA TATGTAGTGC AGGCTAATCT GCAGCTGCCG
GATGCTCCAT GGTGTCCCGG TTATTCATAT CAATACTGCG GAGGAGATGC TACATATATT
ATTCTGCCTG CTGATGTTAT GAAACAGGAC TGTCTGCTTC CGTACAACGG GGAAACTTAC
TTTGAGGGAT CGCTTATAGA ACCTTTATCT TGTGTAATAG GGGCATTTAA AGGAAACTAT
CATCTTATAG AAGGTACATA TGATCATAAA ATGGGAATAA AAGAGGGCGG AAATCTCCTT
ATATCGGGCG GAACAGGACC TATGGGTCTT CTTGCCATAG ATTATGCACT TCATGGAGAC
ATAAAACCTA AGAACATAGT AATAACAGAT GTAAATGAAG AAAGACTGAA AAGAGCTTCC
GAGCTTTATA AAACAGAGGG AGCAGTAAAA GTACATTATG TAAATACAGC TAAAATTGAT
GATGTTGTTG CTCATCTAAA AGAAACAGCC GGAGGAGGAT ATGATGACGT ATTTATATTT
GCTCCGGTGC CTGAACTGGT TACTCAGGGC TCAAAAATTC TGAATCCTGA CGGATGTCTG
AACTTTTTTG CAGGACCGCA GAACAAGGAT TTTTCTGCTG AAATAAATTT TTATGATATA
CATTATAATT TTACACATTA TGTAGGGACT TCCGGAGGGA ATACTGATGA TATGCGTGAG
GCAATAAAGC TTATAGAAGA TAAAAAGGTA AATGTGGCAA AGGTGGTAAC TCACATTCTG
GGTCTTGATG CAGTAGCAGA AACAACATTA AATCAGCCTG AAATAGCCGG CGGAAAAAAG
GTAGTCTATA CACAGAAAAA ATTTATGCTG GAATCTCTTG AAAAGCTTAT GCAGGATGAG
AACAGCGAGC TTGGAAAAAT ATTAAAGAAA AATGACGGAA TATGGTCAAA GGAAGCAGAA
GACTATATAA TGAAGAATAC AGAAGAAATA TAA
 
Protein sequence
MKTKAVRLYG KNDLRLEEFE LPEINDDEIL MSVVTDSICM STWKLAKQGE DHKKTPENIK 
EKPIIVGHEF CGEVIKVGKN WGNKYKESER YVVQANLQLP DAPWCPGYSY QYCGGDATYI
ILPADVMKQD CLLPYNGETY FEGSLIEPLS CVIGAFKGNY HLIEGTYDHK MGIKEGGNLL
ISGGTGPMGL LAIDYALHGD IKPKNIVITD VNEERLKRAS ELYKTEGAVK VHYVNTAKID
DVVAHLKETA GGGYDDVFIF APVPELVTQG SKILNPDGCL NFFAGPQNKD FSAEINFYDI
HYNFTHYVGT SGGNTDDMRE AIKLIEDKKV NVAKVVTHIL GLDAVAETTL NQPEIAGGKK
VVYTQKKFML ESLEKLMQDE NSELGKILKK NDGIWSKEAE DYIMKNTEEI