Gene Sterm_3601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3601 
Symbol 
ID8599047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3822332 
End bp3824095 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content41% 
IMG OID 
Producthydrogenase, Fe-only 
Protein accessionYP_003310366 
Protein GI269122189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATA ATTTAATCAC CCTTACTATA GATCATAAAA CTGTGGAAGT ACCTGAAGGC 
ACTACCATAC TTGCTGCAGC TAAAAGTGTC GGCATTAGTA TTCCTACTCT CTGTTACCTG
AATCTTTCAG ACTTCGGCTG TGTAAATACC CCGTCTTCTT GCAGAATGTG TCTTGTAGAA
GTAGAAGGCA GAAGAAATCT TGCTCCTGCA TGCGTTACTC CTGCTTCTGA TAAAATGACA
GTCAGTACTA ACTCTGTAAG AGCACTGAAA ACCAGAAAAA CAATGCTGGA ATTACTTCTT
TCAGATCACC CGAAAGACTG CCTGACCTGT CAGAAATCAG GAAACTGTGA ATTGCAGAGC
CTTGCTGATA AATTTTCTAT AAGAGATATA AAGCTGAAAG GAGAACAGTC TGCTTACAGA
CTGGATATCT CAAAATCCCT TATCCGTGAT ATGGATAAAT GTATTATGTG CCGCCGCTGT
GAAACAATGT GCAATGAAGT ACAGACTGTA GGAGTATTAT CTGCTATAAA CAGAGGGTTT
GAATCTGTTA TTGCCACTGC TATGGAAATA AATCTGAGTG ACTCTGTATG CACATATTGC
GGACAGTGTG CTGCCGTCTG TCCCACAGGA GCCTTGGTAG AGAATGATGC TACATGGGAT
GTTGTCAAAG CTTTGGGAGA CCCGGAAAAA ACAGTCATTG TCCAGACTGC TCCTTCGGTA
AGGGCAGCAC TCGGAGAAGA ATTCGGGCTG GAGCCGGGAA CACTTGTTAC GGGTAAAATG
GTGGCAGCAT TGCGCGGTCT TGGTTTTGAC AAGGTATTTG ATACAGATTT TGGTGCTGAT
CTTACTATAA TGGAAGAAGC TTCCGAATTT TTAGACAGAT TAACACGGCA TCTTGACGGT
GACACCAGTG TAAAACTTCC TATACTTACT TCTTGCTGTC CTGCATGGGT AAACTTTTTT
GAGCATAATT TCAGCGACCT TCTGGATGTT CCTTCCACTT CAAAATCTCC TATGCAGATG
TTCAGTGCCG TAGTAAAAAA TGTTTACGCT CAGGAGCTGG GTGTAGACAG AAAAAACCTT
GTGGTTGTTT CTGTTATGCC TTGTCTTGCA AAAAAATACG AAGCAAGCCG TGATGAATTT
TCAATAGGAA ATGACTATGA TACTGATATC GTTCTTTCTA CAAGGGAACT TGCAAAATTA
ATAAAACAAT ATAATATAGA ATTTAATCTG CTGAAAGATG AAGAGTTTGA TAATCCTCTC
GGAGAATCAA CAGGTGCAAG TATTATTTTC GGAAGAACAG GGGGAGTTAT TGAAGCAGCG
CTCAGAACAG CTGCTGACTG GTATACCAAA GAAGATCTGC AGGACATTGA TTATACTCAG
GTCAGAGGAT TTGAAGGAGT TCGAAGTGCT GATGTAAAAA TCGGCGATCT GGAGCTGAAA
ATCGGAATTG CTCATGGTCT GGGAGAAGCA CGCAAGCTGC TTGAGGAAGT AAGAGCCGGA
AAATCTGCAT ACCATGCTAT AGAAATAATG GCCTGTAAAG GCGGATGTAT CGGCGGCGGC
GGACAGCCTT ACCATCACGG GAATACTGCT ATACTAAAGA AGCGAACCGA GGCGCTCAAA
ACTGAAGACG AATCTAAAAA AATCAGAAAA TCCCATGAGA ATCCTTATAT TATAAAACTA
TATAAAGAGT ATTTCGGAGA GCCTTTAAGC CACAGATCCC ACGAATTACT GCATACAAAA
TATTTCAAAA AGCATAAAAT ATAA
 
Protein sequence
MKHNLITLTI DHKTVEVPEG TTILAAAKSV GISIPTLCYL NLSDFGCVNT PSSCRMCLVE 
VEGRRNLAPA CVTPASDKMT VSTNSVRALK TRKTMLELLL SDHPKDCLTC QKSGNCELQS
LADKFSIRDI KLKGEQSAYR LDISKSLIRD MDKCIMCRRC ETMCNEVQTV GVLSAINRGF
ESVIATAMEI NLSDSVCTYC GQCAAVCPTG ALVENDATWD VVKALGDPEK TVIVQTAPSV
RAALGEEFGL EPGTLVTGKM VAALRGLGFD KVFDTDFGAD LTIMEEASEF LDRLTRHLDG
DTSVKLPILT SCCPAWVNFF EHNFSDLLDV PSTSKSPMQM FSAVVKNVYA QELGVDRKNL
VVVSVMPCLA KKYEASRDEF SIGNDYDTDI VLSTRELAKL IKQYNIEFNL LKDEEFDNPL
GESTGASIIF GRTGGVIEAA LRTAADWYTK EDLQDIDYTQ VRGFEGVRSA DVKIGDLELK
IGIAHGLGEA RKLLEEVRAG KSAYHAIEIM ACKGGCIGGG GQPYHHGNTA ILKKRTEALK
TEDESKKIRK SHENPYIIKL YKEYFGEPLS HRSHELLHTK YFKKHKI