Gene Sterm_4095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4095 
Symbol 
ID8599539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4360803 
End bp4362212 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content36% 
IMG OID 
Productglycoside hydrolase family 1 
Protein accessionYP_003310858 
Protein GI269122681 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.362343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TTAAATTTCC GGAAAATTTT GATTGGGGTA CAGCGTCAAG CGGACCGCAG 
AGTGAAGGGT CACAGGATAA ACCGCATGAA TCTTTATGGG ACTACTGGTA TAAAAATGAT
AAAGAGCGTT TTTACCAGAA TGTAGGACCG AAGATAACAT GCGACAGCTA TAACAGATAT
AAGGAAGACA TAAAAATAAT GGCGGAATTA GGCTTAAAGT CATTCAGAAC ATCTATTCAG
TGGACAAGAC TTATAAAAAA TCTTGAAACA GGCGAGCCGG ATCCAAAAGC TGTGGAATTT
TATAATAATT ATATCAATGA AATGCTGGAA AACGGGATTG AACCTGTAAT GAATCTTTTT
CACTTTGATA CTCCCATTGA GCTTGAGCAG AAATATGGAG GCTTTAAAAG TAAAAAGGTA
ACAGAGCTAT ATGTAAAGTT TGCTAAGACA GCTTTTGAGC TTTTCGGTGA CAGAGTAAAA
AGATGGATAA CTTTTAATGA GCCTGTTGCA CACACAAAAG GAGCATATCT GTATAATTTT
ATATATCCTG CGGAATTAAG TACAAAATCC TTTGCACAGG CGAGCTATAA CATTCTTCTT
GCACATGCGG GAGCTTGTAA AGTATATAAA AGTTTACAGC TGAGCGGAGA AATAGGAATA
GTGCTGGATC TTCTTCCGCC GATACCAAGA AGCGCAAGAA GTGCGGATAA GTATGCTGCA
AGGGTTGCAG ATCTGTTTTT TAATATGATA TTTATGGATC CGTGCGTAAA CGGACAATAC
GATGATGAAT ATCTGGAAAT TTTGGAAAAG CATGACTGTA TGTTTGATAT CCTTCCTGGG
GAAAAGGAAT TAATAAAAGA AAATACTGTA GACTTTATAG GGATAAACTA TTACCAGCCG
CAGCGTGTAA ATGCTCCTAA ATATGCTCCT AATGTAAATG CACCATTTAC ACCTAACTGG
TATTTTGATG ATTATGAAAT GCCTAACAGA CGTATGAATG TGTACAGAGG CTGGGAAATA
TATCCGAAAT CAATTTATGA TATAGCAATG CGTGTAAAAA ATGAATTTGG CAATATCAAA
TGGTTTATAT CGGAAAATGG CATGGGAGTG CAGAATGAAG AGCGTTTTAT AAATAAAGAC
GGTATGATAG AGGATGATTA CAGAATAGAA TTCATAAAGG AACATTTGGA ATGGCTTCAT
AAAGCAATGG AAGAAGGATC GAACTGTCAG GGATATCACC TGTGGACATT TGTGGATAAC
TGGTCATGGA CTAATGCTTA TAAAAACAGA TACGGTTATG TATCTCTTGA TTTGAAAACA
AGAAACAGAA CAATAAAAAA ATCAGGATAC TGGATAAAAA AAGTAATTGA AGAAAATGGC
TTTGAACCGC TTGAAGATGA ATTTGAATAA
 
Protein sequence
MKKLKFPENF DWGTASSGPQ SEGSQDKPHE SLWDYWYKND KERFYQNVGP KITCDSYNRY 
KEDIKIMAEL GLKSFRTSIQ WTRLIKNLET GEPDPKAVEF YNNYINEMLE NGIEPVMNLF
HFDTPIELEQ KYGGFKSKKV TELYVKFAKT AFELFGDRVK RWITFNEPVA HTKGAYLYNF
IYPAELSTKS FAQASYNILL AHAGACKVYK SLQLSGEIGI VLDLLPPIPR SARSADKYAA
RVADLFFNMI FMDPCVNGQY DDEYLEILEK HDCMFDILPG EKELIKENTV DFIGINYYQP
QRVNAPKYAP NVNAPFTPNW YFDDYEMPNR RMNVYRGWEI YPKSIYDIAM RVKNEFGNIK
WFISENGMGV QNEERFINKD GMIEDDYRIE FIKEHLEWLH KAMEEGSNCQ GYHLWTFVDN
WSWTNAYKNR YGYVSLDLKT RNRTIKKSGY WIKKVIEENG FEPLEDEFE