Gene Sterm_4066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4066 
Symbol 
ID8599510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4329232 
End bp4331004 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content36% 
IMG OID 
Productglycoside hydrolase family 2 sugar binding protein 
Protein accessionYP_003310829 
Protein GI269122652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAGAA ATGAATATCC CAGACCGGAT TTTGTGAGGG AACATTGGGC TTGTCTAAAC 
GGTATATGGG ATTTTGAATT TGATGATAAT AATATCGGAA TGTCATCAAA ATGGTATAAA
AAAGATCATA AGCTTACTGA AAAAATCAAT GTTCCTTTTG TTTTTCAATC AAAGCTAAGT
AATATCAATA TAAATGATTT TCACGATTTT ATCTGGTATA AAAGAGAATT TACAATAGAT
GATTCCTGGA AAAATAAAGA TATTTTACTG CATTTCGGAG CCGTGGATTA CAGATGTCAT
GTTTTTATCA ACGGAGAGCT GGCAGGAAGC CACGAAGGCG GACACACTTC CTTTTATCTG
AATATCACCA ATTATCTTAC ATGGAATAAA GAGGAAATAA CTGTATATGT AGAAGATCCT
TCTGATGATG AGACTATCCC AAGGGGAAAG CAATACTGGC TTGAAAATCC CGAGAGTATT
TGGTATAAGC GTTCCAGCGG CATCTGGCAG TCTGTCTGGA TTGAACCTGT TAATAAAAAC
TATATCACAG ACTTTAGATG TACTCCTTTA TTTGATCAGG GTTCTGTAGA ATTTAACATA
AAAACAAAAT CTGCCAAAAA AAATACAAAA ATCATGATAC AAATCTCATT TAGGGATACC
CTGATAGCTG AAGATATAAT AACTGTCAAT AATATTGAAA CTAAACGTAT CTACGATATT
TTTCAAAAGA AAATTTTCAG AGGCTGCACG CATGGTTCCG GATGGACATG GACTCCTGAA
AATCCTAATT TATTTGATGT TACACTTACT CTTCTTACCA ATGACAAAGT CTCAGATAAA
ATAGAAAGTT ATTTTGGAAT GAGAAAAATA CATACTGAAA ACGGAAAAGT ATACCTGAAT
AACAGACCTT ATTATCAAAG ACTGGTACTG GATCAGGGCT ACTGGCCCGA CAGCCTTATG
ACAGCCCCTT CTGATGAGGA TTTTAAAAAA GATATTATAC TGGCAAAACA GATGGGCTTT
AACGGATGCA GAAAACATCA AAAGATAGAA GACCGGCGTT TTCTTTACTG GGCTGATAAA
CTCGGCTATC TTGTATGGAG CGAGATGCCC AGCACTATCT CATATGATTC GAATTCTGTT
TCCAGAATTA CAAATGAATG GATAGAATCC GTAGACAGAG ATTATAATCA TCCCTGTATT
GTTACATGGG TTGCATTAAA TGAAAGCTGG GGTGTTTCGG AAATTAATTA TAATAAAATG
CAGCAAAGCC ACTCACTTTC TTTATATTAT ATGCTTCACT CGCTTGACAA TACCAGACCG
GTTATTGCAA ATGACGGATG GGAAGCTACA AAAACTGATA TTTGCGCTGT ACATAATTAC
CAACATGGTA CAAAAGATGA GAAAGAAAAA TATGAGAAAT TTATCAAAGA TTTGAGTACA
AAAGAAGACA TACTGGACTC TGTACCGGCG GGGCGTAATA TTTATGCTGA CGAATTCGAA
TATACCGGCG AGCCTGTTAT GCTCACAGAA TTCGGCGGGA TCGGCTATGA TAAAATCCGT
CCTGACGGCT GGGGATACAC AGTTGCTTCC AGTGAAGCAG AATTTATTCA TGATTTAGAG
CGTGTCTTCG ATGCCCTGCG AAAATCAAAA GTATTAACCG GATTCTGCTA TACACAGTTT
ACTGATGTAG AACAGGAAAT AAACGGTCTT CTTACTTATG AGCGTGAACC AAAATGTGAT
TTGGAAATTA TAAAAAATAT TGTAGAAAAG TAA
 
Protein sequence
MHRNEYPRPD FVREHWACLN GIWDFEFDDN NIGMSSKWYK KDHKLTEKIN VPFVFQSKLS 
NININDFHDF IWYKREFTID DSWKNKDILL HFGAVDYRCH VFINGELAGS HEGGHTSFYL
NITNYLTWNK EEITVYVEDP SDDETIPRGK QYWLENPESI WYKRSSGIWQ SVWIEPVNKN
YITDFRCTPL FDQGSVEFNI KTKSAKKNTK IMIQISFRDT LIAEDIITVN NIETKRIYDI
FQKKIFRGCT HGSGWTWTPE NPNLFDVTLT LLTNDKVSDK IESYFGMRKI HTENGKVYLN
NRPYYQRLVL DQGYWPDSLM TAPSDEDFKK DIILAKQMGF NGCRKHQKIE DRRFLYWADK
LGYLVWSEMP STISYDSNSV SRITNEWIES VDRDYNHPCI VTWVALNESW GVSEINYNKM
QQSHSLSLYY MLHSLDNTRP VIANDGWEAT KTDICAVHNY QHGTKDEKEK YEKFIKDLST
KEDILDSVPA GRNIYADEFE YTGEPVMLTE FGGIGYDKIR PDGWGYTVAS SEAEFIHDLE
RVFDALRKSK VLTGFCYTQF TDVEQEINGL LTYEREPKCD LEIIKNIVEK