Gene Sterm_3346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3346 
Symbol 
ID8598798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3521246 
End bp3522568 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content40% 
IMG OID 
ProductPTS system, cellobiose-specific IIC subunit 
Protein accessionYP_003310117 
Protein GI269121940 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAG CAATGGACGG GGTTATAGGA TTTCTGGAAA AACATTTAAT GCCTGTAGCA 
GGAAGAATAG GAAATCAAAA ACATGTTCAG GCAATAAGAG ATGGAATTAT AGTTACAATG
CCGCTTACAA TAATAGGATC AGTATTTTTA ATAATAGGAA ACTTTCCGAT ACCGGCATAC
ACAAAATGGC TGGCGGAAAC AGGAATAGCT GAAAAGCTGG GGTATCCGGT AACAGCATCC
TTCGGACTAA TGGGAGTAAT AGCCTGTATA GGTATAGCAT ACAGACTTTC TGAAAAATAT
AATGTAGATG CGCTTACAGG GGCAGTGTTG TCTTTATGTA CTTTCGTTCT TGTAACACCT
TACAACATCC CGTTTTTACA AGACGGTAAA GAAATAGGCA CTGTAGGCGG AATCGCATTT
AGCTTTTTAG ACAGTGGAGG ATTATTCGTA GGTTTGATAA TGTCAATATT TACTGTGGAA
ATATACAGAA TAATTGTACA AAAAGATATT ATTATAAAAA TGCCTGACGG AGTACCGCCT
GCAGTGGCAA AGTCATTCGC AGCCCTGATT CCGGGAATGA TAATATTAAC TGTAGTATGG
ATAATCAGAC TTGGATTAAT GTACACTCCA TTTGAGGATA TGCATAATAT AGTAAGAGTA
ATACTGGTAG GACCGCTTAC AAAAATCGGA GGAACATACT GGGGAGCTCT CGTAGTAACA
CTGCTTATTC ACTTATTGTG GATGACGGGA ATTCACGGTG CAGCCCTTAT AATGGGAATA
ATCTCTCCGG TAACTTATAA GCTTATGGCG GAAAATAATG CTGCTTATAT GGCAGGAGCA
AGAGGAACAG AATTACCTCA CGTTGTAACA ACACAGTTTT TTGATATATT CCAGTCAATG
GGCGGATCAG GATCTACATT CTCACTTGCA ATAATATTAT TCCTCTTCTC AAAGAGCAAA
CAGCTGAAAG AAATAGGAAA ACTGGCTGTA GGACCGGCAT TCTTCAATAT TAATGAACCG
ATTCTTTTCG GGCTTCCTAT AGTAATGAAC CCGCTTATGC TGATACCGTT TGTACTGTCA
CCGGTAGTGG TAATTACAAT AACATACTGG TCAATGAAAT TAGGACTGGT GTCAAGACTC
GCAGGTATAG CGATACCATG GACAACACCG CCTGTTCTCG GAGGGGCACT GGCTACGGCG
AGTATATCCG GAGGAGTAAT ACAGGTAATA AGTATGGTAT TGACTTTCTT TATATATTAT
CCATTCTTTA AAATAATGGA TGCGCAAAAA TTAAAAGAAG AGCAGGCAGC AGTTAGTGCA
TAA
 
Protein sequence
MAGAMDGVIG FLEKHLMPVA GRIGNQKHVQ AIRDGIIVTM PLTIIGSVFL IIGNFPIPAY 
TKWLAETGIA EKLGYPVTAS FGLMGVIACI GIAYRLSEKY NVDALTGAVL SLCTFVLVTP
YNIPFLQDGK EIGTVGGIAF SFLDSGGLFV GLIMSIFTVE IYRIIVQKDI IIKMPDGVPP
AVAKSFAALI PGMIILTVVW IIRLGLMYTP FEDMHNIVRV ILVGPLTKIG GTYWGALVVT
LLIHLLWMTG IHGAALIMGI ISPVTYKLMA ENNAAYMAGA RGTELPHVVT TQFFDIFQSM
GGSGSTFSLA IILFLFSKSK QLKEIGKLAV GPAFFNINEP ILFGLPIVMN PLMLIPFVLS
PVVVITITYW SMKLGLVSRL AGIAIPWTTP PVLGGALATA SISGGVIQVI SMVLTFFIYY
PFFKIMDAQK LKEEQAAVSA