Gene Sterm_3458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3458 
Symbol 
ID8598909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3661676 
End bp3663316 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content37% 
IMG OID 
ProductPTS system, lactose/cellobiose family IIC subunit 
Protein accessionYP_003310228 
Protein GI269122051 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0308531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAC TGGTAAAATG GATAACGAAA ATGCAGCCGT TTTTTAATAA AATATCTAAT 
AATCCATATC TTCGTGCAAT AAGAGACGGA TTTATAAGTC TGATTCCCGT AATATTATTT
TCAAGTATAT TTTTACTTAT AGCATTCGTA CCAAATGCTT TCGGGTACTT TTTGCCCGAT
AATATTGTTA CAGTATTGAT GAAGGCTTAT TCATTATCTA TGGGGATTCT GGCAATTCTG
ATGTCAAGCA CTATTGCAAG AAGTCTTACA GATAATTTCA ACCTTAAAAT GCCTAAAACA
AGACAGATAA ATACAGTATC AGTAATGATA GCAGCAATAA TATCATTTCT TCTGTTAAGT
ACTGACCTGA AAGACGGGGC TATTTCTCTT GATTATCTGG GAACAAAAGG TCTTCTTACA
TCATTTATAA TAGGCTTTAC AATTCCTAAT ATATATAAAT TCTGTGTAGG AAGAAATATT
ACAATCAAGC TTCCCAAGGA AGTTCCGGGA AATATATCAC AGACATTTGC TGATATAATT
CCTATATCAC TCTCAGTATT ATTTTTCTGG GTATTTGATA TTTTAGTAAG AAAATTCATA
GGGGTAGGTT TCAGCGAATT TATTCTGGAA TTATTCAGAC CGTTGTTTTC AGCAGCTGAC
GGATATATAG GACTTGCAGT TATTTTCGGA GCTATGGCAT TTTTCTGGTT TGTGGGAATA
CATGGTCCGT CAATAGTGGA ACCGGCAGTT GCCGCTATAT ATCTTACAAA TGTAGAAGTG
AATTTTCAGA TGTTCAGTAA AGGAGAGCAT GCTACAAAAG TTCTTTCACA GGGCTCTCAG
TATTTTGTGG CAACTTTGGG AGGAACCGGG GCTACACTTG TAATAATATT CATGATGGCA
TTTATAGCTA AATCAAAGCA GCTGAAGGCA GTAGGAAAAG CCTCTTTGAT TCCGGGACTG
TTCGGTGTTA ACGAGCCTAT TCTGTTTGGT GCTCCTCTGG TATTGAATCC GGTATTTTTT
ATACCGTTTA TACTTACACC TATTATCAAC GTATGGCTAC TGAAATTTTT TATAACTTTA
GGGATGAACG GCTTCGTCTA CAATCTTCCG TGGACAACAC CGGGACCTCT TGGATTAATA
ATAGGAACTG GATTTTCTCC TCTGGTATTT CTACTGGTTC CGCTGCTTCT TGCAGTAGAC
TTTGTAATTT ATTATCCGTT TTTGAAAACT TATGATCTGC AGCTCATCAA ACAGGAAGAA
GAGGATAATA CTGCCACAGA ACGGCCGAAG GAAGAAATTA AGGATGAAAA AATCTATGAT
ATAAAAGATA AAAAAATATC CGTGCTTGTC TTATGTGCAA ACGGAGCAAC AAGTGGAATG
CTTGCCAATG CAATAGCAGA AGGAGCAAAA CAGAAAAATA TGGATCTTGA GTCGACTGCA
ATGGCATATG GTCAGCATAA AGAGGTGCTG GATCAGTTTG ATCTGATTAT ACTGGCACCG
CAGATGGCTT CAATGCTTGA TGAACTAAAA GTGGAAACAG ATAAAGCAGG AATAAAATCA
GTTTCTACAG GGGGAAAAGA ATATGTAGGT CTGACCCGGA ACCCTGAAGA GGCATTAAAA
TTCGCACTTA AGAATATATA G
 
Protein sequence
MMKLVKWITK MQPFFNKISN NPYLRAIRDG FISLIPVILF SSIFLLIAFV PNAFGYFLPD 
NIVTVLMKAY SLSMGILAIL MSSTIARSLT DNFNLKMPKT RQINTVSVMI AAIISFLLLS
TDLKDGAISL DYLGTKGLLT SFIIGFTIPN IYKFCVGRNI TIKLPKEVPG NISQTFADII
PISLSVLFFW VFDILVRKFI GVGFSEFILE LFRPLFSAAD GYIGLAVIFG AMAFFWFVGI
HGPSIVEPAV AAIYLTNVEV NFQMFSKGEH ATKVLSQGSQ YFVATLGGTG ATLVIIFMMA
FIAKSKQLKA VGKASLIPGL FGVNEPILFG APLVLNPVFF IPFILTPIIN VWLLKFFITL
GMNGFVYNLP WTTPGPLGLI IGTGFSPLVF LLVPLLLAVD FVIYYPFLKT YDLQLIKQEE
EDNTATERPK EEIKDEKIYD IKDKKISVLV LCANGATSGM LANAIAEGAK QKNMDLESTA
MAYGQHKEVL DQFDLIILAP QMASMLDELK VETDKAGIKS VSTGGKEYVG LTRNPEEALK
FALKNI