Gene Sterm_3109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3109 
Symbol 
ID8598563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3255815 
End bp3257911 
Gene Length2097 bp 
Protein Length698 aa 
Translation table11 
GC content35% 
IMG OID 
Productprotease-associated PA domain protein 
Protein accessionYP_003309882 
Protein GI269121705 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000536812 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TTTTGATTTT ATTTTTTACA GTGTCGTTAA TGATGATTTC GGCACCCAGA 
GAAAGCACAG ACAGCAGGGA AAATATCATA AATCATCTTG ATATTAATTA TTCTTATAAT
ATTGCCGAAT CCCTGACGAA GTTTAAGACT AACGAAAAAC TGGGATTTCG TACTGCCGGT
TCCAGTGCTG AACATGCAGC CGGAGATATG CTTTATGAAG AGTTCAAAAA GCTCGGGCTT
AAAAATGTCA GAAAAGATGA ATTTACTGTA GATGCATGGG AATTTAAAAA TGCAGAGCTT
ACGTATACCG ATAAAAAAAA TAAAAAGCAA AGGCTTACTT TAAACAGTTA TGCTGCCAAT
TTTGCGACAA ATGGTACAGA AGTGTATGAT CTGGTTTATC TGAACAAGGG TACCAGAGAT
GATTATGAAA ATGCGGATGT CAAGGGAAAG ATAGTTATGG TTGATATTAA TCAGCGTGAG
GACTGGTGGA TTAATTATCC TGCCATGCAG GCAAAGCTAA AAGGAGCAAA GGCTGTAATA
GCCGTAAATA ACGGAGGATA TGCAGAAATC AGCGATGATG CCCTGAATGT ACAGGACATG
TGCGGTCCTG ATGATACACC TGCCTTGGGA ATGTCAAAAG CAGACGGTGA TAAGCTGAAA
GCTCTTATGA ATAAAAATAG AACTGTAAAA ATAGAACTAA ATGTGGATTC TCAGGTTAAA
AGAGATCAAA AGGCCTATAA TATTGTTGGT GAAATCCCCG GAAAGGATCC TGATTCACTT
ATAATATTAA GTTCACATTA TGACGGATAT TTTGAGGCAT TTCAGGATAA TGCTACAGCA
GTTGCCCTTA CTATGGGAAT AGCCAAAAGT ATAATTGACA GCGGGTATCA GCCTGAAAAA
ACAATTATCG TTATTGCACA TGCTGCCGAA GAATGGGGAA CAGTAGATAC AAGATATGAC
TGGTCTGTAG GTGCGTATAA TCAGGTGTTT AAAGTAAGAC CGGACTGGGC AGCCAAAAGT
TTTGCCATGC TGAATTTTGA ACAGCCGGGA TCTGAACATG TAAAAACACA GGAAATAAGA
ACTGTTTATG AATATAAAAC ATTCATTGAA AGTATTGCTG ACAGAATTAA ACCGTCTGTC
TCAGGTGTAT ATGAGGGAGG AATCAAGGTT ACCACACCTC CGAGAACATG GGCGGATGAT
TTTTCATATT CTATAGCCGG GATTCCTACT ATAAGAAATG ATTATGTAGG AGCACAGTTT
ATGAAATCGA CATATCATAC AAATTATGAT ACTAAAGCAA CTTATAATGA AAAAGCCTTT
ACTTATAATC ACCAGCTTTA TGCACAGATT GTTTACGAGC TTGATCAAAA AGCAGTTATG
CCGATGGATT TTACTACACG TTTCAATGAA TTCAAAGCTA CGCTGGATAT GGATTTACTG
GCTAAAACCG GAAATGAAGG AAAAAAACTT CTGACAGACA TTGAAGAAGT AATAAAAACT
TCTGAAAATC TGAACAGACT GCTTGCAGAT ATAAATAATA AGCATGAGCA GGCTATAAAA
AGTAATAATA CTGCCGAAAT TAAGAAATAT GAAACAAAAG CAGATACTGT GAACAAACAG
CTCCTTGCTC TTTATAAATA TTGTCAGGAT TCATTTATAA AGCTTACATG GGAAGATGAT
TCCATATTCC CGCATGAACA TGCACAAAAT AATATAAATG CCCTAAATGA AGCTGTAGCT
TTACTTGAAA AGGGAGATAT AGATACTGCA GTTAATGATC ACCTTTCTTT AATTGATAAT
AACTGGTATG CACTGAGTTT TGACAAAGAA ACTTATGAAT ATTTTACGAA TCAGGTTTTA
AAACAAGATA AGGAACGTCT GAACTGGGGT GCAGGAAGAA TTATGGGACA TGAGGATCTT
TATGATATTA TTTTCTCTTT GCAGCAAAAA CAAAAATCAG GAGAAAAAAA TGTTAAGAAT
GAAATAGAAG CTTTGAAAAA GATTCTTGCT TCTCAGGAAG CACTTATGAA AAATACTGTT
ATTACAGAAA ATAAACAGCT TCTTGAAGTA AAAAAATATC TAAATAAAAT AAAGTAA
 
Protein sequence
MKKILILFFT VSLMMISAPR ESTDSRENII NHLDINYSYN IAESLTKFKT NEKLGFRTAG 
SSAEHAAGDM LYEEFKKLGL KNVRKDEFTV DAWEFKNAEL TYTDKKNKKQ RLTLNSYAAN
FATNGTEVYD LVYLNKGTRD DYENADVKGK IVMVDINQRE DWWINYPAMQ AKLKGAKAVI
AVNNGGYAEI SDDALNVQDM CGPDDTPALG MSKADGDKLK ALMNKNRTVK IELNVDSQVK
RDQKAYNIVG EIPGKDPDSL IILSSHYDGY FEAFQDNATA VALTMGIAKS IIDSGYQPEK
TIIVIAHAAE EWGTVDTRYD WSVGAYNQVF KVRPDWAAKS FAMLNFEQPG SEHVKTQEIR
TVYEYKTFIE SIADRIKPSV SGVYEGGIKV TTPPRTWADD FSYSIAGIPT IRNDYVGAQF
MKSTYHTNYD TKATYNEKAF TYNHQLYAQI VYELDQKAVM PMDFTTRFNE FKATLDMDLL
AKTGNEGKKL LTDIEEVIKT SENLNRLLAD INNKHEQAIK SNNTAEIKKY ETKADTVNKQ
LLALYKYCQD SFIKLTWEDD SIFPHEHAQN NINALNEAVA LLEKGDIDTA VNDHLSLIDN
NWYALSFDKE TYEYFTNQVL KQDKERLNWG AGRIMGHEDL YDIIFSLQQK QKSGEKNVKN
EIEALKKILA SQEALMKNTV ITENKQLLEV KKYLNKIK