Gene Sterm_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3661 
Symbol 
ID8599107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp3889235 
End bp3890575 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content39% 
IMG OID 
Productglycoside hydrolase family 4 
Protein accessionYP_003310426 
Protein GI269122249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAAATGCCTT TTCAATTTCT ATTGCAGGCG GCGGCAGTAC CTATACACCT 
GAAATAATTC TAATGCTTTT AGAAAACCTG GATCGTTTTC CTATTAGAAA AATTATTTTT
TATGATAATG ATTATGAACG TCAAAAACTG GTAGCAGATG CATGTGAAAT AATACTCCGT
GAAAGAGCTC CCGAAATTGA ATTTTTATCA ACAACAAACC CAAAAGACGG GTTTACAGAT
ATAGATTTTG TAATGGCACA TATTAGAGTA GGAAAGTATG AAATGAGGGA AAAGGATGAA
AAAATTCCTT TGGAGCACGG AATAGTCGGG CAGGAAACCT GCGGACCCGG CGGTATAGCC
TATGGACTCA GATCAATTAC CGGAGTGGTG GAGCTTATTG ACTATATGGA AGAATACTCA
CCGAATGCAT GGATGCTGAA TTACTCCAAT CCTGCTTCAA TAGTGGCAGA AGGTGTCAGA
GTTCTGAGAC CTGATTCAAA AGTTTTGAAT ATCTGTGATA TGCCTATAGA TATAGAAGAA
CGCATGGCTG TGATTGCAGG GCTGGAATCA AGAAAAGATA TGACAGTGCG TTATTACGGA
CTAAATCACT TCGGATGGTG GGATGATATA AGAGATAAAG ACGGAAATGA CCTTATGCCT
GTGATAAAAA AGTATGTTCT GGAAAATGGA TATAATTTAG AGCTGTTAAA TAAAGAGGTA
AGGCTTAATG ATCCTGACTG GCTTGATACA TACCGTTTTG CAAAGGAAGT TTATCGTCTG
GATACTGAAA CTCTTCCGAC TACTTATTTT AAATATTATC TGTTTTCTGA TTATGCTGTA
AAGCATACTG ATAAAAATTA TACACGGGCA AATCAGGTAA TGGACGGACG GGAAAAAAGA
GTTTTTGATG AATGCAGTAC CATCGTGGAA AAGAATACCG CAAAGGAAAC AAATTTGGAA
ATAGGTGTTC ATGCCAGCTA TATTGTGGAT TTAGCAAGAG CACTTGCATT TAATACACAT
GAAAGAATGC TTCTTATTGT AGAAAATAAA GGAGCTATAT GCGGCGTGGA TCCTACAGCT
ATGGTTGAAA TCCCGTGCAT TGTCGGTGCC AACGGTCCGG AGCCTTTATC TATAGGTGAA
ATCCCGACTT TCCAAAGAGG ACTTATTCAG CAGCAGCTGG CAGTGGAAAA GCTGGCAGTG
GAAGCCTATG TAGAAGAATC GTATCAAAAA TTATGGCAGG CCTTGACGTT ATCCCGTACA
ATGGGTGATG CTGATCTGGC TAAGGTTATA TTAGATGAAC TGATTGAAGC TAATATAGAT
TACTGGCCGA ATTTAAAATA A
 
Protein sequence
MNKKNAFSIS IAGGGSTYTP EIILMLLENL DRFPIRKIIF YDNDYERQKL VADACEIILR 
ERAPEIEFLS TTNPKDGFTD IDFVMAHIRV GKYEMREKDE KIPLEHGIVG QETCGPGGIA
YGLRSITGVV ELIDYMEEYS PNAWMLNYSN PASIVAEGVR VLRPDSKVLN ICDMPIDIEE
RMAVIAGLES RKDMTVRYYG LNHFGWWDDI RDKDGNDLMP VIKKYVLENG YNLELLNKEV
RLNDPDWLDT YRFAKEVYRL DTETLPTTYF KYYLFSDYAV KHTDKNYTRA NQVMDGREKR
VFDECSTIVE KNTAKETNLE IGVHASYIVD LARALAFNTH ERMLLIVENK GAICGVDPTA
MVEIPCIVGA NGPEPLSIGE IPTFQRGLIQ QQLAVEKLAV EAYVEESYQK LWQALTLSRT
MGDADLAKVI LDELIEANID YWPNLK