Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2047 |
Symbol | |
ID | 5876250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2058212 |
End bp | 2060563 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641542393 |
Product | peptidase U32 |
Protein accession | YP_001663655 |
Protein GI | 167040670 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTGAATTATT AGCTCCAGCA GGAGATTATG AGGCTTTAAC GAGTGCAGTT AATGCAGGAT GCGATGCAGT GTATCTAGGA GGAAAAAATT TTGGTGCAAG AGCTTATGCA ACCAATTTTG ACTATGATGA TTTAAAATCT GCTGTAGAAT TCTGCCATTT AAGAGATGTC AAGGTATACG TTACTGTAAA TACCCTTGTA GCGAATGAAG AATTTGAAAA GCTGGTCAAC TATTTAGATT TTTTGTATTC TATAGGTGTT GATGCAGTTA TAGTGCAGGA TATGGGAGTA TTAAAGTTTT TACGAGAAAA TTATCCTGAT TTAAAAGTTC ATGCCAGTAC ACAAATGACA GTTCATAATT TAGAAGGAGT ACAAGAATTA GCTGAAAAGG GAGTTTCCCG GGTTATTTTA TCAAGAGAGC TCACATTAAA GGAAATAAAG GATATAGTTC AAAATTCTAA CATTGAAATA GAAGTCTTTG TTCATGGAGC CCTCTGCGTC AGTTATTCTG GACAATGCTT TATGAGTAGT ATACTAGGAG GAAGAAGCGG AAACAGGGGA AGATGTGCAC AACCCTGCCG TTTAAAGTAT TCTCTTGTAG ATAAAGAGGG AAAAGTTTTA GAAAAGGATT TACACCTTTT GAGTATGGCG GATTTATGTA CTATAGAACA TATACCAAAA CTCATTGAGG CGGGAATCAC TTCTTTTAAA ATAGAAGGTA GGATGAAAAA TGCCGAGTAC GTCGCTTCTG TTGTAAAAGC CTACAGAGAA GCTATTGACA GTTTTTATGA AGGAAGGACT TTTGATTCAG GTAAAGCTAT AGAAGAGATG TCTCGAATTT TTAATAGAGG ATTTTCTACT GGCTATCTTT TTGGGGTTAA ACCCTCCAAA ATGAGTTATC TTTCACCTAA AAACACGGGA GTTGCTGCTG CAGAAGTGAT AAGTGTAACT TCAAAAACTT CAAGACTGAG GCTTTTAAGG GATATTGCAA AAGGTGATGG AATTTCTAAT GAAAAAGGAG AAAAAGGACA AAAAGTTGAA ATAATATTTA AAAATGGGAA GAAGGTAGAT AGAGCTTATG AAGGAGACAT AATAGAACTA CCTCTTAAGT TTTATGTAAA AGAAGGAGAA ATATTAAATA AAACTTATGA TGTATTGTTA AATGACAAGC TGAAAAACTT ACTTTCTAAA AAGATTCCTA TAAAAATTTA TGCGGAGTTA AAAAAGGACA AACCTTTGTA TATAAAGATA CAAGAAGGAA TTCATACAGT AGAAGTTTAT AGTGATGAAA TAAGTCAGAT AGCAGAAAAA GTTTCTATTG AGGAAGATTT TTTGAAGGAT AAACTTACTC AAATAAATGA CACAGCCTTC TACGTAGAAG AAATAGAAGT AGTAGTTGAA AAAGGCCTTT ACATGTCTGT AAAGGGAATA AAAGAGGCGC GAAGAAAAGC AATAGAAATG TTAGAAAAGA AAAAGTTAGA GTATTACAGA AGAGAAGAAA AACACACCTT TTTTAGTCTA CTTCCTTTTA AAGAGAAAAA GGAAAAAGTG AGTTTAACTT TTTATACTGA TAAAGTTGAG CACTTGAAGA TAGCCAGCCA ATTGGGTATA GAATACGTCT ATTTCAATTA CAAGCTTGAT ATAAAACTTT TAAAAAAAGG TTTAGAACTG ACAAAAGATT CAAAGACAAC GGTTATACCT GCTTTTCCTT CTATATTGAG GGAAGAGATA AAAAGAATAA AGCCTCAATT AGAATTTTTA CAAGATATGG GGATAAACAA AATTTTGGTT TCAAATTTAG GGCTTTATCA TATTGCAAAA AATTATGACT TTGAGATATT TATAGATTAT CCTTTAAATA TTTTCAACAA TTTAGCTGTA GATTACGTTA AACCTTACGC TGTGACTTTA TCTTATGAAC TTACGCTGGA GCAGATTAAA GATATTGCCA AAAGAAGTGA TGTAAAATTT GAAGCTTTAA TATACGGTAG ACTGCCTCTT ATGACAATGG AATATTGTCC AATAAGGAAT TTAGTAGGCT GTGACAGAGA AAGGTGTGAA AAAGGTTATT ATTTTCTAAA AGACAGAAAA GGCAAATTAA TGCCTCTTAA AAGCAATGGT TTTTGCAGGA TGCAGATTTT AAACGCAGAT GTGCTTTTAA TGTTAAGCAT AAAGGAGCTT AAACAAGCTG GACTTTCTTT TTTGAGGATA CATGATACAA TAGAAGAAGA TGAAGAAATA GAGAAAGTTT TAAAAATGCA TATTGAAGCT TTAAAAGGGA ATGAAATTGA AATTTTAGAA GGAAAATATA CAAAAGGACA TTTTTACAGA GGAGTTTTGT GA
|
Protein sequence | MKKVELLAPA GDYEALTSAV NAGCDAVYLG GKNFGARAYA TNFDYDDLKS AVEFCHLRDV KVYVTVNTLV ANEEFEKLVN YLDFLYSIGV DAVIVQDMGV LKFLRENYPD LKVHASTQMT VHNLEGVQEL AEKGVSRVIL SRELTLKEIK DIVQNSNIEI EVFVHGALCV SYSGQCFMSS ILGGRSGNRG RCAQPCRLKY SLVDKEGKVL EKDLHLLSMA DLCTIEHIPK LIEAGITSFK IEGRMKNAEY VASVVKAYRE AIDSFYEGRT FDSGKAIEEM SRIFNRGFST GYLFGVKPSK MSYLSPKNTG VAAAEVISVT SKTSRLRLLR DIAKGDGISN EKGEKGQKVE IIFKNGKKVD RAYEGDIIEL PLKFYVKEGE ILNKTYDVLL NDKLKNLLSK KIPIKIYAEL KKDKPLYIKI QEGIHTVEVY SDEISQIAEK VSIEEDFLKD KLTQINDTAF YVEEIEVVVE KGLYMSVKGI KEARRKAIEM LEKKKLEYYR REEKHTFFSL LPFKEKKEKV SLTFYTDKVE HLKIASQLGI EYVYFNYKLD IKLLKKGLEL TKDSKTTVIP AFPSILREEI KRIKPQLEFL QDMGINKILV SNLGLYHIAK NYDFEIFIDY PLNIFNNLAV DYVKPYAVTL SYELTLEQIK DIAKRSDVKF EALIYGRLPL MTMEYCPIRN LVGCDRERCE KGYYFLKDRK GKLMPLKSNG FCRMQILNAD VLLMLSIKEL KQAGLSFLRI HDTIEEDEEI EKVLKMHIEA LKGNEIEILE GKYTKGHFYR GVL
|
| |