Gene Teth514_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_2047 
Symbol 
ID5876250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp2058212 
End bp2060563 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content32% 
IMG OID641542393 
Productpeptidase U32 
Protein accessionYP_001663655 
Protein GI167040670 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG TTGAATTATT AGCTCCAGCA GGAGATTATG AGGCTTTAAC GAGTGCAGTT 
AATGCAGGAT GCGATGCAGT GTATCTAGGA GGAAAAAATT TTGGTGCAAG AGCTTATGCA
ACCAATTTTG ACTATGATGA TTTAAAATCT GCTGTAGAAT TCTGCCATTT AAGAGATGTC
AAGGTATACG TTACTGTAAA TACCCTTGTA GCGAATGAAG AATTTGAAAA GCTGGTCAAC
TATTTAGATT TTTTGTATTC TATAGGTGTT GATGCAGTTA TAGTGCAGGA TATGGGAGTA
TTAAAGTTTT TACGAGAAAA TTATCCTGAT TTAAAAGTTC ATGCCAGTAC ACAAATGACA
GTTCATAATT TAGAAGGAGT ACAAGAATTA GCTGAAAAGG GAGTTTCCCG GGTTATTTTA
TCAAGAGAGC TCACATTAAA GGAAATAAAG GATATAGTTC AAAATTCTAA CATTGAAATA
GAAGTCTTTG TTCATGGAGC CCTCTGCGTC AGTTATTCTG GACAATGCTT TATGAGTAGT
ATACTAGGAG GAAGAAGCGG AAACAGGGGA AGATGTGCAC AACCCTGCCG TTTAAAGTAT
TCTCTTGTAG ATAAAGAGGG AAAAGTTTTA GAAAAGGATT TACACCTTTT GAGTATGGCG
GATTTATGTA CTATAGAACA TATACCAAAA CTCATTGAGG CGGGAATCAC TTCTTTTAAA
ATAGAAGGTA GGATGAAAAA TGCCGAGTAC GTCGCTTCTG TTGTAAAAGC CTACAGAGAA
GCTATTGACA GTTTTTATGA AGGAAGGACT TTTGATTCAG GTAAAGCTAT AGAAGAGATG
TCTCGAATTT TTAATAGAGG ATTTTCTACT GGCTATCTTT TTGGGGTTAA ACCCTCCAAA
ATGAGTTATC TTTCACCTAA AAACACGGGA GTTGCTGCTG CAGAAGTGAT AAGTGTAACT
TCAAAAACTT CAAGACTGAG GCTTTTAAGG GATATTGCAA AAGGTGATGG AATTTCTAAT
GAAAAAGGAG AAAAAGGACA AAAAGTTGAA ATAATATTTA AAAATGGGAA GAAGGTAGAT
AGAGCTTATG AAGGAGACAT AATAGAACTA CCTCTTAAGT TTTATGTAAA AGAAGGAGAA
ATATTAAATA AAACTTATGA TGTATTGTTA AATGACAAGC TGAAAAACTT ACTTTCTAAA
AAGATTCCTA TAAAAATTTA TGCGGAGTTA AAAAAGGACA AACCTTTGTA TATAAAGATA
CAAGAAGGAA TTCATACAGT AGAAGTTTAT AGTGATGAAA TAAGTCAGAT AGCAGAAAAA
GTTTCTATTG AGGAAGATTT TTTGAAGGAT AAACTTACTC AAATAAATGA CACAGCCTTC
TACGTAGAAG AAATAGAAGT AGTAGTTGAA AAAGGCCTTT ACATGTCTGT AAAGGGAATA
AAAGAGGCGC GAAGAAAAGC AATAGAAATG TTAGAAAAGA AAAAGTTAGA GTATTACAGA
AGAGAAGAAA AACACACCTT TTTTAGTCTA CTTCCTTTTA AAGAGAAAAA GGAAAAAGTG
AGTTTAACTT TTTATACTGA TAAAGTTGAG CACTTGAAGA TAGCCAGCCA ATTGGGTATA
GAATACGTCT ATTTCAATTA CAAGCTTGAT ATAAAACTTT TAAAAAAAGG TTTAGAACTG
ACAAAAGATT CAAAGACAAC GGTTATACCT GCTTTTCCTT CTATATTGAG GGAAGAGATA
AAAAGAATAA AGCCTCAATT AGAATTTTTA CAAGATATGG GGATAAACAA AATTTTGGTT
TCAAATTTAG GGCTTTATCA TATTGCAAAA AATTATGACT TTGAGATATT TATAGATTAT
CCTTTAAATA TTTTCAACAA TTTAGCTGTA GATTACGTTA AACCTTACGC TGTGACTTTA
TCTTATGAAC TTACGCTGGA GCAGATTAAA GATATTGCCA AAAGAAGTGA TGTAAAATTT
GAAGCTTTAA TATACGGTAG ACTGCCTCTT ATGACAATGG AATATTGTCC AATAAGGAAT
TTAGTAGGCT GTGACAGAGA AAGGTGTGAA AAAGGTTATT ATTTTCTAAA AGACAGAAAA
GGCAAATTAA TGCCTCTTAA AAGCAATGGT TTTTGCAGGA TGCAGATTTT AAACGCAGAT
GTGCTTTTAA TGTTAAGCAT AAAGGAGCTT AAACAAGCTG GACTTTCTTT TTTGAGGATA
CATGATACAA TAGAAGAAGA TGAAGAAATA GAGAAAGTTT TAAAAATGCA TATTGAAGCT
TTAAAAGGGA ATGAAATTGA AATTTTAGAA GGAAAATATA CAAAAGGACA TTTTTACAGA
GGAGTTTTGT GA
 
Protein sequence
MKKVELLAPA GDYEALTSAV NAGCDAVYLG GKNFGARAYA TNFDYDDLKS AVEFCHLRDV 
KVYVTVNTLV ANEEFEKLVN YLDFLYSIGV DAVIVQDMGV LKFLRENYPD LKVHASTQMT
VHNLEGVQEL AEKGVSRVIL SRELTLKEIK DIVQNSNIEI EVFVHGALCV SYSGQCFMSS
ILGGRSGNRG RCAQPCRLKY SLVDKEGKVL EKDLHLLSMA DLCTIEHIPK LIEAGITSFK
IEGRMKNAEY VASVVKAYRE AIDSFYEGRT FDSGKAIEEM SRIFNRGFST GYLFGVKPSK
MSYLSPKNTG VAAAEVISVT SKTSRLRLLR DIAKGDGISN EKGEKGQKVE IIFKNGKKVD
RAYEGDIIEL PLKFYVKEGE ILNKTYDVLL NDKLKNLLSK KIPIKIYAEL KKDKPLYIKI
QEGIHTVEVY SDEISQIAEK VSIEEDFLKD KLTQINDTAF YVEEIEVVVE KGLYMSVKGI
KEARRKAIEM LEKKKLEYYR REEKHTFFSL LPFKEKKEKV SLTFYTDKVE HLKIASQLGI
EYVYFNYKLD IKLLKKGLEL TKDSKTTVIP AFPSILREEI KRIKPQLEFL QDMGINKILV
SNLGLYHIAK NYDFEIFIDY PLNIFNNLAV DYVKPYAVTL SYELTLEQIK DIAKRSDVKF
EALIYGRLPL MTMEYCPIRN LVGCDRERCE KGYYFLKDRK GKLMPLKSNG FCRMQILNAD
VLLMLSIKEL KQAGLSFLRI HDTIEEDEEI EKVLKMHIEA LKGNEIEILE GKYTKGHFYR
GVL