Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Teth514_2342 |
Symbol | |
ID | 5875920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 2381055 |
End bp | 2382158 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641542686 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001663938 |
Protein GI | 167040953 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000759245 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTC AATTAAAGAG TTTTTTTGCT GTTGCAATTA TAACTACTTT GGTTAGCTCT TTGGTATTTG TATATATTGC ACCAAAGTAT TTATGGGGTA AAGTTATTCC TATACCTTAT CCTCCTTCTT CAGGAATAAG AACAGAAATA GTGATACCAA CAAAAGAATC TCCTACAATA GCGGAAGTAG TGGCAAAAAA AGACACTCCT GCTGTTGTGG GAATTACTAC TGTAGAATTT CAAAGAGAAT ATTATTTTAT AGAAAAAGCT GTAGAAGGTG TAGGTTCAGG TTTTATTGTT AATCCAAATG GCTATATTAT AACAAACAAT CATGTTGCTA ATGAAAAATC CAAAAATATA AAGGTATATT TAAGCAATGG TAGTATATTA CCAGGCAAAG TTTTGTGGAC TGACCCTGTT TTAGACCTTT CTATATTAAA AATTGAAGCA AAAGATTTGC CAGTTATACC TTTGGGAGAT TCTGACAAGA TATCGGTAGG GCAAACTGCA ATAGCAATTG GAAATCCTTT AGGTCTTAGG TTTCAAAGGA CGGTTACTTC TGGAATAATA AGTGCTTTAA ATAGAAGTCT TCCTATTACT GAGGATGGAA AACCGCGAAT CATGGAAGAC TTGATACAGA CAGATGCTTC TATAAATCCA GGAAATAGCG GAGGACCTTT AGTGGATTCT CGAGGCTACG CAATTGGAAT AAATACTGCG AAGGTGACAA CCGCTGAAGG ATTGGGCTTT GCTATACCTA TAAATATTGT AAAGCCTATA CTTAAGAAAG TGATAGAAAC TGGTACATTT AAAGCACCTT ATATTGGCAT AGTTGCCTAT GATAAAGAAA TTGCCAGCTA CATTTCCGCA GATGTTTACA TATATGAGGG GATATATGTG GCAGATATTG ACCCTAAAGG TCCTGCTTAT AAAGCTGGCA TTAGAAAAGG TGATATTATT TTGCAGGTAG ACGGAAAACC TGTAAATACT ATGACAAGTT TAAAATGTAT TATTTATGAA AAAAATCCAG GAGACAAAAT AAAAGTCAAG TACAAAACCG TAACAGGAAA GACAGGATAT ACTACTATAA CTCTCGGGGA ATAG
|
Protein sequence | MQRQLKSFFA VAIITTLVSS LVFVYIAPKY LWGKVIPIPY PPSSGIRTEI VIPTKESPTI AEVVAKKDTP AVVGITTVEF QREYYFIEKA VEGVGSGFIV NPNGYIITNN HVANEKSKNI KVYLSNGSIL PGKVLWTDPV LDLSILKIEA KDLPVIPLGD SDKISVGQTA IAIGNPLGLR FQRTVTSGII SALNRSLPIT EDGKPRIMED LIQTDASINP GNSGGPLVDS RGYAIGINTA KVTTAEGLGF AIPINIVKPI LKKVIETGTF KAPYIGIVAY DKEIASYISA DVYIYEGIYV ADIDPKGPAY KAGIRKGDII LQVDGKPVNT MTSLKCIIYE KNPGDKIKVK YKTVTGKTGY TTITLGE
|
| |