Gene Information |
Locus tag | Teth514_1050 |
Symbol | |
ID | 5876554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | + |
Start bp | 1083762 |
End bp | 1085006 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641541405 |
Product | PDZ/DHR/GLGF domain-containing protein |
Protein accession | YP_001662685 |
Protein GI | 167039700 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
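The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1083762
END_BP = 1085006


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the listed Gene Length (1245 bp) and Protein Length (414 aa).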
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000225912 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCTT TTATTAAAGT TTTTTGGTGG GCAATAGAGA CGATAGCTTT ATCAGTTTTT AATCCCTTTT TCTGGATAGT GATAATTCTG ATAGTAATGC AGTATAGAAA CAAAATAGCC ATAGAAAGGG AAATTATGGG ACAGGAGCAA GAACCAATGA AAGAATTGGT TCTTGACTCT GTCTTTTATG GTGTAATTGC TGCAATAATT GGGAGCTTTC TCATGATATT TTTAGGAATA ACAATAGAAA ATATAGGGCT TCAGTATGTA TGGCCTTTGG TCATAGTATT GATGCTTGTC AATCCTAGAT ATATTTGTTT TTCTTATGCA GGAGGAATAG TTTCTCTATT CAGCTTGTTT TTTGGATTTC CTAATATAAG TGTGCCTGCC CTTATGTCTA TTGTAGGTAT TCTTCATTTG ATGGAAAGCC TTTTAATATA TATAGATGGT CCAAGGAATG CCACTCCTAT CTTTGTAAGG CTTAAGGATG GGCGAGTGGC AGGAGGCTTT ACTATGCAAA AGTTTTGGCC TATTCCTTTT GCTGCATTAA CTATTGCTAC TGGGATAACA ATAACGGGAC AGGGTGTAAA TATGCCAGAT TGGTGGCCCA TCATCAAACC CTCTGGCATT GATTTAAATA ATGTAATTTT TTTGATGATG CCAGCAATTG CAGCTTTGGG ATATGGAGAT TTAGCTTTAA CTCAACTTCC AGAGAAGAAA ACGCGCATTT CTTCGCTGCG TTTATTTTGC TTTAGCATAG TTTTAATTAT ATTAGCAGTA TTAGGAATCT ATATTAAATT ATTTCAATAT TTGGCAGCTA TTTTTGCTCC AGTAGCTCAT GAGCTTTTGA TTTTGATAGG ACAAAGAGAA GAAAGGGAAA ATCCACCTCT TTTTGTAGCG CCAGATACAG GAGTGATGAT TTTAGCGACA GCAAAAGGTT CTCCCGCTAG AGAAATGGGA ATTAAACCGG GGGATGTAAT ATTAAAAATT AATGATATTC CTATTAATGA GCCAGATGAT ATAATACGAA TACTCAATGA ACGGCCTTCT GTGATGTGGA TCACTGTAAA AGATTTAAAT GGCAATTATA CTAACTATGA ATATAAGGAT CCACAGGGAA TTTTGGGATT AGGAGTTTTA ATAGTGCCAA AGGCAACTTC TATGATTTAT GAAATTAATG GGGAAGGAAT ATTTATAAAA AAATTAAAAG AAATCTTTAA AAACATTTTT AAGAAAAACA GATAA
|
Protein sequence | MTAFIKVFWW AIETIALSVF NPFFWIVIIL IVMQYRNKIA IEREIMGQEQ EPMKELVLDS VFYGVIAAII GSFLMIFLGI TIENIGLQYV WPLVIVLMLV NPRYICFSYA GGIVSLFSLF FGFPNISVPA LMSIVGILHL MESLLIYIDG PRNATPIFVR LKDGRVAGGF TMQKFWPIPF AALTIATGIT ITGQGVNMPD WWPIIKPSGI DLNNVIFLMM PAIAALGYGD LALTQLPEKK TRISSLRLFC FSIVLIILAV LGIYIKLFQY LAAIFAPVAH ELLILIGQRE ERENPPLFVA PDTGVMILAT AKGSPAREMG IKPGDVILKI NDIPINEPDD IIRILNERPS VMWITVKDLN GNYTNYEYKD PQGILGLGVL IVPKATSMIY EINGEGIFIK KLKEIFKNIF KKNR
|
| |
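The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a rough sketch of how one might strip the spacing, estimate GC content, and translate the CDS. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for elongation; alternative start codons are not handled here, which is an assumption that is harmless for this gene because it starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino acids for elongation (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence listed above.
prefix = clean_sequence("ATGACAGCTT TTATTAAAGT")
print(translate(prefix))  # MTAFIK, matching the start of the protein sequence
print(gc_fraction(prefix))
```

Passing the full Gene sequence field through `clean_sequence`, `gc_fraction`, and `translate` should allow the listed GC content and Protein sequence to be checked against the record.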