Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1776 |
Symbol | |
ID | 4601937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1718817 |
End bp | 1719968 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774549 |
Product | hypothetical protein |
Protein accession | YP_921174 |
Protein GI | 119720679 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTTT GGAGGCTCGC GTTAAAGTTC GTCGAGAGGG ACATACTCTT CAAGAAGCTC ATAGCCGCGC TAACGATCCT CGCGATAGCG AGCGGCGTCG CCACGTTCGT AAGCCTCAGG ATCCTAAGCC TTGGAAGCCG CACGGCGGCT ATGAACATTG TACAGCAGGT TCTCCCCGGG GAGGTCGTTG TCTACGGGCA GGGGCTCTAC GACGTGTCCG AGGACGTTCT CTCCGATATC AAGAGGCTTC CCGGCGTGAA GGACGTTACA CCGGCAATCC TAGTCACGGG GTACGTTGGG AGGAACGCCG TCTTCCTGCT CGGCGTCCGC CCCGAGGACA TAAAGAACGT GGTTTCGAGG TTCGTCGACG GTCAGCCCTT CACGGGGTTG ACGGGGGCCT ACGCGATAGC GGACGTCGGC TTGGCTAGAA AGCTGGGGTT AAAGGTGGGC GACAAGGTCA CCGTTAAGCC CCCCTACGGG ACCTACTTCA AGCAGTACGA GGTAGTAGGG ATAGCCGAGG TCGCTATGAA GATCGAGGAG ATAGGAGCGG CGGGGGGCTA CCTGATCCTT CCCCTCAGGG AGGCGCAGAA CCTCCTCGGG AGGCCCGGCT ACGTGAGCAT GGCGGTCGTG AAGGTCGAGG ACGGCGTCGA CCCGCAGGAG GTCAAAACAC TGATCTCGCT TGTATACCCG GGGTCTAGGG TGATGCTCCG CGAGGAGGTC ATAGGGGTAG TCTTCAAGGT AATGTCGCTG ATAGAGGGCC TCCTCCTCTC CACAACGCTG GTAGGCCTCG CTGTGGCAGT CTTCGGGACT ACGAGCACGA TTACCTCCAC GGTTAGGGAG CATCAACGCG AGATAGCGAT TATGCGCGCG GGAGGGTCTT CGAGGAGGGA CATAGCGTTG ATATTCATGC TCGAGTCGCT GGTCTACGGC GTGTCGGGAG GCATCCTTGG GATAGTGTTC GGCATAGTCG GCGCCCAGGT AGGTATAGAG GTGGTGTCTT CCTACGGCTT CCTGAACCCT CCGCTCATAC TCGAGCCCGC GACCCTACTG CTGGGCTTCC TGCTCGCCGC GGGGCTCAGC GTTCTGTCGT CGCTATACCC GGTCTGGAAG GCTACCTCGA TAAGGCCTGT CGAGGTGTTG AAGAGTGAGT GA
|
Protein sequence | MSVWRLALKF VERDILFKKL IAALTILAIA SGVATFVSLR ILSLGSRTAA MNIVQQVLPG EVVVYGQGLY DVSEDVLSDI KRLPGVKDVT PAILVTGYVG RNAVFLLGVR PEDIKNVVSR FVDGQPFTGL TGAYAIADVG LARKLGLKVG DKVTVKPPYG TYFKQYEVVG IAEVAMKIEE IGAAGGYLIL PLREAQNLLG RPGYVSMAVV KVEDGVDPQE VKTLISLVYP GSRVMLREEV IGVVFKVMSL IEGLLLSTTL VGLAVAVFGT TSTITSTVRE HQREIAIMRA GGSSRRDIAL IFMLESLVYG VSGGILGIVF GIVGAQVGIE VVSSYGFLNP PLILEPATLL LGFLLAAGLS VLSSLYPVWK ATSIRPVEVL KSE
|
| |