Gene Information |
Locus tag | Tpen_1638 |
Symbol | |
ID | 4600917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1584637 |
End bp | 1586226 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774411 |
Product | extracellular solute-binding protein |
Protein accession | YP_921036 |
Protein GI | 119720541 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
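
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon is excluded (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1584637, 1586226)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1590 529"
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.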
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.343389 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTAC CCAAACTAGG TAAAAAGGCT GTAGCGGTAG CGGTCGTCGC TCTGGTAGCG GTCGTCGCCG CCTACGTCGC GCTGGCGCCC AAGCCTCCGA AGTACAAGAA CGTGATAGTG ATAGGTACCA CGGACTCCGT GCAGACAACG CTCGACCCGT GCGAGGCCTA CGACTACCTC GGCGTAAACA TAATCCAGAA CATGGGTGAA GGTCTCTTCG GCTACGAGCC CGGCACAGCC AAGATAGTGA ACAAGCTCGC CGTGAGCTAT ACGATCAGCC CGGACAGGAA GGTCTGGGAG ATAAAGATCA GGAGGGACGC GAAGTTCCAG GACGGAACCC CCCTCACCGC GCAAGCCGTG GTCTGGTCCT GGGAGAGGAC GATAAAGCTG AACCAGGACC CAGCTTTCCT GATAGCCGAC CTCATAGAGA AGGCAGAGGC GGTGTCCGAC GACACTATAC GCGTGTACCT GAAACAACCC TTCGAGGAGA CCTACGTGAA GTCGCTGCTA GCTACGTGGG TAGCCTTCCC CGTGAACCCG AAAACCACGC CGATGGAGGT GGTCAAGCCT CCCAACACCG TGGACATGAT AGGGCCGTAC AAGCTCGCGA GCTGGAAGCC CGGCGAGTAC ATCGAGCTCG TAGCGAACCC GAACTACTAC GGCGAGAAGC CTAAAACCGA GAGGATTATA ATCCGCTTCT ACAAGGACGC CCAGTCCCTC AGGCTGGCCG TGGAGAACGG GGAGGTGGAC GTCGCGTTCA GGACGCTCTC GCCTGCGGAC GTTAAGGACA TAATCGCCAA GGGCCAGCTC CAGGTCGTAA GCGGGCCGGG CATAGCCCCC ATAAGGTACC TCGTCTTCAA CGTGAAGAAG GAGCCCTTCA ACGACAAGAG GGTGAGGCAG GCCATAGCCT ACCTGATAGA CAGGAACCAG ATAGTGAACA CCGTGTTCAT GGGGACATTC GCCCAGCCGC TCTACAGCAT GGTCCCCATA GGCTTCATAG GCCACAAGGA CAGCTTCAAG GAGAGGTACG GCGACAAGCC TAACGTCGAG GCGGCGAGGC AGCTGTTAAG GCAGGCGGGC TTCAGCGAGT CCAACCCGCT CAAAATGGAG CTCTGGTTCA CGCCGGTGAG GTACACGCCT GCCGAGCCGG ACATAGCCGC CATAATAAAG CAGAACCTCG AGGCCAGCGG GATGATAAAG GTAGAGCTGA AGAGCGCCGA CTGGGCTACG TACAGGTCTC TCTTCAAGCA GGAGAAGATA GCCGTCTGGC TGCTCGGGTG GTTCCCGGAC TTCCTCGACT CGGACAACTA CGTGAGGCCT TTCTACCACT CCGGGGCTAA CGGCTGGCTT CACGTGAACT ACAAGAACCC CGTCGTGGAC GAGCTGATAG ACAAGCAGGT CTTGCAGCCG ACGGAGGAGA GGATTAAGAC CCTCGCGCAG ATACAGGACA TAGTCGCCGA CGATGCGCCG ATAATCCCGC TCTGGCAGGA GGGTCAGTTC GCTGTCGCAC AGAAGAACGT GAAGGGGATA GTGCTGGACT ACTCCCAGAT ATTCAGGTAC TACCTCATAT ACGCCGAAGT GCAGGGCTAG
Protein sequence | MPVPKLGKKA VAVAVVALVA VVAAYVALAP KPPKYKNVIV IGTTDSVQTT LDPCEAYDYL GVNIIQNMGE GLFGYEPGTA KIVNKLAVSY TISPDRKVWE IKIRRDAKFQ DGTPLTAQAV VWSWERTIKL NQDPAFLIAD LIEKAEAVSD DTIRVYLKQP FEETYVKSLL ATWVAFPVNP KTTPMEVVKP PNTVDMIGPY KLASWKPGEY IELVANPNYY GEKPKTERII IRFYKDAQSL RLAVENGEVD VAFRTLSPAD VKDIIAKGQL QVVSGPGIAP IRYLVFNVKK EPFNDKRVRQ AIAYLIDRNQ IVNTVFMGTF AQPLYSMVPI GFIGHKDSFK ERYGDKPNVE AARQLLRQAG FSESNPLKME LWFTPVRYTP AEPDIAAIIK QNLEASGMIK VELKSADWAT YRSLFKQEKI AVWLLGWFPD FLDSDNYVRP FYHSGANGWL HVNYKNPVVD ELIDKQVLQP TEERIKTLAQ IQDIVADDAP IIPLWQEGQF AVAQKNVKGI VLDYSQIFRY YLIYAEVQG
| |
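
The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before any analysis. The following is a minimal sketch of that cleanup plus a simple GC percentage. `clean_sequence` and `gc_percent` are hypothetical helper names, and since the record does not say how its GC content field is computed, this is only one plausible definition.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, copied from the record above.
excerpt = clean_sequence("ATGCCGGTAC CCAAACTAGG TAAAAAGGCT")
print(len(excerpt), excerpt[:3])  # prints "30 ATG"
print(round(gc_percent(excerpt), 1))
```

Passing the full Gene sequence field through the same two functions gives a value that can be compared with the GC content field.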