Gene Information |
Locus tag | Tpen_0132 |
Symbol | |
ID | 4600717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 107087 |
End bp | 108820 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639772886 |
Product | membrane protein-like |
Protein accession | YP_919545 |
Protein GI | 119719050 |
COG category | [S] Function unknown |
COG ID | [COG1470] Predicted membrane protein |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
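The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal check, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence listed on this page ends in TAG). The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above (Start bp, End bp).
start_bp, end_bp = 107087, 108820
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1734
print(protein_length(length_bp))  # 577
```

Both results match the Gene Length (1734 bp) and Protein Length (577 aa) fields of the record.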
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGGA GAAAGTTAGC CCTGGTCGCG CTCGTATCCC TAGCCTTCGT GCTAGCCGCA TCTATTCCTT CGTATCCGCA GTCGAATGTA TTCTTCTTTG GGAAGGTGGT CGACGAGAGC GGGGCGCCGG TAGGGCTTGC GGAGATAACG GTGTACAAGG GGAACTCCAT AGTTACATTC ACCAGAACCT CGCCTGACGG TAGCTTCAAC CTCTACGTAC CAGTGGGAAG CTACACGTTT CTGGTATACA AGAAGGGCTA CGCGCCCATC TACATGACCC TCGAGGTAAC CCCTGAGAAA GGGGGGAGCC TCGGCGTACT GGTTCTGAAG AAAGGAGTAA CCGTAGTACC CGATGTAACG TCTGTTTATA CTTCTCAGGG AGATACCGTC AGAATACCGG TAAAAGTGTT CAACAAGTGT CTCGACCCTG TTCTCGTAAG CTTCTCCATC GAGGTCCCCC AGGGCTGGAG AGCCTACTTC GTGGGACCAA ACAACCTAGT AGCCTCCGAT TTCTACATAG AAGCAGGTAG CAACAGGAGC CTGGTGTTCG TGGTAGAAGT GCCTCTAAAC GCCAGCGAAA AGGAGAGCGT GAGAGTTTCC TTTACTTGGT TTAACCTCTC GGGACACGTT GACTTCACGT TTACGGTTAG GCAACGCGAG TGGAGGCTCT TAGAGCTACC TACCACATCG GTGAAAAGCT TTCCTGGAGG GCAACTGTCG ATACCCATAA ACGTGTCTAC GCCGTTCAGC TACGAGGCTA CCGTAACCCT CTCGGTGTTT GCGCCGAGCA ACTTCATAGC CTCCTTGGTC GACGAGAACG GCTTGACTGT GCAGTCTGTA ACGGTGAGGC CGGGGGAGAA GCGCAGGCTT CAACTAGTGA TATACGTGCC GCCCACGGCG AGGATTTCTA CGTACAGCTT AAGGGTAGTG GCGAGGTCCG GACCTCTTCA AAGCATCGTG AACCTCGACG TCATAGTAGA AAGCGCCTAC GACCTCCTCA AGATAAGCCC AGGCGCGACG AGCATTAACG TGACCTCGGG TTCAACCGTT ACCTTCAGGG TAGCCTTGAA GAACGAAGGC AACATGCCTA CAGTCGCGTT GCTAAGGGTT CAGACGAGCT CTCCACTGCT CAGGGCGTAC ACCTCCGTAT CGGGCGAGCC CGTGGCCTCA CTCTACCTGG TGCCCGGAGA AGAGAAAGCC TTAGCCTTGA TAGTAGAAGT TGACCCCTCC ACGCCCTCCG GGATATACCT CGTATCCCTA CGCGCGAACG GCACTACGAG TACTGCGGAG CAAAGCTTCC TGGTAAGAGT GACCGGCACA AGGAAGATCG TAATTTCCAA CATAATCTTC CAGGTTACCG GCGCTCCAGG AATGACGACT ACGTACAAGC TAGGGCTGGT GAACGCTGGG AACACCCCAT TGAATAACGT TATGGTTCAA GTGGAGGCTC CGAGCGGCGG ATTCGAAGTA ACGGTGCATC CCTCCACACT GGCTTTACCG CCTAACTCCA CCGCAAGCGT AGACATATCG ATACTTATCC CCTCGAACGC TAGCGAGGGG TTCTACAACC TACCCATACA CGTGGTGGCA GGCGACATCA GGCTGGACAG AGTTCTCGTG CTCGAAGTTA GGGGGGAACA AGGACTAGGC TTCACGTATA TTGCAACCGG GCTCTTCCTC TTGAGTTTAA CGGTGGTTTC CTACTCTAAG AGGCAGAGGA GAGCCAGGGG GTAG
|
Protein sequence | MSRRKLALVA LVSLAFVLAA SIPSYPQSNV FFFGKVVDES GAPVGLAEIT VYKGNSIVTF TRTSPDGSFN LYVPVGSYTF LVYKKGYAPI YMTLEVTPEK GGSLGVLVLK KGVTVVPDVT SVYTSQGDTV RIPVKVFNKC LDPVLVSFSI EVPQGWRAYF VGPNNLVASD FYIEAGSNRS LVFVVEVPLN ASEKESVRVS FTWFNLSGHV DFTFTVRQRE WRLLELPTTS VKSFPGGQLS IPINVSTPFS YEATVTLSVF APSNFIASLV DENGLTVQSV TVRPGEKRRL QLVIYVPPTA RISTYSLRVV ARSGPLQSIV NLDVIVESAY DLLKISPGAT SINVTSGSTV TFRVALKNEG NMPTVALLRV QTSSPLLRAY TSVSGEPVAS LYLVPGEEKA LALIVEVDPS TPSGIYLVSL RANGTTSTAE QSFLVRVTGT RKIVISNIIF QVTGAPGMTT TYKLGLVNAG NTPLNNVMVQ VEAPSGGFEV TVHPSTLALP PNSTASVDIS ILIPSNASEG FYNLPIHVVA GDIRLDRVLV LEVRGEQGLG FTYIATGLFL LSLTVVSYSK RQRRARG
|
| |
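The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the spacing and compute a GC percentage. The page does not state how its GC content figure is calculated or rounded, so this is an assumed method (G plus C count divided by total length), and the helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed method)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence above, used as a small example.
fragment = clean_sequence("ATGTCTAGGA GAAAGTTAGC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 40.0
```

Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field, subject to whatever rounding the database applies.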