Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1334 |
Symbol | |
ID | 4601309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1287476 |
End bp | 1289266 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639774109 |
Product | membrane protein-like |
Protein accession | YP_920734 |
Protein GI | 119720239 |
COG category | [S] Function unknown |
COG ID | [COG1470] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.513509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTTA AGGCAGGCGG TAAAGAAAGA ACTCTCCTCC ATACACGTTA CGCTCTCCTA GCGCTGATGA TCATACTGGT TATGACTCCC CGTCTATTAA CCGCGCAAAG CACTAGCGTG AATGTACATG GAGTAGTCGT TGACCCGAGG GGGGCGCCGA TAAGCGATGT ATCAATACTG ATCTTCGGCG AGGATAACAC TCTTGTCGCC AGAGTTAAAA CCAGCATGAC GGGTGACTTT TGGGCTTTAC TGGCTCCAGG CACGTACAAA GCTTCCCTAA TTAAAGTGGG CTACGAGGCT AAAACCATTT CCTTTAGCAT TTCCGGCGAC AGGCTACACG TAGAGCTGGG AGAGATCACC CTGGATTACA GCCTCTCAGT CTCCGTAGAG CTCAGAGATG TAAGAGCCAG CTGTCTTTCC ACGCTCCGCA TACCGGTGGT TTTGGCGGAG AAAGGTTCGA GAGAAGAAAC CGTCACCCTC TCGGCGAGCG CACCTTCCGG GTGGACCGCT GGCTTCTACC TGGGAGACCT CGAGGTGAAA AGCATAGTCT TAAGCCCTGG GCAGACCTTG AAGCTTGACC TAGTGCTGAA AGTACCTTAC AACGCCTCCG GTCGGTACAA CATAACGGTA GACGTTCTTG GATACACTCT TCAGAGGAGA GTGATCACAG TCGACATCGA GCATAGAGAC CCCCAGCTCG TAACATCTAC CTACCTAAGC GTAAAGGCGT CTCCCGGCTC AACGGTGAGC CTCGACGGTT TAGAGATAAC GAACAAGCTT CCCGATAGGA CTTCAGGGGT AGTCTTCCTG CTACTACCGA GCCGGTGGTC AGGGAGTATT CTCGACTCAA CTAGCGGCAA TAACCTCTAC AAGCTCTCCT TGAACCCCGG AGAAAGCGTG AAAGCTAAAG TCGTCCTGCA GATCCCCGAT ACCGCGGCAC CGGGTAACTA TACAGTAGGC GTTGTGTTCA GAGGCGTGGA CCCGTACTTC GAGTCGAAGC TCTGGCTCAA CGTTACCGTC GTAAAGGGAA AGCCCCTCGC GAAGCTAGAA ACCGAGACTC CTTTCCTCAA CACCTACGCC GGCCGTAGTG CAAGCTTTCT CGTCACTACT AGGAATATCG GCGAAGGAGA CGGAGTCGTA GAGCTAGTCG TGAAAGGACT GCCGCCCGGC TACAGCTGGC GTATAGAAGA CCTTAACGGG AATGTTATCT CCAAGCTGTA CCTGAAGGCG GGAGAAAGCA GGCAATTAAA AGTTGTCGTA AGCGTACCTC TCCTCGCCGA GCCCTCGGTC ATTCCTTTCA TACTCGAAGC CAACACGAAC TATTCTCGTG TAAGCCTACC TCTCAACCTG GGAGTTATGG GAAGCTACAG CCTCGAGTAT ACAACCCAGA ACTTCTACTT GGAGGTGACA TCGGGCTCTT CGGCCACCTT CCAATTAGGG GTCAAAAACA CCGGCTACAG CTCATTAACA AACGTAAGAA TAGAGGTGTC TAACGTGCCG AGGGGGCTCC GCGTAAACAT CTCTCCAGAA GTCGTTCTAT CGCTTAAAGC CCAGGAAAAC GCCAACTTCA CGGTAACAGT TTACGCAGAG CCGGACGTAA GCGCTGGAGA CTACTACATA ACCTTGAAGC CGCTGGCCGA CCAGCTCAGC GAGGATCAAT CCCTAGTTGC AACGAGGCAA CTCCACGTAT ACGTAAAGAC GGGGGCCGGC GCGGTATACA TAGGGCTGGG AGCTCTACTC GCACTAGTAG TGCTTTTAGT CGCGGTCTAC AGAAAGTTCG GCAGGAGGTG A
|
Protein sequence | MTLKAGGKER TLLHTRYALL ALMIILVMTP RLLTAQSTSV NVHGVVVDPR GAPISDVSIL IFGEDNTLVA RVKTSMTGDF WALLAPGTYK ASLIKVGYEA KTISFSISGD RLHVELGEIT LDYSLSVSVE LRDVRASCLS TLRIPVVLAE KGSREETVTL SASAPSGWTA GFYLGDLEVK SIVLSPGQTL KLDLVLKVPY NASGRYNITV DVLGYTLQRR VITVDIEHRD PQLVTSTYLS VKASPGSTVS LDGLEITNKL PDRTSGVVFL LLPSRWSGSI LDSTSGNNLY KLSLNPGESV KAKVVLQIPD TAAPGNYTVG VVFRGVDPYF ESKLWLNVTV VKGKPLAKLE TETPFLNTYA GRSASFLVTT RNIGEGDGVV ELVVKGLPPG YSWRIEDLNG NVISKLYLKA GESRQLKVVV SVPLLAEPSV IPFILEANTN YSRVSLPLNL GVMGSYSLEY TTQNFYLEVT SGSSATFQLG VKNTGYSSLT NVRIEVSNVP RGLRVNISPE VVLSLKAQEN ANFTVTVYAE PDVSAGDYYI TLKPLADQLS EDQSLVATRQ LHVYVKTGAG AVYIGLGALL ALVVLLVAVY RKFGRR
|
| |