Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1285 |
Symbol | |
ID | 4600593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1225156 |
End bp | 1227171 |
Gene Length | 2016 bp |
Protein Length | 671 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774061 |
Product | hypothetical protein |
Protein accession | YP_920686 |
Protein GI | 119720191 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.328698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTGGTTT CGCGTTGGTT GTTTAGGGTT GGGCTTGTAG CGGTAGTCCT GCTTCTACTC GTAGCTCGAT GCGTGCGCGC CGAGAGGGTT TCGCTGCTTC CTCGCTCGAT AAGCTACTAC AGCATTAGGG TTTCCGCGGC GCCGGCGTAC GTCGTGCTCG TGAGTCATCA GCCGTGGCCG GCCGTGGCTT ACCTGGCGAC TGCCGAGGAG TGTGAGCATT TCCTCGCGCG CGGGGGTAGT GACATCGAGG TTTTGAGGGT GGTTAGCGTG CCAGGCTACG GCGTGGTCTC CTTGAGGATA GATAGCCCCG GCTCGTACTA CGTGTTTTTC TACTCGGACG AGAGGCTCTA CGTCTACGTT AGGTACTACG GCTTGCCGGC GGGCCTTGCC AGCTACCCGG ACGTCGCGGT GAATACGAGC ATGGTTCTCG GCTTCTTCAA CGTAAGCGCA GCCTCCGCTA AGAGCTACAG CAGTAGAGCC GCGGAGGCGG ACGCGTGGAG CCTCCAGCTG AACGCCGTCG TAGAGGTTTC CTTGGCGGGC GGCAGGAAGC AGTACTACTG GGTTCAAAAC ATGGTCGGCG GCATAGAGGG TACTCGTTCG TACGAGAACA AAACGGAGAA GGCTATTATG TACCAGGTTT GGAACAACAT TTGGAACAAT ACGGGGCGGC TTAGCCTGCT AAGCGACGAG AGGATATCCG GCTGCGGTGG TGTGTACAGG GATAGGGACG AGTACTACTA TGCGTGCGTG TACACCTACA GCCCCTGCGA CCTGCCCCTG GCAGGCTTCC TGGTCGTCAG GGCTTACGCG AGTAACGGCG CTGTACACGT CGATTTCGGC TACGTCACCG CTCAGAGCGG CGACTACAGG CCCGCGATTA TAACGTGGTA CGACAACGTG ACCATAAGGA CTAACCCGCC GGCCGTAGGC GCCGGCATAG TGGCTACGTC GAGCTACCTG AACGGGCGTG GGCTCCCGCT GAACGTTGAG CTCGTGTTCT GCGGTTTCTG TTGCTCGGAG CACGCTACGT TCAGCGAGCT GGAGGCGAGG CTCACCCTTG CCTACTGGAG GGGCGGCGGC TGGGCGCCTT TCCCGAATCT GTACAGCTTT GGCGTGAGCA CAAACGAGAC TGCTACCAAC GTGGCTGTGC GGTACGCTGA CGGCTTCGCG GTCGTGGAAA GAGGCGCGCT GAGCCCGGCT AAGCTCGCGG AGAGGCCGAG GCTACCGGCC CTGCCGATGA CTAGCGTCAG GTACTGTAGC ATGTTGACCG GGGAGTGCTC CGTCAGGTAC GTGTACTCCC CTGTAACGCT CGCCGAGAAG GCTAGCGTCG TGTACGACGG CAACCGCACC AGGTACGTGC TTTTAGGGTA CTACGTGGAC GGGCGGTTCA CCGAGGACCC GCCGACCATA ACTCCTAGTA GCTCGTGGTT CACGCACTAC GAGGTTAGGT CGAGCTACAA GGCTCAGCAC TTCGTGGTCG TCTCCAGCCC CCTCCCCGTA ACGGTTAACG GTACGAGGAC TACGCGCTAC GCCGGGTGGC TCGACGCGGG CTCCTCCATA GTCGTCGAAG TCCCGGATCG GGTAGTGCTC GAGAACGGGA CGCTGTTCAC GCCCCTGAGT AGCGGCGGGG TGTTCCGCGT CGACTCGCCG GTAACAGTCG AGGTAGCCTG GCAACCCTAC TACCTCGTCA CCGTTACGAG CCAGTACCCG GTGCTCGTCA ACGGCGAGAG GACGGAGAGG TACAGCAGGT ACCTGGCGCC CGGCACCCAG CTCGAGGTGA AGGCGGAGCC CGTGCCGCTG TACGGCGGGC TCGTAACCAT GGACCCGAAC GCCTCTTCGA TAAGGCTACT GGTCACCGCG CCCGTAACGC TGTCGGTGTC CTACTCGCCG AACTACACGC GCCTAGTAGC CGTAGCCGCC GTAGCCACCG TTCTAGTCGC TGTAGCCGTC GCCAGGAGAA GGAGGCGCTT CGAGGCCTTC TACGAGCCAC CCCTCGACGA GCTCTTAGCC GCCTAG
|
Protein sequence | MLVSRWLFRV GLVAVVLLLL VARCVRAERV SLLPRSISYY SIRVSAAPAY VVLVSHQPWP AVAYLATAEE CEHFLARGGS DIEVLRVVSV PGYGVVSLRI DSPGSYYVFF YSDERLYVYV RYYGLPAGLA SYPDVAVNTS MVLGFFNVSA ASAKSYSSRA AEADAWSLQL NAVVEVSLAG GRKQYYWVQN MVGGIEGTRS YENKTEKAIM YQVWNNIWNN TGRLSLLSDE RISGCGGVYR DRDEYYYACV YTYSPCDLPL AGFLVVRAYA SNGAVHVDFG YVTAQSGDYR PAIITWYDNV TIRTNPPAVG AGIVATSSYL NGRGLPLNVE LVFCGFCCSE HATFSELEAR LTLAYWRGGG WAPFPNLYSF GVSTNETATN VAVRYADGFA VVERGALSPA KLAERPRLPA LPMTSVRYCS MLTGECSVRY VYSPVTLAEK ASVVYDGNRT RYVLLGYYVD GRFTEDPPTI TPSSSWFTHY EVRSSYKAQH FVVVSSPLPV TVNGTRTTRY AGWLDAGSSI VVEVPDRVVL ENGTLFTPLS SGGVFRVDSP VTVEVAWQPY YLVTVTSQYP VLVNGERTER YSRYLAPGTQ LEVKAEPVPL YGGLVTMDPN ASSIRLLVTA PVTLSVSYSP NYTRLVAVAA VATVLVAVAV ARRRRRFEAF YEPPLDELLA A
|
| |