Gene Information |
Locus tag | Tpen_0892 |
Symbol | |
ID | 4600468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 840043 |
End bp | 841713 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639773670 |
Product | hypothetical protein |
Protein accession | YP_920296 |
Protein GI | 119719801 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
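The coordinate and length rows above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the stated gene length matches that convention) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 840043
END_BP = 841713
PROTEIN_LENGTH_AA = 556


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1671
print(expected_protein_length(length_bp))  # 556
```

Both results agree with the Gene Length (1671 bp) and Protein Length (556 aa) rows.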
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTCT CCAGCTCTCT CCAGTTTTTC GTCCTCTTCG ACCTGCGGGG CGCTTCCCAC GGAGGCGAGG AACCCTTGAT CGCGTCTAAC GCTTCGACGA GGCGCGCCTC GTACAGCTCG TCGCCCTTAA GAGTGCTCGC CAGGATACGC GAGGGGGCGC TTATCGCCCG CACGGTAACC CTCGAGCGCG CCCTCGCAGG GGCGCCCCCT CCACCATTTA AGGAAAGGTT AAGTATGGTT GGAGAGAAGA TGATGAGGAA TGAAGAGGTC GAGAAGGTTG TTAGGGAGGA GGTTTGGAGG CGTGCGGCTT TAAGCCTCTA CGCCACCAGA ACAGAGGATA AGAGGAAAGC CAGGGGCGGG AAGAAGGGCG AGATACACTA CCGCGGCCTG TATGACACTG TGAGCGAGGT TGACTGGGAC TTCACTAGGT TCCTCATAAA CGGGTACAGC GTCGTGCCGG ACGAAGTCTA CCCGAGGTTT CACCGCTTCA TAGACATCGA CGCGAGGAAG TACCTTCTAC TGAACGACGA CAGCAAGCCG AGAGAGGGAA GCGCTGTCGT AGAGCTACGC GATAGGCTAC AAGCAATCGT CGACGCTACC GAAGACGGCC TCAAAGTTGA GAGACACGGC AGAGTCTGGC ACGTTTACGT GCCCGGCGAA AACTGGTACG TAACTGTCTC GAGGCCCAAC GCGAAGAGCT GGAGGGTGCG CGTACCACTG GAAGGCTTCT GGGTTGAATC GGAGTTTCCC AAGGTTCTAG TGAATACTCC GAGCGATGTT CTCAGAAGCC TGCAGAAAGG GTGGCTTCTC ACAGATGTCA CTCCGCCGCA CGGGCGCATC AGCGACGTAC ATTTCAGCAC AACGCAGAGC TGGCAGTTGC CAGCGACGCT CGCCAGCTTC CCGGGAGAAG TCAGGCTCGG CGTCACGGCG GGCATTCTCG GCAGTACTAG GCTGAGCATC CAGTGGCAGG CGCGCGTTGA CGGTTACGAG GAGGAGCTGG GCTGGGCCTC CGAGCTCATC GGCGAGGTTA AACGTGCAGA GTTTCGTCAG CTCGTCGAGA GGTGTAAGGA GCTTAACGGT GACTCGGTGG CGCTGACGAC CGCCTTCCTG GGGGACGGCG AGCTCGAATT CTTCCTAAAG CTTCGGGTGC TCTTCTTCAA GGTTGGACAT GAAGACATCT ACTTGCCGGC GGAAAGCGCT ATAGCTAATG CCCGCTTAGC CGTGGAGAGA GCTAGCGAGT ACGTGCGCTT TGTCTCGCTG GTAACTAAGT GTCCAAAGAT TAAACACTTC CTCTTCGTCG GCTACGGATT GCCGCAGAAG AGAGGTAAGA AGGACGGCCA AAGAAATAAC CCGTTCTACG CCGAGGTGGC AGGGGCGCAG CTACACCTTG TCTACATCTC TACGAGGAAC CACGTCTACG CTAGAATCGC AGTCGAAGCT GTGCCTTCGG GCTGGGTGGA GGAGGCACGC GCTCAAGGCT GGGACGTCCG AGTGGTTCGA ATGGGTGGCA GGGAGTACTA CCAGGTCACT CATGCTTCTT TGATGGAGCA TGCCTGTCAC GACGAAGCAC TGCGTATAAC ACTCCTCGCC TTCACGAAGT ACAAGGCCGA GCGATACCCC AAGGCGCAAA GCCTCGTAGA ACGCCTCGAA AAACTGGGGA CAGAAGACTA A
|
Protein sequence | MSFSSSLQFF VLFDLRGASH GGEEPLIASN ASTRRASYSS SPLRVLARIR EGALIARTVT LERALAGAPP PPFKERLSMV GEKMMRNEEV EKVVREEVWR RAALSLYATR TEDKRKARGG KKGEIHYRGL YDTVSEVDWD FTRFLINGYS VVPDEVYPRF HRFIDIDARK YLLLNDDSKP REGSAVVELR DRLQAIVDAT EDGLKVERHG RVWHVYVPGE NWYVTVSRPN AKSWRVRVPL EGFWVESEFP KVLVNTPSDV LRSLQKGWLL TDVTPPHGRI SDVHFSTTQS WQLPATLASF PGEVRLGVTA GILGSTRLSI QWQARVDGYE EELGWASELI GEVKRAEFRQ LVERCKELNG DSVALTTAFL GDGELEFFLK LRVLFFKVGH EDIYLPAESA IANARLAVER ASEYVRFVSL VTKCPKIKHF LFVGYGLPQK RGKKDGQRNN PFYAEVAGAQ LHLVYISTRN HVYARIAVEA VPSGWVEEAR AQGWDVRVVR MGGREYYQVT HASLMEHACH DEALRITLLA FTKYKAERYP KAQSLVERLE KLGTED
|
| |
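The GC content row can in principle be recomputed from the gene sequence shown above. The sketch below assumes GC content is defined as the share of G and C bases among all bases, and that the sequence is pasted exactly as displayed, in space-separated blocks of ten. How the database itself computes or rounds the figure is not stated in the record, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces between the 10-base display blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two display blocks of the gene sequence, as a short example.
print(gc_percent("ATGTCCTTCT CCAGCTCTCT"))
```

Passing the full Gene sequence field to the same function gives a value to compare with the 58% in the Gene Information table.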