Gene Information |
Locus tag | Tpen_1771 |
Symbol | |
ID | 4601948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1711010 |
End bp | 1714345 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639774544 |
Product | hypothetical protein |
Protein accession | YP_921169 |
Protein GI | 119720674 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.621318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGGCT ACCCCCTTTT AGGCGGCACG CGCTCGAGGC TACTCGTAGC TTTCCTGGCT GTCCTCCTGG TGGCGCTGGT ACGGGTAGCC GTCTCCGCGC CTGCATACAC TTCCACGCAC TTCGAGGTTT ACGACTCCGC GGGAGCGGGC GTAGCGTACG CGCAGAGTGT CGCCGCGGAG CTGGAGTCGG CGTACTCGGC TCTCAGCGGC TCCGGGGCTA GCCTAGCGCC TCCGTGCTCT GGTAGCCTCT ACGTCGTGAA CGTGACCCAG TTGCCTGGCG GGGAGGGAGG ATACGTGAGA TGGACGTACT ACTACGATTC GTCGGGCAGG ATAACTTCTT CCTGTGTGAA CTACATCGCG ATAGCGCCTG GGCTTACCTC GTCGACTTTG AGGCATGTCG CCTTCCACGA AATGGTTCAC GTGTCGCAAG TCTCCTACTT CAGGTACACT GCGGTGCCGA GTAGCTACCC GTGGTACATA GAGGCCAGCG CGGAGGGGAT CTCCAGCGAG CTTAGCGGTG TCTGCGGGTG GGAGCCCTAC TACTTCAGCA ACTCCCTCTA CTCCAGCAAC CCCTACAGCT ACTCGGGCTC CGCGTGGCAG AGCTACGCCT ACGGAGCCTT CTACTACTGG GTGCTCGCTT CCGGGACCTC TACAGTTCCG GGAGCCCTTT CCGGTAGCGT CTCGGGTAGT AGCGTGGACT CTGCGTGGGT TGACTCCACC TACGTGTCCT TCCTCCTAGC GATAGTCAAG GGCGTATCCA TCTGCGGAAG CATCTACAAG CCGAGCTTTC AGAGCGTCTC CGTTACCGGC ACATCTACGA GCTTCCAGGT GTCTCTCAGC GGTCTCAGCG CATCGTACTA CAGCGTCTCG CTACCGGGGC CCGGCATGGT CACGGTGAGC GTGTCCGGCA ACGTGCGTAG CAACATAGCC TTGAACCAGG CTTTCTACGA GTCTAACTCC TCGCTCCTCC TAGCCTTGGT CAACCCGTCT ACCTCCAGCG CTACCTACCA GGTTACGATA ACCTTCTCGC CGCCGCTCCT GGTAAAGGTG TCCGGCGGGA CCTTCTACCC CGTGGACGGC AGGCTCAGCC TCCAGCTCTA CGTAACGTAC GCGGGTCAAC CCGTGACGGG GACGGTGAGG GTAAACGGGG TAGACGTCCA GGCATCCTCG GGCTACGCGT CCGTAACGTT GACGAACGTT ACCTGGGGTA GCGTCCAGCT AACGGTGGAG TACTCCGGCT ACTCCTCCAC GCTGAGCGTC GCACTGCAGA AACCATCGGT TACTCTATAC ACGCAGACCC CGCTCTTCCT GTCGCCGAGC GGGTACGGGG ACCTCGTCCT CAAAGTGTCG AACCCGAACC AGGTCGCTCT CTCGCTCCCC CTGGTGGTAC GCCCGCCGGT GAACAGCTCC CTAGCTTTCC AGAGCCTAAG CCAGACCCTG AGCCTTAGCC CCGGCGATAA CACCGTTAGG CTGAGCTTCT CGGTGACGGG CGCGCCCGCC CAGGGGACGG GCTACGTGGA CGTACTCACG GGGAGCAACG ACAAGGTATC CGTGCCCTTC TCCGTTGTCC CAGCGTCTCT CTCGGTCGTA AAGGCTAGCT ACGACGGGGC GCGCGGCAAG ACGATAGTAG ACGTCGCGGT TCAACCAGCC GCCCTAACGG TTACCGTCGA GGTAGGCGGG CTCGGCGGGA AGGCGGCTGT GCCTCTCTCC ACGTACATCG TGGGCGTGGT GGAGGTATCG ATACCAGCAC CGTCGGCGTC CCTGTCCGCA AAGCCAGAGC TCGTGGCGCC GTCCTGGTTT 
ACGGCTAGAG TCTCCGTCTC GCTGTCGGCT CAGGGTAGCT GTCCGCCGTA CCCCGTGAGT TACTCGCTGA GCCTGAGGGT GAACGGCTCC GACATCGGGA GCGCCTACTT CGCATGCGGA GCCTCCAAGG ACGTGGAGGC AACGGTTAAC GCGTCGCGCT CAGACAGGGA GGTCTACCTC TTCGTGCTCA ACGGGAACCC CTCCTGGAGC GCAAAGGTCA GGGTAGTACC CCCGACCATC AGCGCCTCCC TGGTCAGGTG GACTGTACTT GGGAACGGTA GCATCGTGGA GGCTAAGGTC GCGGTCGCGG GGCCGCACAA GTACCTGGTT CTCGGCAGGG TCTTGTCGAA CGAGAGCTTC ACGCTGACCC GGAGCCTGCC GGCCGGCGTC AAGGCGCTCG ACGTGGACAC CGGCTTCGCC AAGTTGCGCC TCGAGATGCC GCCCGTGAAG GTCCACGTCT CTGCGCCCGA GGTAGTCCTC ATACCCAACG CCGTCACCGT GAGGGTAACC GTGGAGACGG AGGCTGTCGT CAAAGCCTCC CTGGAGGTCA GGCTGAACTC CTCCCTGCAG AGAGAGCTTC CGGTAGACAC GTCGCTGAAC AGGACCTCGT TCGTGTTCGA GCTGAAGCCG GGCGCCCCCG GAGTGTACAC GCTGAGCGTA GGCTCCTGGT TCGGCGGGAG CGAGGCGTCC TTCTTCTACG TGGTAGTCAA GGGGGTCGAC GTCGAAGCGC CTCCCTTCGT GCTCGTAAAC GAGCAGGCAG AGGTTCACGT AAAGCTGTAC GTCTACCCGA AGCTACCGAT CCACGCGAAC CTCTCCGTGG CCGGATGCGG GGTGGGCGAG TACAGGAGGA TCCTCGGGAA CCAGTCCTTA ACGCTACGCT TCGACAAGGC TTGCTCCGCC GTAGTAACGG TGTCCGTGCT CAACTTCACG GCCTCCAGGA GGATTTACTG GGACTACCTC AACCTGGCGC TTGAGAACGT GCTCGGAACC CTCGACGGGG CCCCCATAGT GGGTAACGGC ACGGTGACGG GCAGGGCTTA CTTTGCAAAC GGGTCCGCCG TACCCGCCAG GGTGAAGGTC GACGGCGAGT ACTCGGTAAC CGTGGCGGAG ACGGGCGAAC GGGTATTCAG GCTTTCCGTG GAGTACCTCG GCTGGCACAA CGAAACCACG GTCAAAGCTT TCCTAGTGCC GGCGACCCTC TACCTGAAGG CAGTCAACGT GAGCAGGGAG CTCGGAAGCC CGGCGAGCCT CGAGGAACGC ATAAGGCTCG CGGTGGTCTC GGGGGAGTGG GGTAGCCTTT CCCGCGCGCT GAAGGTCTAC GAGGAGTCGC GCGAGAAGGC TAAGGCGGCA GACCCACTGG CGATGCTCGC AAAGAGCCTC TCGGAGAGGT GGGCTACCGA GGGGGACGAG AAGCTGATAG AGTACTCCGA CTTCATACTG AGGTACGAGG TGCCTATATA CGCGTCCATC GCGCTGTTAG CCGTGGCGGT CCTAGCCGCC CGGAGGCTCC GGCGCAGGCG CGGCGAGAAG GTCTAG
|
Protein sequence | MRGYPLLGGT RSRLLVAFLA VLLVALVRVA VSAPAYTSTH FEVYDSAGAG VAYAQSVAAE LESAYSALSG SGASLAPPCS GSLYVVNVTQ LPGGEGGYVR WTYYYDSSGR ITSSCVNYIA IAPGLTSSTL RHVAFHEMVH VSQVSYFRYT AVPSSYPWYI EASAEGISSE LSGVCGWEPY YFSNSLYSSN PYSYSGSAWQ SYAYGAFYYW VLASGTSTVP GALSGSVSGS SVDSAWVDST YVSFLLAIVK GVSICGSIYK PSFQSVSVTG TSTSFQVSLS GLSASYYSVS LPGPGMVTVS VSGNVRSNIA LNQAFYESNS SLLLALVNPS TSSATYQVTI TFSPPLLVKV SGGTFYPVDG RLSLQLYVTY AGQPVTGTVR VNGVDVQASS GYASVTLTNV TWGSVQLTVE YSGYSSTLSV ALQKPSVTLY TQTPLFLSPS GYGDLVLKVS NPNQVALSLP LVVRPPVNSS LAFQSLSQTL SLSPGDNTVR LSFSVTGAPA QGTGYVDVLT GSNDKVSVPF SVVPASLSVV KASYDGARGK TIVDVAVQPA ALTVTVEVGG LGGKAAVPLS TYIVGVVEVS IPAPSASLSA KPELVAPSWF TARVSVSLSA QGSCPPYPVS YSLSLRVNGS DIGSAYFACG ASKDVEATVN ASRSDREVYL FVLNGNPSWS AKVRVVPPTI SASLVRWTVL GNGSIVEAKV AVAGPHKYLV LGRVLSNESF TLTRSLPAGV KALDVDTGFA KLRLEMPPVK VHVSAPEVVL IPNAVTVRVT VETEAVVKAS LEVRLNSSLQ RELPVDTSLN RTSFVFELKP GAPGVYTLSV GSWFGGSEAS FFYVVVKGVD VEAPPFVLVN EQAEVHVKLY VYPKLPIHAN LSVAGCGVGE YRRILGNQSL TLRFDKACSA VVTVSVLNFT ASRRIYWDYL NLALENVLGT LDGAPIVGNG TVTGRAYFAN GSAVPARVKV DGEYSVTVAE TGERVFRLSV EYLGWHNETT VKAFLVPATL YLKAVNVSRE LGSPASLEER IRLAVVSGEW GSLSRALKVY EESREKAKAA DPLAMLAKSL SERWATEGDE KLIEYSDFIL RYEVPIYASI ALLAVAVLAA RRLRRRRGEK V
|
| |
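The coordinate and length fields in this record are related by simple arithmetic, which the minimal Python sketch below checks. It assumes inclusive 1-based coordinates and a coding sequence that ends in a stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `clean_sequence` are hypothetical helpers written for this illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


def clean_sequence(raw: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


if __name__ == "__main__":
    length = gene_length(1711010, 1714345)
    print(length)                  # 3336, matching "Gene Length"
    print(protein_length(length))  # 1111, matching "Protein Length"
    print(clean_sequence("ATGCGCGGCT ACCCCCTTTT"))  # ATGCGCGGCTACCCCCTTTT
```

The same `clean_sequence` helper can be applied to the full gene or protein sequence before passing it to other tools, since the spaces in the record are display formatting only.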