Gene Tpen_1771 details

Gene Information

Locus tag: Tpen_1771
Symbol:
ID: 4601948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1711010
End bp: 1714345
Gene Length: 3336 bp
Protein Length: 1111 aa
Translation table: 11
GC content: 63%
IMG OID: 639774544
Product: hypothetical protein
Protein accession: YP_921169
Protein GI: 119720674
COG category:
COG ID:
TIGRFAM ID:
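
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1711010, 1714345)
print(length, protein_length(length))  # prints: 3336 1111
```

Under these assumptions the computed values agree with the listed Gene Length (3336 bp) and Protein Length (1111 aa).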


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.621318
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGGCT ACCCCCTTTT AGGCGGCACG CGCTCGAGGC TACTCGTAGC TTTCCTGGCT 
GTCCTCCTGG TGGCGCTGGT ACGGGTAGCC GTCTCCGCGC CTGCATACAC TTCCACGCAC
TTCGAGGTTT ACGACTCCGC GGGAGCGGGC GTAGCGTACG CGCAGAGTGT CGCCGCGGAG
CTGGAGTCGG CGTACTCGGC TCTCAGCGGC TCCGGGGCTA GCCTAGCGCC TCCGTGCTCT
GGTAGCCTCT ACGTCGTGAA CGTGACCCAG TTGCCTGGCG GGGAGGGAGG ATACGTGAGA
TGGACGTACT ACTACGATTC GTCGGGCAGG ATAACTTCTT CCTGTGTGAA CTACATCGCG
ATAGCGCCTG GGCTTACCTC GTCGACTTTG AGGCATGTCG CCTTCCACGA AATGGTTCAC
GTGTCGCAAG TCTCCTACTT CAGGTACACT GCGGTGCCGA GTAGCTACCC GTGGTACATA
GAGGCCAGCG CGGAGGGGAT CTCCAGCGAG CTTAGCGGTG TCTGCGGGTG GGAGCCCTAC
TACTTCAGCA ACTCCCTCTA CTCCAGCAAC CCCTACAGCT ACTCGGGCTC CGCGTGGCAG
AGCTACGCCT ACGGAGCCTT CTACTACTGG GTGCTCGCTT CCGGGACCTC TACAGTTCCG
GGAGCCCTTT CCGGTAGCGT CTCGGGTAGT AGCGTGGACT CTGCGTGGGT TGACTCCACC
TACGTGTCCT TCCTCCTAGC GATAGTCAAG GGCGTATCCA TCTGCGGAAG CATCTACAAG
CCGAGCTTTC AGAGCGTCTC CGTTACCGGC ACATCTACGA GCTTCCAGGT GTCTCTCAGC
GGTCTCAGCG CATCGTACTA CAGCGTCTCG CTACCGGGGC CCGGCATGGT CACGGTGAGC
GTGTCCGGCA ACGTGCGTAG CAACATAGCC TTGAACCAGG CTTTCTACGA GTCTAACTCC
TCGCTCCTCC TAGCCTTGGT CAACCCGTCT ACCTCCAGCG CTACCTACCA GGTTACGATA
ACCTTCTCGC CGCCGCTCCT GGTAAAGGTG TCCGGCGGGA CCTTCTACCC CGTGGACGGC
AGGCTCAGCC TCCAGCTCTA CGTAACGTAC GCGGGTCAAC CCGTGACGGG GACGGTGAGG
GTAAACGGGG TAGACGTCCA GGCATCCTCG GGCTACGCGT CCGTAACGTT GACGAACGTT
ACCTGGGGTA GCGTCCAGCT AACGGTGGAG TACTCCGGCT ACTCCTCCAC GCTGAGCGTC
GCACTGCAGA AACCATCGGT TACTCTATAC ACGCAGACCC CGCTCTTCCT GTCGCCGAGC
GGGTACGGGG ACCTCGTCCT CAAAGTGTCG AACCCGAACC AGGTCGCTCT CTCGCTCCCC
CTGGTGGTAC GCCCGCCGGT GAACAGCTCC CTAGCTTTCC AGAGCCTAAG CCAGACCCTG
AGCCTTAGCC CCGGCGATAA CACCGTTAGG CTGAGCTTCT CGGTGACGGG CGCGCCCGCC
CAGGGGACGG GCTACGTGGA CGTACTCACG GGGAGCAACG ACAAGGTATC CGTGCCCTTC
TCCGTTGTCC CAGCGTCTCT CTCGGTCGTA AAGGCTAGCT ACGACGGGGC GCGCGGCAAG
ACGATAGTAG ACGTCGCGGT TCAACCAGCC GCCCTAACGG TTACCGTCGA GGTAGGCGGG
CTCGGCGGGA AGGCGGCTGT GCCTCTCTCC ACGTACATCG TGGGCGTGGT GGAGGTATCG
ATACCAGCAC CGTCGGCGTC CCTGTCCGCA AAGCCAGAGC TCGTGGCGCC GTCCTGGTTT
ACGGCTAGAG TCTCCGTCTC GCTGTCGGCT CAGGGTAGCT GTCCGCCGTA CCCCGTGAGT
TACTCGCTGA GCCTGAGGGT GAACGGCTCC GACATCGGGA GCGCCTACTT CGCATGCGGA
GCCTCCAAGG ACGTGGAGGC AACGGTTAAC GCGTCGCGCT CAGACAGGGA GGTCTACCTC
TTCGTGCTCA ACGGGAACCC CTCCTGGAGC GCAAAGGTCA GGGTAGTACC CCCGACCATC
AGCGCCTCCC TGGTCAGGTG GACTGTACTT GGGAACGGTA GCATCGTGGA GGCTAAGGTC
GCGGTCGCGG GGCCGCACAA GTACCTGGTT CTCGGCAGGG TCTTGTCGAA CGAGAGCTTC
ACGCTGACCC GGAGCCTGCC GGCCGGCGTC AAGGCGCTCG ACGTGGACAC CGGCTTCGCC
AAGTTGCGCC TCGAGATGCC GCCCGTGAAG GTCCACGTCT CTGCGCCCGA GGTAGTCCTC
ATACCCAACG CCGTCACCGT GAGGGTAACC GTGGAGACGG AGGCTGTCGT CAAAGCCTCC
CTGGAGGTCA GGCTGAACTC CTCCCTGCAG AGAGAGCTTC CGGTAGACAC GTCGCTGAAC
AGGACCTCGT TCGTGTTCGA GCTGAAGCCG GGCGCCCCCG GAGTGTACAC GCTGAGCGTA
GGCTCCTGGT TCGGCGGGAG CGAGGCGTCC TTCTTCTACG TGGTAGTCAA GGGGGTCGAC
GTCGAAGCGC CTCCCTTCGT GCTCGTAAAC GAGCAGGCAG AGGTTCACGT AAAGCTGTAC
GTCTACCCGA AGCTACCGAT CCACGCGAAC CTCTCCGTGG CCGGATGCGG GGTGGGCGAG
TACAGGAGGA TCCTCGGGAA CCAGTCCTTA ACGCTACGCT TCGACAAGGC TTGCTCCGCC
GTAGTAACGG TGTCCGTGCT CAACTTCACG GCCTCCAGGA GGATTTACTG GGACTACCTC
AACCTGGCGC TTGAGAACGT GCTCGGAACC CTCGACGGGG CCCCCATAGT GGGTAACGGC
ACGGTGACGG GCAGGGCTTA CTTTGCAAAC GGGTCCGCCG TACCCGCCAG GGTGAAGGTC
GACGGCGAGT ACTCGGTAAC CGTGGCGGAG ACGGGCGAAC GGGTATTCAG GCTTTCCGTG
GAGTACCTCG GCTGGCACAA CGAAACCACG GTCAAAGCTT TCCTAGTGCC GGCGACCCTC
TACCTGAAGG CAGTCAACGT GAGCAGGGAG CTCGGAAGCC CGGCGAGCCT CGAGGAACGC
ATAAGGCTCG CGGTGGTCTC GGGGGAGTGG GGTAGCCTTT CCCGCGCGCT GAAGGTCTAC
GAGGAGTCGC GCGAGAAGGC TAAGGCGGCA GACCCACTGG CGATGCTCGC AAAGAGCCTC
TCGGAGAGGT GGGCTACCGA GGGGGACGAG AAGCTGATAG AGTACTCCGA CTTCATACTG
AGGTACGAGG TGCCTATATA CGCGTCCATC GCGCTGTTAG CCGTGGCGGT CCTAGCCGCC
CGGAGGCTCC GGCGCAGGCG CGGCGAGAAG GTCTAG
 
Protein sequence
MRGYPLLGGT RSRLLVAFLA VLLVALVRVA VSAPAYTSTH FEVYDSAGAG VAYAQSVAAE 
LESAYSALSG SGASLAPPCS GSLYVVNVTQ LPGGEGGYVR WTYYYDSSGR ITSSCVNYIA
IAPGLTSSTL RHVAFHEMVH VSQVSYFRYT AVPSSYPWYI EASAEGISSE LSGVCGWEPY
YFSNSLYSSN PYSYSGSAWQ SYAYGAFYYW VLASGTSTVP GALSGSVSGS SVDSAWVDST
YVSFLLAIVK GVSICGSIYK PSFQSVSVTG TSTSFQVSLS GLSASYYSVS LPGPGMVTVS
VSGNVRSNIA LNQAFYESNS SLLLALVNPS TSSATYQVTI TFSPPLLVKV SGGTFYPVDG
RLSLQLYVTY AGQPVTGTVR VNGVDVQASS GYASVTLTNV TWGSVQLTVE YSGYSSTLSV
ALQKPSVTLY TQTPLFLSPS GYGDLVLKVS NPNQVALSLP LVVRPPVNSS LAFQSLSQTL
SLSPGDNTVR LSFSVTGAPA QGTGYVDVLT GSNDKVSVPF SVVPASLSVV KASYDGARGK
TIVDVAVQPA ALTVTVEVGG LGGKAAVPLS TYIVGVVEVS IPAPSASLSA KPELVAPSWF
TARVSVSLSA QGSCPPYPVS YSLSLRVNGS DIGSAYFACG ASKDVEATVN ASRSDREVYL
FVLNGNPSWS AKVRVVPPTI SASLVRWTVL GNGSIVEAKV AVAGPHKYLV LGRVLSNESF
TLTRSLPAGV KALDVDTGFA KLRLEMPPVK VHVSAPEVVL IPNAVTVRVT VETEAVVKAS
LEVRLNSSLQ RELPVDTSLN RTSFVFELKP GAPGVYTLSV GSWFGGSEAS FFYVVVKGVD
VEAPPFVLVN EQAEVHVKLY VYPKLPIHAN LSVAGCGVGE YRRILGNQSL TLRFDKACSA
VVTVSVLNFT ASRRIYWDYL NLALENVLGT LDGAPIVGNG TVTGRAYFAN GSAVPARVKV
DGEYSVTVAE TGERVFRLSV EYLGWHNETT VKAFLVPATL YLKAVNVSRE LGSPASLEER
IRLAVVSGEW GSLSRALKVY EESREKAKAA DPLAMLAKSL SERWATEGDE KLIEYSDFIL
RYEVPIYASI ALLAVAVLAA RRLRRRRGEK V
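
Both sequences above are printed in space-separated blocks of ten characters, so they need whitespace removed before any computation. The sketch below shows one way to do that, compute a GC fraction, and translate a coding sequence. It assumes the codon assignments shared by NCBI translation tables 1 and 11 (the two tables differ in permitted start codons, not in the amino acid assigned to each codon). The helper names are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
import itertools

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGCGCGGCT ACCCCCTTTT AGGCGGCACG"
print(translate(prefix))  # prints: MRGYPLLGGT
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.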