Gene Tpen_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1029 
Symbol 
ID4600508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp970336 
End bp971733 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content66% 
IMG OID639773807 
ProductAIR synthase-like protein 
Protein accessionYP_920432 
Protein GI119719937 
COG category[O] Posttranslational modification, protein turnover, chaperones
[S] Function unknown 
COG ID[COG0309] Hydrogenase maturation factor
[COG1992] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0402037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGACCC CGTTGGAGAA CCGCCTCGGG AAGATCCCCC TAGCCGCGCT GCGGGAGCTT 
CTGCTCGGCG CTGGCGCCGG GCTCGAGCTG GACGCGAACC CCCTAACGGG AGACCTGCTC
GTAACGACGA ACCCGGCGAT AGGCGTCCCG GAGGAGTGGC TCGGCTTCTT CGCCTACCAC
TACTCCGCAT CCAACCTGGC TGTTAGCTTT GCGCGCCCGG AGCTAGCCGT GCTCAGCGCG
GTGTTCCCGG AGGGCTACCC CGGCGAGGCG GTGGAGCGCG TACTGAGGTC TTTCGTCGAG
GAGTGCAGGA AGTACGGGAC CAGGCTCGTG GGAGGGCACA CCGCCAGGTA CAGGGGGCTG
GAGCTACCCC TCCTATCCTC GACCATGGTG GGCAGAGCCG GGAGGCGTAG GGAAAAGCCG
GCGGGCGGCG ACAGGGTAGT CATCGTCGGC GAGGTGGGCG CCGAGGCGGC GTGGCTCGCG
GGCGGAGACG TAGACCCCTC GACCCTCACC CCGCTCCCCG CCGCCCTCGC ACTACAGGAA
GAACCATCCA TCAAGCTCAT GCACGACGTC TCCGAGGGAG GGGTCATCGG CGCCCTTCTA
GAGGTAGCCC AGAGGTACGG TGTAGCGCTG GAGGTCTCGT CGAGGGACGT AAAGGTGTTC
GAAGGGCTCC CGCGAGGGGT GGAGCCTCTC TCGGCGCCGA GTTACGGAGC GATAATCGCT
GTTACCGGCG ACGCGGAAAG CCTGCTGAGC GCATGCGAGG AGAAAGGCCT CACGTGTAGC
AACGCGGGAG TAGTGGCTGG GCCCGGCAAC CCCCTGGTGG TAGTCGACGG GAGAGAGTAC
GCGGAGCCCC CCGTGAGCCC CCTCGTGTAC CTTTACGGCG CGGAGAGAAG GTCGCCCGAG
GAGGCAGCCG TCGCCCTGGC CGCCGAGGAG CTCGTAAGGA TCTCCAGGCT GCTAGACGTG
ATACCGGAGG TCGGCGCTAA CATCGCCTAC GCGCCTGGCC CCGTCAGGAG CCCTAGAGAC
GTCCTAGCGC TCGACGGCAG GATAGTCAAG ACGACGGAGG GGGCTAGGCT GTGCGGGAAG
CCGCGCTACG GGGCATCGAG GCACCTCGCC GAGGTCCTCG CATCCGCCAG GGAGGCCGGC
ATGCCTTACA GCGCCGCGGT AAACCTCAAG TACTCCGAAG ACCTCCTCGA AAAACTGAAG
TCCATAGGCC TCCCGGTGTG CGACGCGACG CCCTACGAGG CCTCGTGCCC AGTCTCGGAG
GCAATAAGGT CCGGGTGTAG GGCGCAGGTC TACTTCTACG CCGGTAAGCC GGGGCTCGAA
GCATCGATAG TCTTACTAGC ACGCGACCCC CTAGAAGCCA TAGAAATACT GAGGAAAGTC
GTCGAGCAGA AACGCTAG
 
Protein sequence
MVTPLENRLG KIPLAALREL LLGAGAGLEL DANPLTGDLL VTTNPAIGVP EEWLGFFAYH 
YSASNLAVSF ARPELAVLSA VFPEGYPGEA VERVLRSFVE ECRKYGTRLV GGHTARYRGL
ELPLLSSTMV GRAGRRREKP AGGDRVVIVG EVGAEAAWLA GGDVDPSTLT PLPAALALQE
EPSIKLMHDV SEGGVIGALL EVAQRYGVAL EVSSRDVKVF EGLPRGVEPL SAPSYGAIIA
VTGDAESLLS ACEEKGLTCS NAGVVAGPGN PLVVVDGREY AEPPVSPLVY LYGAERRSPE
EAAVALAAEE LVRISRLLDV IPEVGANIAY APGPVRSPRD VLALDGRIVK TTEGARLCGK
PRYGASRHLA EVLASAREAG MPYSAAVNLK YSEDLLEKLK SIGLPVCDAT PYEASCPVSE
AIRSGCRAQV YFYAGKPGLE ASIVLLARDP LEAIEILRKV VEQKR