Gene Tpen_1191 details


Gene Information

Locus tag: Tpen_1191
Symbol:
ID: 4600436
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1130627
End bp: 1131784
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 64%
IMG OID: 639773967
Product: hypothetical protein
Protein accession: YP_920592
Protein GI: 119720097
COG category: [S] Function unknown
COG ID: [COG1415] Uncharacterized conserved protein
TIGRFAM ID:
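
The listed start, end, gene length, and protein length can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which is consistent with the listed length) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1130627, 1131784)
print(length, protein_length(length))  # 1158 385
```

Both values agree with the Gene Length and Protein Length fields of this record.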


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACCG GGGTAGCAGA GCTGCCTCTG CACGACGGGA GCGTCCCCCG CTGGCTCATA 
GCCAGGATGG AGAGGCTAGC GGGGATCCTC GTAGAGATGA TAGTCGAGGA GTACGGTACG
CGGGGCCTTC TCGAAAGGTT AGCCGACCCG GTGTACTTCC AGGCTATCAA CAACATAATA
GGCATGGACT GGGACAGCTC GGGCTCGACG ACGGTGACTA CGGCCGTGCT GAAGAAAGTG
CTGGAGGAGA GGGAGCTGGG GGTAAAGGCT TGCGGTGGGA AGGGCTCTTC GAGCAGGAAG
GCCCCCGAGG AGATAAGGCT CCACGCCGAG AAGTACGGGC TTGACCCGTC GGGGCTCGTG
TCCACCTCCT ACCTCGTCGC GAAGGTGGAC AGCGCGGCGC TACAGGCCGG CTACCAGCTG
TACCACCACG CGTTCTTCTT CGACGAGGAG GGCAGGTGGG CTGTCGTCCA GCAGGGTATG
AAGCCTTCTA CTCGCACTGC TAGGAGGTAC CACTGGTTCT CGGAGCGCGT GGGCGACGTC
ACCGTCGAGC CTCACAGCGG CATCCATGGG TTCAGGGAGC CCTTCGCGCT CAACACGGTA
GCCGCCGAGG CGGGGGAGTT CCGCAGGCTG GTAGTCGACC TCGTAGGTGA GGGGGCCTCC
AGGCTTGAAC GCCTCGTGAG CGAAGCGCTC CGCGTTCTGG AAGGCTACAG CCCGCTCGTC
AGCTACGCGC CCTACAGCGC CGAGAAGGCG CGGTCCCTGC GCGAGAGGAT GAGGCGCCTG
GGCAAGCCCT CCCTAAGCCG CGAGGCTCTA GCATCGCTCG CAGGCAGGGG CGTGGAGAGC
TTCAGGGATA TTCTCGCCGC GAAAGCCGTG GGGCCCTCAG CTATAAGGGC GCTCGCGCTA
GTCGCCGAGC TCGTATACGA GACGCCCCCG TCGTGGCGCG ACCCCGTAAC GCACCAAGTG
GACCCCTTCA AGTTCGCGTA CGCGGTGGGG GGAAAGGACG GGGTACCGTT CCCCGTGGAC
AGGAAGACGT ACGACGAGCT AATCTCGATA CTCGAAGAGT TGAAGCAACG CTTCAGAGGC
GAGCCGGGAG TATTCAGAAG ACTCGCCGAG CTTACGAAGA ACTGGACACC GCCGCCCGAG
GAGAAAGTAC CCACCTAG
 
Protein sequence
MKTGVAELPL HDGSVPRWLI ARMERLAGIL VEMIVEEYGT RGLLERLADP VYFQAINNII 
GMDWDSSGST TVTTAVLKKV LEERELGVKA CGGKGSSSRK APEEIRLHAE KYGLDPSGLV
STSYLVAKVD SAALQAGYQL YHHAFFFDEE GRWAVVQQGM KPSTRTARRY HWFSERVGDV
TVEPHSGIHG FREPFALNTV AAEAGEFRRL VVDLVGEGAS RLERLVSEAL RVLEGYSPLV
SYAPYSAEKA RSLRERMRRL GKPSLSREAL ASLAGRGVES FRDILAAKAV GPSAIRALAL
VAELVYETPP SWRDPVTHQV DPFKFAYAVG GKDGVPFPVD RKTYDELISI LEELKQRFRG
EPGVFRRLAE LTKNWTPPPE EKVPT
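
The protein sequence above can be recovered from the gene sequence by codon-wise translation, and the GC content field can be recomputed from the same string. The following is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted with the 10-base spacing shown here, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons may act as start codons, which does not matter for a gene beginning with ATG). The names `CODON_TABLE`, `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 18 bases of the gene sequence listed above.
print(translate("ATGAAGACCG GGGTAGCAGA GCTGCCTCTG"))  # MKTGVAELPL
print(translate("GAGAAAGTAC CCACCTAG"))               # EKVPT
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the GC content field.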