Gene Tpen_1498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1498 
Symbol 
ID4601407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1448655 
End bp1450703 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content63% 
IMG OID639774273 
Producthypothetical protein 
Protein accessionYP_920898 
Protein GI119720403 
COG category[R] General function prediction only 
COG ID[COG4880] Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATAGGA GGATGCTCGT AGCCGTAGTG TTGCTAGCCC TTATCGGCGC GCTCACACCC 
TTCGCGCTCT ACGTAGCGCT GGGGCCCCGC GCGGCCCCCA CGGCTGAGAC TGGCGCCCGG
GGCCAGCGGC TCCTCGAGTT CTACCCGCTG GGTGGCGAGA CGTTCGCGTC CCTGAAGGAA
ATCCTGGACT TCGCGGAGTC CAGGTTGAAG GCACTGGAAA CAGCTCGGGC CTACACGGCG
GGCGCCGTGC TCCCCACCGA AGCTACGCGG GCGGCAACCG CGGAGGTATC GAAGACGAAC
GTCCAGGTAG TCGGGGTGGA CGAGCCGGAC ATCGTGAAGG CAGGTAGCGG GGTAATAGCT
GTAGCGAGGG GGTCGGAGGT CTACATCGTG GGGATCGCCG AGAGAAAGGT TCTCGGCAAG
ATCTCAGCTG GGTCCCAGGT TTTCGGCCTC TTCCTGGAGG GGTCTAGGCT CGCCGTGATA
ACGTCGACGC CACTGATCAG ACCCCTCGTA GTCCTCCCGG GAGCCGGGAG TCCCCCCTAC
CTGGGCGGCG TTGCCAACAC GACGCTCCTG GTGTACTCTA TCGAGGATCC GTCTAGCCCC
AGGCTTCTGT ACTCCTCTTC GGTGTCTGGC TACCCGCTCG GCGCGCGCAT GCTGGGCGGC
GTGGTGTACA TCCTCACGTC CGCGCCCCTA GAGGTAAAGC TCCCGCTGGT AGACGGCGAG
CCCATACCGC CAGGCTCGGT CGCGAAGATA GACCCGTCGG CATCCTGGTA CCTGGTGCTC
CTCTGCATGG ACCTCTCCAC GGGGAAGCAC TCGGCGTACG CGTTCACCTC AGCCCCGAGT
AGCTGGATCT ACATGGGCGA GAACAGGCTC TACGTCGCGT CCTACCCATC AGTATACGAG
GAGGCCCTCA AAGAGTTCCT GGAAGCTGTG TCCAAGCGCC TACCCGGCGG CGTCTCCGGC
AGGGTCTCCG GGCTGGCTTT TCAAGGGTTG CTCGGCGAAG CTCTCAACGC GCTGGAGGAC
TACCTATCCT CGGTGAGCTA CGATGTAGCG AGGGATATTC TGGAGAAGGC TGCAGCAGAG
GTCCCGCCGA TACCGGACAA GACGATCTTC AAGGTCTTCG CCGTCAGCGG GCTGAAGGTC
AGCTACCGGG GCTCCGTCGA GGTACCCGGG AGGGTTCTCG ACCAGTTCTC GATGGAGGAG
CTCGGCGGCT ACTTCGTCGT AGCCACGACC TCGGGGGAGT GGAGGGTGAG AGCCTCGATT
GCGAAGACGC TCATTACGCC CCCGAGCCCC CCAAGCCGCA ACGTAACGGT GGAGGTTTGT
AGCGGTGGCT CGTGCCGCGA AATCGTCGTG CCGATCACAG TGCAACCCAC CTCGCTACGC
GCCGGCAGGC CTATAGTCTA CGTGGGCGTG GAGCCCGCGG CGGACACCTC TAACAACGTT
TTCTCCGTAA GCCTAGAGGA CCTGAAGGTC AAGGGCAACC TCACCGGGCT AGCCCCCGGC
GAGAGAGTCT ACGCCTCGAG GCTCGTCGGG AGCACGATGT ACCTAGTTAC CTACAGGCAG
GTCGACCCGC TCTTCGCAGT GGACCTCTCG GACCCGTCCA GCCCGCGCGT TCTCGGCTAC
GTGAAGGCTC CCGGCTTCAG CGAGTACCTA CACCCCGTCA CCGGCAAGCT ACTCCTAGGG
GTGGGCTTCA CAGACGATAG GAGGCTCAAG GTATCCCTCT TCGACGTCTC CGACCCGAAG
GCTATAAGGG AGGCCTCCAC GGTCACTATC GCCGCCTCGT CCCCCGTAAC GTCCGACCAC
CACGCGTTCA GCTTCGACCC GAGCAACGGG AGGGCGTACA TACCCGTCAG CCTCTGGTAC
ACGGGGTCCG GCGGCGTAAT GGTGGTCGAA GTCAAGAATG GGAGGCTCTC CTTCGTGAAG
CTACTGGAGC ACCCGGGCGC CCTGAGGACA GTGTACACTC CAGACGAGGT ATTCACGGTG
TCGCAGGCAT CCGTCAATGT GTACTCCTCC AGCACCCTTG AGAAAGTAGG CGAAATACCC
CTCGACTAG
 
Protein sequence
MDRRMLVAVV LLALIGALTP FALYVALGPR AAPTAETGAR GQRLLEFYPL GGETFASLKE 
ILDFAESRLK ALETARAYTA GAVLPTEATR AATAEVSKTN VQVVGVDEPD IVKAGSGVIA
VARGSEVYIV GIAERKVLGK ISAGSQVFGL FLEGSRLAVI TSTPLIRPLV VLPGAGSPPY
LGGVANTTLL VYSIEDPSSP RLLYSSSVSG YPLGARMLGG VVYILTSAPL EVKLPLVDGE
PIPPGSVAKI DPSASWYLVL LCMDLSTGKH SAYAFTSAPS SWIYMGENRL YVASYPSVYE
EALKEFLEAV SKRLPGGVSG RVSGLAFQGL LGEALNALED YLSSVSYDVA RDILEKAAAE
VPPIPDKTIF KVFAVSGLKV SYRGSVEVPG RVLDQFSMEE LGGYFVVATT SGEWRVRASI
AKTLITPPSP PSRNVTVEVC SGGSCREIVV PITVQPTSLR AGRPIVYVGV EPAADTSNNV
FSVSLEDLKV KGNLTGLAPG ERVYASRLVG STMYLVTYRQ VDPLFAVDLS DPSSPRVLGY
VKAPGFSEYL HPVTGKLLLG VGFTDDRRLK VSLFDVSDPK AIREASTVTI AASSPVTSDH
HAFSFDPSNG RAYIPVSLWY TGSGGVMVVE VKNGRLSFVK LLEHPGALRT VYTPDEVFTV
SQASVNVYSS STLEKVGEIP LD