Gene Tpen_1559 details


Gene Information

Locus tag: Tpen_1559
Symbol:
ID: 4600908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1508827
End bp: 1510182
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 61%
IMG OID: 639774332
Product: L-fucose isomerase and related proteins-like
Protein accession: YP_920957
Protein GI: 119720462
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2407] L-fucose isomerase and related proteins
TIGRFAM ID:
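
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted as a residue; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1508827
end_bp = 1510182
gene_length_bp = 1356
protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is made of whole codons and the last one is the stop codon.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, remainder, residues)  # 1356 0 451
```

Under these assumptions the span matches the listed gene length and the residue count matches the listed protein length.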


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.252286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTTGAGC TGACCGTGAA GCCTGTTCTT GCGTACAGCG TGTACGAGAG GCGCGAGGCT 
ACGAGCTGGC GCGCCTGGGG AGGGATAGCG GACGAGGAGA GCGCTGAGGA GGAGAGGCGT
AGGATCAAGG CCGAGCTAAA GAACCTCGAG CAGAGAGCCG GCTTCCCCCT CAAGTTTCTA
CCAGTAGGCA TGGTGAAGAG CTACGGGGAC GTGGATCGCC TGGAGAAGGC GGACGTTTAC
CTGCTCTACG CCGCCGGCGG GGACGAGGGG CTCCTGATAG GCGTTGCCTA CCGCGGGCCG
ACAGTTGTTT TCCTCAGGCA TAGGTCCGGC CCCCTCTACC TGTGGTACGA GATAATAGAC
GCCAGGCTGA TAAGGCGGTA CAACGACCGC ATCGGTCAGG TTTGGCTGGA CTACGACGAC
GTGGTCGTAG ACGACTACGA GGAGCTTCTG AGGAGGCTCC GGGCGCTCTA CGCCTTGAAG
AACACCCTCG GGGCTAGGGT TGTCGCGGTT GGAGGCGCGT CGGGGTGGGG TATAGGCGGG
AAGGCTGTCG AGCTTGCGAG GGCGAGGTGG CACCTCGACA TAGTCGAAGT CTCCTACGCC
GAGCTTGCCG AGAGGATAAA GAAGGCTATG GGCGACGAGC GGTGCTTGGA GGAGGCTAAG
CGGATGGCTA AGGAGTACCT CTCGGAGGAG GGGGTACGCC TGGAGACTAG GGAGGAGTTC
GTCGTGAACG CCTTCGTCCT GTACCTCGTG TTTAAGCAAC TCCTCGAGGA GCACGAGGCC
AGGATAATAA CTGTGAACGA GTGCATGACA ACGATAATGC CTATCGCGAA GACCACCGCG
TGCCTCGCGC TCAGCCTCCT GAACGACGAG GGTTACCTCG CGCTCTGCGA GAGCGACTTC
GTCGCTATAC CCGCGGCGAT ACTTCTCCAC TACGCCTCCG GGAAGCCCGT GTTCCTCGCA
GACCCCACGT TGCCCCACGA CGGCATAGTG ACTGTAGCCC ACTGCACAGC GCCTAGGCTC
ATGGACGGCA GAAGTAGGGA GCCGGCCCGG ATACTCACGC ACTTCGAGTC CGACTACGGA
GCGGCACCCA AGGTGGAGTT CAGGAAGGGG CAGGTAGTGA CGGTGCTCAT CCCAGACTTC
GAGGAGAAAA CTTGGGTGGG CTTCAGGGGG AAGATTGCAG AAGCCCCCTT CCTGCCCATC
TGCAGGAGCC AAGCCGAGAT CGAGATAGAG GGGGACTGGC GACGCCTCCT AAGGGAGCTC
AGGGGCTTCC ACTGGCTGAT AGTATACGGG GACTACCTGC GCGAGGTTAG CTACGCGCTC
AAAAAGGTGG GGATGGAATT CGTGGAGATA GGCTAG
 
Protein sequence
MVELTVKPVL AYSVYERREA TSWRAWGGIA DEESAEEERR RIKAELKNLE QRAGFPLKFL 
PVGMVKSYGD VDRLEKADVY LLYAAGGDEG LLIGVAYRGP TVVFLRHRSG PLYLWYEIID
ARLIRRYNDR IGQVWLDYDD VVVDDYEELL RRLRALYALK NTLGARVVAV GGASGWGIGG
KAVELARARW HLDIVEVSYA ELAERIKKAM GDERCLEEAK RMAKEYLSEE GVRLETREEF
VVNAFVLYLV FKQLLEEHEA RIITVNECMT TIMPIAKTTA CLALSLLNDE GYLALCESDF
VAIPAAILLH YASGKPVFLA DPTLPHDGIV TVAHCTAPRL MDGRSREPAR ILTHFESDYG
AAPKVEFRKG QVVTVLIPDF EEKTWVGFRG KIAEAPFLPI CRSQAEIEIE GDWRRLLREL
RGFHWLIVYG DYLREVSYAL KKVGMEFVEI G
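
The gene and protein sequences can be related to each other by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the pipeline that produced this record. It assumes the standard amino acid assignments (which translation table 11 shares with the standard code) and treats TTG, GTG and ATG as start codons that are read as methionine when they open the CDS; that set is only a subset of the table 11 start codons. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: a subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq, cds_start=True):
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and cds_start and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence above (each full line is 60 nt, so lines stay in frame).
first_line = "TTGGTTGAGC TGACCGTGAA GCCTGTTCTT GCGTACAGCG TGTACGAGAG GCGCGAGGCT"
last_line = "AAAAAGGTGG GGATGGAATT CGTGGAGATA GGCTAG"

print(translate(first_line))                  # MVELTVKPVLAYSVYERREA
print(translate(last_line, cds_start=False))  # KKVGMEFVEIG
```

The two translated fragments agree with the first 20 and last 11 residues of the protein sequence above. Passing the whole gene sequence to `translate` and `gc_content` would extend the same check to the full record.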