Gene Tpen_0660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0660 
Symbol 
ID4601618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp611240 
End bp612280 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID639773433 
Productbifunctional phosphoglucose/phosphomannose isomerase 
Protein accessionYP_920065 
Protein GI119719570 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID[TIGR02128] bifunctional phosphoglucose/phosphomannose isomerase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGTTC AGGCTGTAAC TTTCCCAGAG TCCCTGATGC ACGGGATAGG TGCTTACTCC 
AGGTTGCACA AGCTACTGCA CTCAAAGATC CCGGTGAGTC CAAAGGGCAT CGTAGTGAGC
GGTATGGGGG GCTCCTTCAT AGGTGGTCTT TTCCTCCAGG ACGTGCTTTA CAGCAGGGCG
AAGGTTCCGC TGATACTCCT GAGGGATACT TACCTCCCAG CGTTCGTAGA CGAGAACTAC
CTCTTGGTAG CTGTTAGCTA CTCGGGGAAC ACGGAGGAAA CGATACGCGT AGTTGCCCAG
GCTACACAGG CGAAGGTCCC GGTGGTCGCC GTGACTTCTG GAGGGCTTTT ACAGAGGTTT
GCCGAGAAAT ACGGGTTGCC CGTAGTTTCT CTTCCCATGG GGTTGCCTCC CAGGGCCGCT
TTCCCGTACA TGGCATGCGC TCTCTCTGCG ATTGTAGAGG TGGCGATTGG GGAGGCTAAC
CTGCTCTCGG AGATCGAGAG TTGTGCCGAA AACCTATCTG CGCGTAGAGA CGAGGTCTTC
TCGAGGGCCT CCGAGAGCGC CGAGAACGTA AAGTCCCTAG TCGAGAAGGG GCTTACCCCC
CTCGTATACT CGTATAGACC GTACATCTCG GCCGGCTATA GGTTCAAGAC CCAGCTAAAC
GAAAATGCCA AAATACACGC CTTCTATGCT GACCTCCCGG AAGCCAACCA CAACGAAATA
ATGGGCTGGT CCTCGCCTCT AGCCGGAAAA TTCTTCGCGG TGCTAATCAG GGGACGAGAG
GAGCCTTTCT ACATGGACGC AAGGATAAGT TTCCTGCGAG AGCTCTTCGA AACGCAGGGT
ATACCATTCT TGGAGCAAAA ACCCGTCTTC GACGCTCCGC ACACGTGTGA GTTACTACAG
CTGATCTACA TGCTCGACTT GGTAAGCGTA GCTGTCGCCT TAAAAATGGG GGTCGACCCA
ACACCTGTAG ACACGATTAC AAAGTTAAAG AAAGTCCTAG ATGCACGCAT AAACCTAAAG
GAGGAACTCG GCCTAGAGTA G
 
Protein sequence
MLVQAVTFPE SLMHGIGAYS RLHKLLHSKI PVSPKGIVVS GMGGSFIGGL FLQDVLYSRA 
KVPLILLRDT YLPAFVDENY LLVAVSYSGN TEETIRVVAQ ATQAKVPVVA VTSGGLLQRF
AEKYGLPVVS LPMGLPPRAA FPYMACALSA IVEVAIGEAN LLSEIESCAE NLSARRDEVF
SRASESAENV KSLVEKGLTP LVYSYRPYIS AGYRFKTQLN ENAKIHAFYA DLPEANHNEI
MGWSSPLAGK FFAVLIRGRE EPFYMDARIS FLRELFETQG IPFLEQKPVF DAPHTCELLQ
LIYMLDLVSV AVALKMGVDP TPVDTITKLK KVLDARINLK EELGLE