Gene Tpen_1660 details


Gene Information

Locus tag: Tpen_1660
Symbol:
ID: 4601242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1606886
End bp: 1608049
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 62%
IMG OID: 639774433
Product: myo-inositol-1-phosphate synthase
Protein accession: YP_921058
Protein GI: 119720563
COG category: [I] Lipid transport and metabolism
COG ID: [COG1260] Myo-inositol-1-phosphate synthase
TIGRFAM ID:
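
The start, end, gene length, and protein length fields are related by simple arithmetic, and a short sketch can check that they agree. The function names below are illustrative and not part of the record. The sketch assumes 1-based inclusive coordinates and a single untranslated stop codon at the end of the CDS.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1606886
END_BP = 1608049


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1164, matching "Gene Length"
print(protein_length(length_bp))  # 387, matching "Protein Length"
```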


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCGGTAG ACGTGGTTCG GGTAGCCCTG GTCGGGCAGG GTTACGTCGC AACGATCTTC 
GCGTGGGGGC TCTGCAAGCT GAAGAAAGGC GAGATCGAGC CCTGGGGAGT ACCGCTCGCA
GACGTGGACT TCGGCGTCCC GGTGGAGGAC CTCGAAATTG CCGGGAGCGT AGACGTGGAC
GAAAGAAAGG TCGGCAAGAG CCTCCGCGAA GTGGCACCTA TGTACGGGCT TAGCCCGGAG
CCAGAGCTCG GAGAGGTCGT CGTGGCGCCC GGGCTTAAGC TGCGGAGCAC GCCCGGGTTC
ATAAGGACGA AGGCTCTGGA CGACTCGAAG CCCCTGGCGG ACGCTTACGG GGCGTTCGAG
GAATGGCTAG ACGACGTGAA ACCGGACGTC GTCGTAGACG TTACGAGTAC CGTTGCTTCC
AGCCCCCTCT ACTCGTGGCG GGAGGTCGAG GAGAAAGCAT ATAGGGGCGA TCTACCCCAC
TCGCAGGTCT ACGCTTTCCT GGTTCTAAGG CACGGGAGAT CCTCCTACGT GAACCTCCAG
CCCGCTTACG TAGCTTGTAG CCCAGCGTTC GTAGAGAAGG CGCGGGAGAA CGGGTTGCTG
GTTCTCGGCG ACGACGGCGC CACGGGGGCG ACCCCCCTCA CCGTTGACCT AGCCGAGCAC
CTGAAGGAGA GGAACAGAAG GGTTCTATCG ATAGCACAGT TCAACATAGG GGGCAACACG
GACTTCCTGG CGTTGACGGA GCCCGAGAGG AACCTGGCAA AGGAGAACAC TAAGTCGGGC
TTCTTAAAGG ACATACTCGG CTACGAGCCT CCCCACTTCA TAAGGCCTAC CGGCTACCTC
GAACCCCTCG GCGACAAGAA GTTCGTCTCG ATGCACGTTC AGTGGGTCTC CTTCGGGGGC
TTCACGGACG AGCTCGTAGT GAACATGCGG ATAAACGATA GCCCGGCGCT AGCCGGGTAC
ATCGTGGACC TCGCGAGGCT CGCCTACGCC CTCGCGAAGG CCGGTCTCCG CGGAACAGTA
CCGGAGGTTA ACAGGTTCTA CATGAAGAGG CCGGGACCCC TGGACGCCAG GCACACCTCG
AAGATCCAGG CTTACCGCGA GATGCTCGGG CTCCTCGAGG AGAAGCTCGG GGCGCGCCTC
CGCGCGAAGC CTCTCAGCGC TTGA
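
The GC content field is the percentage of G and C bases in the gene sequence. The following is a minimal sketch of how such a value could be computed from the sequence as laid out above, in 10-base blocks separated by spaces. The helper names `clean_sequence` and `gc_content` are hypothetical, and the database may round or compute the figure differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "TTGGCGGTAG ACGTGGTTCG GGTAGCCCTG GTCGGGCAGG GTTACGTCGC AACGATCTTC"
print(len(clean_sequence(first_line)))   # number of bases on that line
print(round(gc_content(first_line), 1))  # GC percentage of that line only
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported under Gene Information.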
 
Protein sequence
MAVDVVRVAL VGQGYVATIF AWGLCKLKKG EIEPWGVPLA DVDFGVPVED LEIAGSVDVD 
ERKVGKSLRE VAPMYGLSPE PELGEVVVAP GLKLRSTPGF IRTKALDDSK PLADAYGAFE
EWLDDVKPDV VVDVTSTVAS SPLYSWREVE EKAYRGDLPH SQVYAFLVLR HGRSSYVNLQ
PAYVACSPAF VEKARENGLL VLGDDGATGA TPLTVDLAEH LKERNRRVLS IAQFNIGGNT
DFLALTEPER NLAKENTKSG FLKDILGYEP PHFIRPTGYL EPLGDKKFVS MHVQWVSFGG
FTDELVVNMR INDSPALAGY IVDLARLAYA LAKAGLRGTV PEVNRFYMKR PGPLDARHTS
KIQAYREMLG LLEEKLGARL RAKPLSA
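
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The gene begins with TTG, which table 11 accepts as an alternative start codon, so it is read as M rather than L. The sketch below illustrates this with a hand-built codon table; `translate` is a hypothetical helper, not the database's own tool, and it does not handle ambiguous bases such as N.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for table 11; an initiating codon is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(raw: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate("TTGGCGGTAG ACGTGGTTCG GGTA"))  # MAVDVVRV
```

The output matches the first eight residues of the protein sequence above.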