Gene Tpen_1056 details


Gene Information

Locus tag: Tpen_1056
Symbol:
ID: 4601442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 995603
End bp: 996544
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 57%
IMG OID: 639773834
Product: binding-protein-dependent transport systems inner membrane component
Protein accession: YP_920459
Protein GI: 119719964
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1175] ABC-type sugar transport systems, permease components
TIGRFAM ID:
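
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 995603
END_BP = 996544


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 942
print(protein_length(gene_length(START_BP, END_BP)))  # 313
```

Under these assumptions both results agree with the listed Gene Length (942 bp) and Protein Length (313 aa).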


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.355645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCCGTAT CCTCCGTTAA GAAGTATAGC GCCCCCATAC TCTTCCTCCT ACCGGCGCTC 
GTAGTGCTCG CCATGGTAAG CTTCTACCCA CTAGTATTCG GCTTCTACAT GGCGTTCACG
GACATGTCCA CAACGAAGGG TAAGTGGATC TCCTGGGACT TCGTAGGCCT CGAGAACTTC
CGGAGAATAG TGGAGGAGCT GCTAACGCCC GAGCTCGGCC TGGGTAGAGC CTTGCTCAAC
ACAGTGTTCT TCACGGTGGT AAACGTCGCC CTGCAGGTAC TAGTCGGGGT AGCCTTCGCC
TTCATGCTTA ACTCGGACAA GCTACTCTGG CGCAGGTTCT GGCAGGCTGT GTTCATCGTT
CCCTGGGCCG TCCCAGGCTA TATAAGCATA ATGGCGTGGT TCTTCCTCTA CCAGTACTCC
TACGGCTACT TCAACCAGAT ACTCATGAAC CTGGGGCTCG AGAGAATAGA CTGGCTCGGG
CAGAGGCTGG ACACGTTCTG GCCCTGGGTG TCCATCAACG CTACGAATGT CTGGCTGGCG
TACCCCTTCA TAATGACGGT CACCCTGGCG GCTCTGCAAA CGCTTCCGAG GGACCTGCTG
GACGCGGCGA AGGTCGACGG CGCGGGCGGG TGGCAGACGT TCCGCTACGT AGTGCTCCCG
CACATAAAGC CGCCCCTAAC GATCGCGACG GTTTTGACTA CGATAACGAC CTTCCAGCAG
TTCGGCGTAG TGTGGCTCTT GACGGGGGGA GGCCCGCAGA TATTCGTGAA GGCTATCGGC
ACCACCATCT ACGTTACCGA CCTGCTCATG ACCTATGGCT ACAGGACGAT ATGGCAGTTC
GGGGACTACG GCTACGGAGC GGCCTTCTCC ATATTCCTGG CACTGCTCGT AGTACCGGCC
AGCATATACG CCATGAAGAA GCTGAGAGTC GTGGGTGAGT AG
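
A GC-content figure such as the one reported for this gene is the percentage of G and C bases in the sequence. The sketch below shows one way to compute it from text laid out in 10-base blocks as above; the function name is hypothetical, and only the first row of the gene sequence is used, so the result describes that fragment and not the whole gene.

```python
def gc_content(dna):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row (60 bases) of the gene sequence shown above.
first_row = "TTGCCCGTAT CCTCCGTTAA GAAGTATAGC GCCCCCATAC TCTTCCTCCT ACCGGCGCTC"
print(round(gc_content(first_row), 1))  # 56.7
```

Passing the full 942-base sequence to the same function would give the whole-gene value.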
 
Protein sequence
MPVSSVKKYS APILFLLPAL VVLAMVSFYP LVFGFYMAFT DMSTTKGKWI SWDFVGLENF 
RRIVEELLTP ELGLGRALLN TVFFTVVNVA LQVLVGVAFA FMLNSDKLLW RRFWQAVFIV
PWAVPGYISI MAWFFLYQYS YGYFNQILMN LGLERIDWLG QRLDTFWPWV SINATNVWLA
YPFIMTVTLA ALQTLPRDLL DAAKVDGAGG WQTFRYVVLP HIKPPLTIAT VLTTITTFQQ
FGVVWLLTGG GPQIFVKAIG TTIYVTDLLM TYGYRTIWQF GDYGYGAAFS IFLALLVVPA
SIYAMKKLRV VGE
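
The gene sequence begins with TTG, while the protein sequence begins with M. This is expected under translation table 11, where several non-ATG codons can act as initiators and the initiating codon is read as methionine. The sketch below is a minimal pure-Python illustration and not the database's own pipeline: the names are hypothetical, the codon assignments are those of the standard code (which table 11 shares), and the start-codon set is an assumption that should be checked against NCBI's genetic code listing.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiator codons for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row (60 bases) of the gene sequence shown above.
first_row = "TTGCCCGTAT CCTCCGTTAA GAAGTATAGC GCCCCCATAC TCTTCCTCCT ACCGGCGCTC"
print(translate_cds(first_row))  # MPVSSVKKYSAPILFLLPAL
```

The printed fragment matches the first 20 residues of the protein sequence listed above.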