Gene Tpen_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1100 
Symbol 
ID4600967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1037629 
End bp1039233 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content56% 
IMG OID639773877 
ProductPTS system mannose/fructose/sorbose family IID component 
Protein accessionYP_920502 
Protein GI119720007 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3715] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC
[COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.220553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACCA TAGGCCCACT CGAGGTAGGG CTTCTAGGAC TGCTAGCCTT CATCTTCGGA 
CTGGACTACA TCTGGGTAAC ACCTCTCGGA ATCTGGCGCC CCGTGGTCGC GGGGACCCTC
ACAGGCATTA TTCTAGGAGA CCCCTTGACA GGGCTACTCG TAGGCTCGCT ACTAGAGTTC
GTGTTCGCTG GGCTATTCAC GATAGGAGGC GGGACCGTTC CGGAGGCTGC TAGCGGGACG
ATAGCCTCCG TGGTTGTCGC CGTTACTACG GGGTTAAAGC CTGAGGCGGC GGTACCGCTG
GCTATACCCG TAGCAGTGCT TACAATGAAC TTGGAGATAG TTGTCAGGTC TTTCGACGCG
GTGTTCACGC ACTGGGCTGA CAGGGAGATA GAGAGGGGGA ACTACGGGGC AATCCCCCTG
ATAAACATTC TCGGCGCGGT ACCATGGGGG CTCAGCAGGG CAATACCTAT CTGGCTATTC
GCGGGAGCCT TAGCCATAAA CCCACAGGCC GTTAAAGCGG CGATAGATGC CCTCCAAGTG
GTTCAAATCG GACCCTTCAC GGTGCGCTTC TGGGACGCGA TGGCAGTCGC GGGCGCCGTA
CTCCCAGCGC TCGGTGCCGC CATTCTAATG AAACTCATGA TCTCGCGTAG GAACGTGATG
TTCTTTGTAC TCGGCTTCGC TCTCGCAGCC TACCTGAAGC TAAGCCTCCT GGCGATAGCC
CTCGTAGCGG GATCCATAAT CTTCGCTATC TACTACTTCA CCCACCGCGA GGCATTGGAG
GCTGGAGCCG CGGTAACCAC AGCGGCACCA CCCACCGGCA AGGCAACGAC GAAGGACTTC
ATAAGGTGGT TCGGGGTCTC ATGGTTCATA CAGTCGTCCT GGAACTACGA GAGAATGATG
GGGACAGGCT TCGCGCACGG TATGCTTGAA ATAGAGAAGA AGCTTAGAAA GGACCCGGAG
GAGCTGAAGT CCTGGATGAG GCTACACAAC GAGTTCTACA ACACCGAGCC CCACCTCCAC
AACGCCATTT ACGGGATGGT GATATCCCTA GAGGAACAGG GGGCGGATCA GGATACGATA
AGAGGAGTCA AAACAGCGCT TATGGGTCCA TTCGCAGGGC TCGGAGACTC GATAATGTGG
TTCACGATCC TCCCGATAGC GTTCCTCTTA GGAGCCTCGC TGGGAGCCCA GGGCAACATA
CTCGGCCCGG TAATAGCGCT ACTGATATGG ATACCAGTCT CCTGGGCCGT TAAGTACTAC
ACGCTCGTCT ACGGGTACAA GTACGGCTTA TCCCTGGCGG AAATACTCAA GGGAGAAGTC
CTGAAGATAT TTAGGGAGGG TATAGCAGCC TTCGCGATGG CAATGGTCGG AGGAATCGCG
GCGACATACG TCAGGGCGAC AACCCCGATA GTACTGGCCC AGTACGCTGG TCACGCCATT
AAGCTACAAC CAGTACTGGA CCAGTTGATG CCATCCCTGC TCCCACTGCT CTTCACCCTC
TACGCCTACT GGCTAATAAA GGTCAAAGGC TACAGCTACG GTAAAGCCGT CGTCATACTC
TTCCTCACGG CATTCATACT CGCACTGCTA GGAGTACTCG GTTAA
 
Protein sequence
MATIGPLEVG LLGLLAFIFG LDYIWVTPLG IWRPVVAGTL TGIILGDPLT GLLVGSLLEF 
VFAGLFTIGG GTVPEAASGT IASVVVAVTT GLKPEAAVPL AIPVAVLTMN LEIVVRSFDA
VFTHWADREI ERGNYGAIPL INILGAVPWG LSRAIPIWLF AGALAINPQA VKAAIDALQV
VQIGPFTVRF WDAMAVAGAV LPALGAAILM KLMISRRNVM FFVLGFALAA YLKLSLLAIA
LVAGSIIFAI YYFTHREALE AGAAVTTAAP PTGKATTKDF IRWFGVSWFI QSSWNYERMM
GTGFAHGMLE IEKKLRKDPE ELKSWMRLHN EFYNTEPHLH NAIYGMVISL EEQGADQDTI
RGVKTALMGP FAGLGDSIMW FTILPIAFLL GASLGAQGNI LGPVIALLIW IPVSWAVKYY
TLVYGYKYGL SLAEILKGEV LKIFREGIAA FAMAMVGGIA ATYVRATTPI VLAQYAGHAI
KLQPVLDQLM PSLLPLLFTL YAYWLIKVKG YSYGKAVVIL FLTAFILALL GVLG