Gene Tpen_1453 details


Gene Information

Locus tag: Tpen_1453
Symbol:
ID: 4600579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1403839
End bp: 1405086
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 58%
IMG OID: 639774228
Product: extracellular solute-binding protein
Protein accession: YP_920853
Protein GI: 119720358
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2182] Maltose-binding periplasmic proteins/domains
TIGRFAM ID:
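
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1403839
end_bp = 1405086

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")      # 1248 bp
print(protein_length_aa, "aa")   # 415 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.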


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0104666
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGAAA AACAACAAAA ACAAGCCTCT AAGAAGACCC TGACAATAGT AGCAGCAGTT 
GTTCTAGTCC TCGTTGTACT GGGAGTAGCC GCCATGCTAC TAACGCAGAA ACCCTCCGCT
CCTAAACGGA ACGTAACGAT CGTTATCTGG CACGCCATGG GTCCAGAGGA AGTTAAGACG
CTCGAAGACG TCATCGCGGA CTTCCACGTT CAACACCCGG AGATCACCGT GAAGCTTGAG
CAGAAAGCGG ACCTCGAGAC CTCCCTCAAA ACGGCCATCC CGGCTGGGCA GGGCCCAGAC
CTCTTCATAT GGGCTCACGA CTGGATAGGT AAGTTTGCAG AGGCCGGCCT CCTCGAGCCG
ATCGACGAGT ACGTGACTCC CAGCGTGCTG AACAAGTTCA GCCCGATAGG GCAGAACGCT
ATAGAGTACC GCGGGCACTA CTACGCGATG CCCCTCGCCG CCGAGACGGT CGCCCTGATC
TACAACAAGG CCTTGGTGCC TAACCCGCCG AAGACCTTCG ACGAGATGAA GAGCATAATG
GCCAAGTTCA CTAACCCGGA CAAGGGTACG TACGGCCTTG CGACGCCGAT AGACCCCTAC
TTCCTCTCCG GGTGGGTGCA CGCCTTCGGA GGCTACTACT TCGACGATAA GACCAAGCAG
CCCGGGCTGG ACAAGCCTGA GACGATAAAG GGCTTCAAGT TCTTCTTCGA GCAGGTATAC
CCCTACGTCG CTAAGACCCG CGACTACAAC GCGCAGGTAA GCCTCTTCCT CGAGGGCAAA
GCCCCCATGA TGATCAACGG TCCTTGGAGC ATCGGCGACG TCAAGAAGGC TGGCATAAAC
TTCGGCGTAG CCCCGTTGCC ACCGATAGAC AGCTCGAGCG TGCCGCACCC GTACGGCGGC
GTGAAGCTGG TCTACGTAGC TAAGGGAGTT AAGGACAAGG CCGCGGTCTG GACGTTCCTC
GAGTGGTTGA CCACGAATCC GAACGTCATC AAGCAGTTCG CCATACGCAA CGGCTATATC
CCCGTGCTCA AAGAGGTCCT CAATGACCCG GAGATACAGA ACAACCCCGT GATCTACGGC
TTCGGGCAGG CCGTCCAGAA CGCTATCCCG ATGCCTAAGA GCCCCGAAAT GGCGGCCGTC
TGGGGACCCG TGGACACTGC CATCACGAAC ATCATGGGCG GAAAGCAGAG CATAGAGGCC
GCACTGACAG CCGCGCAGCA GGAGGTTCTG TCCGCCTTGA AGAAGTAA
 
Protein sequence
MTEKQQKQAS KKTLTIVAAV VLVLVVLGVA AMLLTQKPSA PKRNVTIVIW HAMGPEEVKT 
LEDVIADFHV QHPEITVKLE QKADLETSLK TAIPAGQGPD LFIWAHDWIG KFAEAGLLEP
IDEYVTPSVL NKFSPIGQNA IEYRGHYYAM PLAAETVALI YNKALVPNPP KTFDEMKSIM
AKFTNPDKGT YGLATPIDPY FLSGWVHAFG GYYFDDKTKQ PGLDKPETIK GFKFFFEQVY
PYVAKTRDYN AQVSLFLEGK APMMINGPWS IGDVKKAGIN FGVAPLPPID SSSVPHPYGG
VKLVYVAKGV KDKAAVWTFL EWLTTNPNVI KQFAIRNGYI PVLKEVLNDP EIQNNPVIYG
FGQAVQNAIP MPKSPEMAAV WGPVDTAITN IMGGKQSIEA ALTAAQQEVL SALKK
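
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one hedged way to do that in plain Python. It builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meanings). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short fragments copied from the first and last gene sequence lines are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); same amino acids for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGACGGAAA AACAACAAAA ACAAGCCTCT"))  # MTEKQQKQAS
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("TCCGCCTTGA AGAAGTAA"))               # SALKK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison with the listed protein sequence and the GC content field.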