Gene Tpen_1588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1588 
Symbol 
ID4600552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1538899 
End bp1540257 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content58% 
IMG OID639774361 
Productextracellular solute-binding protein 
Protein accessionYP_920986 
Protein GI119720491 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC AACAAAACAG GTTGGTTTTA GCCTCTATCG CCGTAGGGAT AATAGCTCTC 
CTAGTGAGCC TCTACGCCGT TGTAACCGTT CAGTCGCTGG CATCCGCGAT TAGCGATCTC
AAAGCCTCGA TAAGCAACCT CCAGGGCACG GTTTCGAGCC TCCAGCAACA AGTCGGGAAG
ATATCCCAGC AACCCGCGCC CACTACACCC CCCGTACAGA AAGTAAAGCT GGTGGTGATC
GGCCCGTGGG CGGGTGACGA GGCGAAGTAC TTCCAGGCCG TGATCGACGC CTACGTAAAG
ACGCACCCCA ACGTCGAGAT TGAGTACAGG ACTATGCGCG CCGAGGACGT AGCGGCGACA
ATGCCCATAC AGTTCGCCGC CGGCGTAGCT CCTGGAGACG TAATCTTCGG GTGGGCGTGG
TTCATCGCCA AGATGGGCAA GGAGGGGCAT CTAGTAGACC TCACAGGCAT AATCAAGGAA
AACGAGTACG TACCGGGAAT CGTCGACGCC GTGAAAGCCG ACGGCAAGAT CTGGGGTGCA
CCGTTCACGA TGTGGCTAAA GCCTGGCTTC TGGTACAGGA AGTCCTTCTT CCAGAAGTAC
GGGCTAACAG AGCCGAAGTC CTACGCCGAG TTCGTGCAAC TACTGGAGAA GATCAAGGGG
ATACCCGGCG TGAAGAACCC CATAGCCACG GGAGACGGGG TCGGGTGGCC GATAAGCGAC
ATTGTCGAAC ACTTCATAAT CGCGTACGGC GGTCCACAGA TGCAACTAAA CCTCATATCC
GGGAAGACTA GGTTCACGGA TCCCAGCGTG AGGAAGGTGT TCTCCGACTA CCTCATACCG
CTACTCCAGA AAGGCTACTT CAGCGAGCCG ATAGAGTGGA CCACTGTCAT ACCCAAGTGG
TGGGCAGGCG AGTACGGGCT CTACTTCATG GGGACCTGGA TCACGGGGAT GGTCGAAGAC
CCGAACGACC TAGACTTCTT CCCCTTACCC GAGAGTAAGG GCGTCGTTGG AGGCGCGGAC
TACGCCTTCG TGCCGAAGTA CTCGAGGAAC GTTGACGCCG CGCTCGACTT CATCAAGTAC
CTCGCCACGG AGGGGCAGGT CGTACACGCG AGCGTACCTT CCGGTAAGAT CCCGACGTGG
ACGAAGGCAC CCGTCGAGAA GCTATGGAAG CCCATGCAGA GTGTATACAC CAAGATCACC
GGCAAGGGGC TAGCCATACT GCCTGACCTC GACGACTCTA TCGGGGGAGA CTGGCAGAAG
CTCTTCTGGG ATCAGCTGAA GCTGTTGTGG GTCAACCCAG GAGCCCTAGA CTCCGTCCTG
AAGACGCTGG AGAGCAGTCA GCCGAAGCCG TCGGGTTAA
 
Protein sequence
MSQQQNRLVL ASIAVGIIAL LVSLYAVVTV QSLASAISDL KASISNLQGT VSSLQQQVGK 
ISQQPAPTTP PVQKVKLVVI GPWAGDEAKY FQAVIDAYVK THPNVEIEYR TMRAEDVAAT
MPIQFAAGVA PGDVIFGWAW FIAKMGKEGH LVDLTGIIKE NEYVPGIVDA VKADGKIWGA
PFTMWLKPGF WYRKSFFQKY GLTEPKSYAE FVQLLEKIKG IPGVKNPIAT GDGVGWPISD
IVEHFIIAYG GPQMQLNLIS GKTRFTDPSV RKVFSDYLIP LLQKGYFSEP IEWTTVIPKW
WAGEYGLYFM GTWITGMVED PNDLDFFPLP ESKGVVGGAD YAFVPKYSRN VDAALDFIKY
LATEGQVVHA SVPSGKIPTW TKAPVEKLWK PMQSVYTKIT GKGLAILPDL DDSIGGDWQK
LFWDQLKLLW VNPGALDSVL KTLESSQPKP SG