Gene Tpen_1152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1152 
Symbol 
ID4600958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1090712 
End bp1091893 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content57% 
IMG OID639773928 
Productextracellular solute-binding protein 
Protein accessionYP_920553 
Protein GI119720058 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0401617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAAGGT TCGCCGGGTG GAGCGCCGGA GAAACAGAGA TGAAGAACTA CCAGAAGATA 
ATCGAGGACT TCCAAAAGGC TAACCCGGAT ATAATCGTCA AGTACGAGGT AATAACCCAG
ATGTTCTCCG AGAACATTCT AGCAAGCTAC GCGGCCGGCG CCGCTCCCGA CATATTCTAC
GTTGACTCTG CGTGGGCGCC CACGTTCATC AGTAAGGGCG CTCTCTACCC CATAGGCGAC
AAGCTACCCA AGGACTTCAT CGACCAGTTC TACCCGTTCC TCCTCGAACC GTTCAAGGGG
CCTGACGGGA AGATCTACGG ATTACCGAAG GACTGGTCCG TGCTCTCGCT GTTCTACAAC
AAGAAGTTGT TCGCCCAGGC AGGAGTACCC GAGCCCACCG CCGACTGGAC CTGGGACGAC
CTCTTCAACG CCGCTAAGAC CATCTACCAG AAGACCGGTA AGCCCGGGCT AGTCGTACAC
GCAGAGCTCA ACAGGTGGGT ACCCTTCCTC GTCTCCAACG GTGCTCCCCC ACCGCGCTTC
GACTCGGCGG CCGACGCCGC CTACTTCGAC AAGCCCGAGG TTAGGAACGC GATTTCGAAG
ATGATAGCCA AGATACAGGA GGGGCGTAAA GAGGGCTACA TAGTCCTGCC CTCGGACGTG
AACGCCGGCT GGAACGGGGA GGCCTTCGGC AAGCAGCTCG CCGCGATGAC TATCGAGGGT
AGCTGGATGA TACCCTACCT CGCGGACCAG TTCCCCAACT TCAAGTACGG CTCCGACTGG
GACCTCGCGA TGCTACCTAA GGGTCCCGCC GGCAGGGCAA GCATGGCTTA CACCGTGGCG
CTCGGAGTGA ACTCCAAGAC CGAGAACCTG GACGCGGCGC TGAAGTTCCT GCAGTACGTT
GAAGGCATTG AAGGGCAGAA GCTACTAGTG GTGAAGATGG GTCATACCCT CCCGTCCATA
AAGGCGCTAG CAAACGACCC AGACCTCTGG CCCTCTCACG CTAAGGAGCT ATCGTTCGTG
AACAAGTACG ACCGCGTGGC GCTCTTCTTC TACGGCCCGA AGACAGGACA GATAGAGGGA
AGCATAAACC AGATCATCCA GTCTGCGGTC AGGGGGGAGA TAACGATAGA CGAAGCGCTA
AGGCTGATGA AAGACAAGGT TGCAGAAGCC TTTAAATCCT AG
 
Protein sequence
MIRFAGWSAG ETEMKNYQKI IEDFQKANPD IIVKYEVITQ MFSENILASY AAGAAPDIFY 
VDSAWAPTFI SKGALYPIGD KLPKDFIDQF YPFLLEPFKG PDGKIYGLPK DWSVLSLFYN
KKLFAQAGVP EPTADWTWDD LFNAAKTIYQ KTGKPGLVVH AELNRWVPFL VSNGAPPPRF
DSAADAAYFD KPEVRNAISK MIAKIQEGRK EGYIVLPSDV NAGWNGEAFG KQLAAMTIEG
SWMIPYLADQ FPNFKYGSDW DLAMLPKGPA GRASMAYTVA LGVNSKTENL DAALKFLQYV
EGIEGQKLLV VKMGHTLPSI KALANDPDLW PSHAKELSFV NKYDRVALFF YGPKTGQIEG
SINQIIQSAV RGEITIDEAL RLMKDKVAEA FKS