Gene Tpen_1676 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1676 
Symbol 
ID4600928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1622391 
End bp1624301 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content49% 
IMG OID639774449 
Productextracellular solute-binding protein 
Protein accessionYP_921074 
Protein GI119720579 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.24706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CAAAAGTTGC ACCAATACTG GTACTAGTAG TTCTCGCGAC AATCGCCCCT 
GTTTTCGCAG CGCCCGAGAC TATACCCAGG GAGGAAGCCA TCTACGTCGG CGGCGGAATG
TGGTCTCCAC CGAACAACTG GAACCCCCTA ATACCGTGGT CCGCTGTAAC GGGTACAATA
GGGCTTGTCT ACGAAACATT GTATCTATAT AACCCTGCAA ACGGAACTTT CATTCCTTGG
ATAGCCGATG GTCAACCCAG CTTCCAGGTA AGCGGTAATA CAGTGAGAAT AACAATTAAG
CTAAAAGAGG CAAGATGGCA GGACGGGCAA CCGCTTACAA GCGAAGATGT TGCGTACACG
TTCTACGAGT TTCCGAAGAA AAACACAGCC GTCTATTATT CTAGCATTCT AAACTACCTG
CTGTCCGTTG AAACACCCGA CACCAGAACC GTAGTTTTTG TCTGCAATGC CACAACCGTA
AACTACCCGC AGATCTACGA TTTCCTGAGA TCAGTCGCTA TAATCCCGAA ACATGTATGG
GTTAACAAGG AAAAACCATT GGAAGATGCA AACTGGCCGC CCATAGGCTC CGGGATGTAC
AAGGCTAGTA GCTACACGTC TGACCGCATG ATATGGGTTC GCGACGATAA CTGGTGGGGC
ACGAAGTACT TCGGAACACC CGGTCCAAGG TATATCGTAT ACGTCATGGT ATCGAGTAAC
GCTGTTGCTC TAGCGATGCT GGCTCGAGGA GAGCTGGACT GGAGCAACTA CTTCCTGCCA
GGCTTCTCCA GTCTTGTACA ACAATACCCG TTCTTAGTAA CCTGGTATAA TAAGTCACCG
TGGAACCTTC CTGCAAACGT TGCATTCCTC TTTGTGAACA CGAAAAAAAC GCCCATGGAT
AACCCTACTT TCCGCAAAGC TCTCTACTAC GCTATAGACG TGGATAAAAT AATCAACACA
GTATTTGAGG GGGGCGTTAT CAAAGCCTTA CCTATAGGCA TACTCGACAT ACCTGGATAC
AAGCCTTTCA TCGACACAGA ACTTATATCT AGGTATGGGT ATAAGTACGA CCCCGAAAAA
GCAAAGCAAT TGCTTGACAG CATAGGGATA AAGGACTATA ATGGAGACGG GTGGAGAGAG
CTACCCGGAG GCAAGCCTTT AAAACTTACT ATAATTGTGC CGTATGGCTG GACTGATTGG
ATGGAGGCAG CCAGACTTAT CGCAAGCGAT CTCCAGAGAG TAGGACTTTA CGTCGAGGCG
CAGTTCCCGG ACTACTCGGC GTATAGCGAG GCATTGTACA AAGGAACATT TGACATGTTG
ATAAACAACT TTGGCAGCTT TGCCTCTATA TCCCCCTGGG TTATCTACAA CTGGGCACTA
TGGCCCGATG CCCCACCCGT AGGCGAATAC TCCTGGAGCG GGAACTTTGG CAGGTACTCT
AATCCGAAGG TAACAGAGCT TTTGCACACA ATAGCGAATA CACCTCTCAG CGATGTTACC
AAGCTGAAGC AACTCTACGG TCAACTTGAA CAAATATACC TAGACGAGAT GCCTTACATA
CCATTATGGT ACAACGGCTA CTGGTTCATT GGCTCGAAGC TGTACTGGAC AGGATGGCCT
AGCGCCGATA ACCCGTACGG CGTACCGGTG ACGTGGCCTG GGAGGTGGCA AGACGGCGGC
TTATTGGTGC TCCTTAAACT CAGACCTGTG AAAACTCCCA CAACTACTCC AACAACTACA
CCGACCACTC CAACTACTCC GACTACCCCC ACTGCTCCGA CGACGCCTAC CGCTCCAGCA
CCAGACTACA CTCCGTACAT AGTGGCGCTA ATAGTGATAG TCGCTATACT TGCCGTAGCT
TACATGTTCT TTGTACAGCG CAAAAAGAAG GAAGAAACCA AGCCTCAATA A
 
Protein sequence
MKKAKVAPIL VLVVLATIAP VFAAPETIPR EEAIYVGGGM WSPPNNWNPL IPWSAVTGTI 
GLVYETLYLY NPANGTFIPW IADGQPSFQV SGNTVRITIK LKEARWQDGQ PLTSEDVAYT
FYEFPKKNTA VYYSSILNYL LSVETPDTRT VVFVCNATTV NYPQIYDFLR SVAIIPKHVW
VNKEKPLEDA NWPPIGSGMY KASSYTSDRM IWVRDDNWWG TKYFGTPGPR YIVYVMVSSN
AVALAMLARG ELDWSNYFLP GFSSLVQQYP FLVTWYNKSP WNLPANVAFL FVNTKKTPMD
NPTFRKALYY AIDVDKIINT VFEGGVIKAL PIGILDIPGY KPFIDTELIS RYGYKYDPEK
AKQLLDSIGI KDYNGDGWRE LPGGKPLKLT IIVPYGWTDW MEAARLIASD LQRVGLYVEA
QFPDYSAYSE ALYKGTFDML INNFGSFASI SPWVIYNWAL WPDAPPVGEY SWSGNFGRYS
NPKVTELLHT IANTPLSDVT KLKQLYGQLE QIYLDEMPYI PLWYNGYWFI GSKLYWTGWP
SADNPYGVPV TWPGRWQDGG LLVLLKLRPV KTPTTTPTTT PTTPTTPTTP TAPTTPTAPA
PDYTPYIVAL IVIVAILAVA YMFFVQRKKK EETKPQ