Gene Tpen_1440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1440 
Symbol 
ID4601370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1391091 
End bp1392710 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content64% 
IMG OID639774215 
Producthypothetical protein 
Protein accessionYP_920840 
Protein GI119720345 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.828058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGCTCG GCGTCGTCGT GTGGGCTGAG GGCGTCACCG TGAAGTTCAG GATAGCCGAG 
GACTCTGTCG TCGAGAGGGG GATGCTGGTA AAGGTCGTAG ACCGCGGCAG GAGGTACGTG
TTGAAGGTAG TGGACTTCAA GCCCGAGAGC CTCTTGACGC CGGCGGAGGT AGCGATCATT
AGTAGCAAGG CGGACAGAGG CGAGAAGCCC CAGCTGAGGG ACAGGGACCT CAGGCTCTAC
GACACAGCCA TAGCTACCAT AGTGGCGCAG ATAGACGAGG ACGGCGAGGT TCACGGGCCG
AGCAGCGTCC CGGCCCTCTT CTCCCCCGTC GAGGAGCTCG GCGAGGAGGA CTTGAAGATG
CTCAGGCTCC ACACGGGGGA CATACCGATA GGCAGGGTGA GGTTCGGCCA CAGCGCCGTG
GACGTCGAGG TCGCGCTGGA CGGCGCGAAG ACAATCCCCC ACCACATACT CGTGGTCGGC
GGGACGGGGG CGGGTAAGAG CAACTTCGGC AGGGTCCTGG CAGCCTCGAT ACTGCACCTC
AGGGGGAGGT ACAGCCTCGT CGTATTCGAC TGCGAGAGCG AGTACCTGCT GGGCTCGAAG
CCGGGCGAGA TGGGACTGGC GCACCTGCCG TTCTCCGAGG AGTACCTCTT CCTCGTGTCG
GCCAGGGTCC CCAGGCCCGG GAGGCTGAGG GTAGAGCTCC CGGAGCTCGG GGAGGAGAGG
AGCATACCGG CCTACCCCCT CAAGATGGAC GTCTCGCGCC TCAAGCCGGG GGACTTCGCG
CTCACGGGCG AGTTCTCGAG CCCCCAGGAG GAGCTACTCT GGCTTGCCTA CAAGCAGTTC
GGGGAGGGGT GGGTGGAGGC GCTCCTCTCG GGGGACGCGA GGACAGTCTA CTCGAGGCTC
GGCAGGATGG CGGGCGTCAA CACGATCAGC GTTACTAAGA GGAAGATCAG GTACCTGACG
GGGGACGGCT CGGTATTCTG CAGGGACTGC GGCTACGACC TGGCGGCGGC CGTTCTCTCG
ATGGTCCTCA AGGGCAGGGT CGTTCTCATA GAAACCCCCT TCGCGACGGA AGGCGAGGAA
AAGCTCCTAG CGACCGTGGT CGCCGAGAGG GTATTCAGGG CCTACGAGAG GATGCGCAAG
GAGCTCCCGG AGAAGTGGAG CCAGCTCCCG CCCGTGCTCA TAATGGTGGA GGAGGCGCAC
AGGTACCTCG GGTCCCAGGC GCTGGGCGGG AAGGGGGAGG TCAGGGAGAA CGTGTTCTCC
ATAATAGCGA AGAGGGGCAG GAAGTACAAG GTCGGAGGGC TCTACATAAC GCAGATGCCC
AGCGAGCTCA TGGACGCGGT GGTGAGGCAG GCGCTCACCA AGGTCATACT CTCCCTTCCA
ACGAGGCCCG ACTACCTCGC CGTCATGAAC CACAGCCCGT ACCTCGACGA GGCCGAAGCC
GAGATAAAGA CCCTCGACAG GGGGGAGGCG ATAGTGGTCA GCCCCCCGAG CGGCTTCAGG
CTCGCGGTGT CCGCGAAGAT ATACCAGTAC GAGGAGTACG CCCTCAGACT CATACAGGAA
GAAAAGCGTC TGCTAGCTAC CCGAGAGGTC TCGAGGACGC CCGAGTACGC CGATGCCTAG
 
Protein sequence
MRLGVVVWAE GVTVKFRIAE DSVVERGMLV KVVDRGRRYV LKVVDFKPES LLTPAEVAII 
SSKADRGEKP QLRDRDLRLY DTAIATIVAQ IDEDGEVHGP SSVPALFSPV EELGEEDLKM
LRLHTGDIPI GRVRFGHSAV DVEVALDGAK TIPHHILVVG GTGAGKSNFG RVLAASILHL
RGRYSLVVFD CESEYLLGSK PGEMGLAHLP FSEEYLFLVS ARVPRPGRLR VELPELGEER
SIPAYPLKMD VSRLKPGDFA LTGEFSSPQE ELLWLAYKQF GEGWVEALLS GDARTVYSRL
GRMAGVNTIS VTKRKIRYLT GDGSVFCRDC GYDLAAAVLS MVLKGRVVLI ETPFATEGEE
KLLATVVAER VFRAYERMRK ELPEKWSQLP PVLIMVEEAH RYLGSQALGG KGEVRENVFS
IIAKRGRKYK VGGLYITQMP SELMDAVVRQ ALTKVILSLP TRPDYLAVMN HSPYLDEAEA
EIKTLDRGEA IVVSPPSGFR LAVSAKIYQY EEYALRLIQE EKRLLATREV SRTPEYADA