Gene Information |
Locus tag | Tpen_1440 |
Symbol | |
ID | 4601370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1391091 |
End bp | 1392710 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774215 |
Product | hypothetical protein |
Protein accession | YP_920840 |
Protein GI | 119720345 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.828058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAGGCTCG GCGTCGTCGT GTGGGCTGAG GGCGTCACCG TGAAGTTCAG GATAGCCGAG GACTCTGTCG TCGAGAGGGG GATGCTGGTA AAGGTCGTAG ACCGCGGCAG GAGGTACGTG TTGAAGGTAG TGGACTTCAA GCCCGAGAGC CTCTTGACGC CGGCGGAGGT AGCGATCATT AGTAGCAAGG CGGACAGAGG CGAGAAGCCC CAGCTGAGGG ACAGGGACCT CAGGCTCTAC GACACAGCCA TAGCTACCAT AGTGGCGCAG ATAGACGAGG ACGGCGAGGT TCACGGGCCG AGCAGCGTCC CGGCCCTCTT CTCCCCCGTC GAGGAGCTCG GCGAGGAGGA CTTGAAGATG CTCAGGCTCC ACACGGGGGA CATACCGATA GGCAGGGTGA GGTTCGGCCA CAGCGCCGTG GACGTCGAGG TCGCGCTGGA CGGCGCGAAG ACAATCCCCC ACCACATACT CGTGGTCGGC GGGACGGGGG CGGGTAAGAG CAACTTCGGC AGGGTCCTGG CAGCCTCGAT ACTGCACCTC AGGGGGAGGT ACAGCCTCGT CGTATTCGAC TGCGAGAGCG AGTACCTGCT GGGCTCGAAG CCGGGCGAGA TGGGACTGGC GCACCTGCCG TTCTCCGAGG AGTACCTCTT CCTCGTGTCG GCCAGGGTCC CCAGGCCCGG GAGGCTGAGG GTAGAGCTCC CGGAGCTCGG GGAGGAGAGG AGCATACCGG CCTACCCCCT CAAGATGGAC GTCTCGCGCC TCAAGCCGGG GGACTTCGCG CTCACGGGCG AGTTCTCGAG CCCCCAGGAG GAGCTACTCT GGCTTGCCTA CAAGCAGTTC GGGGAGGGGT GGGTGGAGGC GCTCCTCTCG GGGGACGCGA GGACAGTCTA CTCGAGGCTC GGCAGGATGG CGGGCGTCAA CACGATCAGC GTTACTAAGA GGAAGATCAG GTACCTGACG GGGGACGGCT CGGTATTCTG CAGGGACTGC GGCTACGACC TGGCGGCGGC CGTTCTCTCG ATGGTCCTCA AGGGCAGGGT CGTTCTCATA GAAACCCCCT TCGCGACGGA AGGCGAGGAA AAGCTCCTAG CGACCGTGGT CGCCGAGAGG GTATTCAGGG CCTACGAGAG GATGCGCAAG GAGCTCCCGG AGAAGTGGAG CCAGCTCCCG CCCGTGCTCA TAATGGTGGA GGAGGCGCAC AGGTACCTCG GGTCCCAGGC GCTGGGCGGG AAGGGGGAGG TCAGGGAGAA CGTGTTCTCC ATAATAGCGA AGAGGGGCAG GAAGTACAAG GTCGGAGGGC TCTACATAAC GCAGATGCCC AGCGAGCTCA TGGACGCGGT GGTGAGGCAG GCGCTCACCA AGGTCATACT CTCCCTTCCA ACGAGGCCCG ACTACCTCGC CGTCATGAAC CACAGCCCGT ACCTCGACGA GGCCGAAGCC GAGATAAAGA CCCTCGACAG GGGGGAGGCG ATAGTGGTCA GCCCCCCGAG CGGCTTCAGG CTCGCGGTGT CCGCGAAGAT ATACCAGTAC GAGGAGTACG CCCTCAGACT CATACAGGAA GAAAAGCGTC TGCTAGCTAC CCGAGAGGTC TCGAGGACGC CCGAGTACGC CGATGCCTAG
Protein sequence | MRLGVVVWAE GVTVKFRIAE DSVVERGMLV KVVDRGRRYV LKVVDFKPES LLTPAEVAII SSKADRGEKP QLRDRDLRLY DTAIATIVAQ IDEDGEVHGP SSVPALFSPV EELGEEDLKM LRLHTGDIPI GRVRFGHSAV DVEVALDGAK TIPHHILVVG GTGAGKSNFG RVLAASILHL RGRYSLVVFD CESEYLLGSK PGEMGLAHLP FSEEYLFLVS ARVPRPGRLR VELPELGEER SIPAYPLKMD VSRLKPGDFA LTGEFSSPQE ELLWLAYKQF GEGWVEALLS GDARTVYSRL GRMAGVNTIS VTKRKIRYLT GDGSVFCRDC GYDLAAAVLS MVLKGRVVLI ETPFATEGEE KLLATVVAER VFRAYERMRK ELPEKWSQLP PVLIMVEEAH RYLGSQALGG KGEVRENVFS IIAKRGRKYK VGGLYITQMP SELMDAVVRQ ALTKVILSLP TRPDYLAVMN HSPYLDEAEA EIKTLDRGEA IVVSPPSGFR LAVSAKIYQY EEYALRLIQE EKRLLATREV SRTPEYADA
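
The lengths reported in the Gene Information table can be cross-checked against the coordinates and the sequences in this record. The sketch below is illustrative only: the helper names (`span_length`, `expected_protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical and not part of any database tool. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Both assumptions agree with the values shown here, but the record itself does not state them.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, ASSUMING the gene length
    includes one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def clean_sequence(blocked: str) -> str:
    """Remove the spaces that separate the 10-letter display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Values copied from the Gene Information table.
gene_length = span_length(1391091, 1392710)            # 1620
protein_length = expected_protein_length(gene_length)  # 539
```

Passing the full Gene sequence field to `clean_sequence` and `gc_fraction` gives a way to compare the sequence against the reported Gene Length and GC content. The strand ("-") does not affect either check, since length and GC fraction are the same on both strands.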