Gene Information |
Locus tag | Tpen_0983 |
Symbol | |
ID | 4600457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 931492 |
End bp | 934551 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639773761 |
Product | hypothetical protein |
Protein accession | YP_920386 |
Protein GI | 119719891 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
|
|
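The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon. Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Gene Information table for Tpen_0983.
length_bp = gene_length(931492, 934551)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the values from this record the result is 3060 bp and 1019 aa, matching the Gene Length and Protein Length fields.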
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.397538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTCG TGAATTTGCT TGCTGAGGGT AAGGTTAGCC CCGCGGTAGA CATCTACGAG GTGTATAAGA GCCTCTTCAA GGGGGAAAAG CCTGAGAAGA TCTACGAGCC TTACTGTGAC CCCCGCTTGT TCTTCCAGCT GACGTTTGTG ACGGACGGGT TTAAGCAGTA CCTCGGCGAT TTTCTCTCGA AGCTCGCGTC CGGAGAGTCT GAAGTGTACG TTATGCCGGC TCTTCTGGGG GCGGGTAAAT CGCATTTCTT GGCTTTCGTA CTGCATATAC TCAGACTTTA CAGGGATTGC AGGGGCGCTG GGGAGTGTGT CGAGAAAGCT TTGGAGGAGC TGGGAGTGAA GCTTAAAGTT CCCTCTCTCG AGAAGGTTCC CGAGGTTTTA GTGTTCCACG GGGAGCACAA CGTTGACCTT AAGCCGTTGG ACTTTTCAAG CAAGGATACT CTTAAAGCCT CCTTGAAGCC CCCCGTGGTC TTGATATTCG ACGAGACACA GCACTTCGAG GCAAAGATTC GTGACTTCCC GTTGCTCATG CAGATGCTCG CAGAGGCTGT CGAGGAGCGG AGGGGGGTCT TTCTCTTCGT GTCCTTCTCC CTATTCTCCG GTGAAAGACC GGATCTTGCG GCTCCCAAAT CCTTGGATGC TGTGCGCCGT GTCCACTACG TGACCGTCTC GCTGGATGTA ACGCGGAACA TAGTCGAAGT ATTCAGGAGG TGGGCAGGGC TTAGTGGCGC GAGAAGCGTG GAGCTAGCCG GGCTCAAGGG CATTGTAACC GATGAAAGGC TCAGGGAGTT TGAGAACAGG CTTCGAGGCT CCTACCCATT CAACCCGTAC CTGCTGGATG CTGTTTTACA GCTTGCAGAC GAGTCCCTAG TTGAGAAGAC AAGGGTTCAG CTGACTAGAG GGCTTCTGAG GATACTGGCC TCTGCCTACG TTAACAGGAG AGGCGAATTA GTGATATTCG CAGATCTACC AGAGCCAAAA GAGGTAGTTA TTGCCGGCGA TGTTTTTGCC GGGCAACTGA ACGTCATCTT GAGGCTCTAC GAGGACGACG CTAGGAAGGT TTCTGGAAGC ATAGCTGCTC TTTCCGTGCT ACGCCACATT CTCCTTGCAA CCTTCTTCGC CAGGCTTCTT CCACATCGTC GAATGTATCC AACCGAGGAG GAACTTATAC TCGGTAGCTA CGACCCGGCG AGGGTGAAGC CTCTTGACGT TAAGATGTTC CTCGAGGATG CCGCGAGGCA GGGCTTGCAT ATAGAGAAAG TCAACGGTCG CTACATGTAC TGGTTCATCG GAGGCATAGA AGAGAAAGTC AGGGACGCCA TGTACAGGTT CGGCGATGAT GACGGACTTG AAGTGGCTAC GGACGAGGTC GCAAGCCTTG CCAGGGAGAG GGCAGGACCT TTCTCGAGTG TAGTAATCGC GGGCGTCGGA GGTACTAAGG CTCTCGGCAA GGTTAAAGTC GTGTCGAGTA GGGACGAGTG GGAGAAAGAG CTAAAGGATC AGGATAAGGC TATACTCGCT ATAGACCTCC TTAACTTCGG AGTACCGGTT AAGCGGAATA ATCTCATCGT TGTGAGAAGA TACGATGAGG GAGAACCTCC GCAGACTACC TTAGAGCTGT TAAAAAGAGT AGGCGAGGGA CCCAGAACTG TAAGGGAGGC TGTCGTGGAT CTCGGACGCC TAGTAAAGGG GGTAGACGAG GTCTACGCGA ACCTTATTGA CTACTTCCCG GAACTTCTAG AGGAGGAGAT GGAGGATATT CTTCGAAGGG AGTTGGAGCA ACTTATTCGA 
GGAAGGCTTG AAAACCTGAA AAGCCGTGCA AAAGCGTACC TTAGGGAAAG CGTGGGGCTA TGGTTGCGGC GTGGCGTTGT GGGCTTTAAA GACGTAGAGA AGCGCGGCTT CGACGAGTTG GTAGGAGAGC TTGTCAAAGA TAAGAGAGAC AGGCTTCGCG GAGTAGTCAA GGAGATATTC ACGGGCGACC TTATAAACTG GGATAGCTTC AAGAAGGTTG GAGACCTTTG GAGCCTTTTC CTAAACAATG AATCATTCCC AGCGATTCCG GCGTCCTTCG AGGAGTTCCT AGAAGCACTG AGGGAGTACT GCAAGGGTTG TAACTGTTTG TTCGAGGAGG ATGGAGAGGT TAAGTGGCTC GGCGAGAATG GATGTGTCAT GCCGGAGCTC GATAAAGACG TGGGTGTAGC GCCGTTCATG TACAAGAAGA GGGTTACAGA GTGGGCTGTC GAAGGTTTCT TGAAGCAATA CGGGTCCTCG GCGAAAAGAA GGGTTTACAT TGTGTATAGG AAGCCTAGCG GTCCGGAGGC TAGAGCGACC CCAGAGGAAC TTTTGTCGAA GCAGAATGAA TGGATTTACC TTGAGGGCGG GAGGCTTGAA ATCGAAGAAG TCCAGAAAGG CATCTCGGTA TCCGTGGATG GCGTGGAAAC GGTGAGCGTG GAGAGGCCTA GAGGCGCCAC AATACTGGTC GAGGTGGAGT CTTCCTATGA TTTGAAGAGC ATCGAGTACA CCTTGAATGG CGTGAAGAAA GTTTTCGACG TGAAGGGGAA GAGGCACGCC TTCAACGTGA AGGTTCCAGG AGAACCGGGT AGGTATGTTC TCAAAGTCAG AGCTGTTTTC GCCGACGATA CCTTCGATGA GAGAGATGTG GCCATCATAG TGAGGGGGAA GTGTAAAAGA AAGATCACTG TCTTAAGCGT GAGCGTCGGA GAGGAAATAG TCGGGCTTAA AGCTGATACG GCTCAAGACG GGGAGATTCT TTTGAGGTAC TTCAGGGATA GAGGGGTCCC GTTTAAGGCT ACTGTATCCA CTGAGTATAG CTATGGAGAC GAGGAGATGA TCGTTAACGT GAGGAAAAAG GTAAATAGTC CCGACGATGC AGACAAACTG CTCAAGATTC TTAAGGCAAT TCAAGCGTTA ACGCCTAACG CAGAGGTTAC ATTCGAGTTT ATGGAGCCGC AGAAAGTGGA CGAAGATATG GAGAAGAGGT TTAGGGGCCT TAAGGTTGTC TTTAGCGTAG AGCGGGAGGA GGAATGCTGA
|
Protein sequence | MSLVNLLAEG KVSPAVDIYE VYKSLFKGEK PEKIYEPYCD PRLFFQLTFV TDGFKQYLGD FLSKLASGES EVYVMPALLG AGKSHFLAFV LHILRLYRDC RGAGECVEKA LEELGVKLKV PSLEKVPEVL VFHGEHNVDL KPLDFSSKDT LKASLKPPVV LIFDETQHFE AKIRDFPLLM QMLAEAVEER RGVFLFVSFS LFSGERPDLA APKSLDAVRR VHYVTVSLDV TRNIVEVFRR WAGLSGARSV ELAGLKGIVT DERLREFENR LRGSYPFNPY LLDAVLQLAD ESLVEKTRVQ LTRGLLRILA SAYVNRRGEL VIFADLPEPK EVVIAGDVFA GQLNVILRLY EDDARKVSGS IAALSVLRHI LLATFFARLL PHRRMYPTEE ELILGSYDPA RVKPLDVKMF LEDAARQGLH IEKVNGRYMY WFIGGIEEKV RDAMYRFGDD DGLEVATDEV ASLARERAGP FSSVVIAGVG GTKALGKVKV VSSRDEWEKE LKDQDKAILA IDLLNFGVPV KRNNLIVVRR YDEGEPPQTT LELLKRVGEG PRTVREAVVD LGRLVKGVDE VYANLIDYFP ELLEEEMEDI LRRELEQLIR GRLENLKSRA KAYLRESVGL WLRRGVVGFK DVEKRGFDEL VGELVKDKRD RLRGVVKEIF TGDLINWDSF KKVGDLWSLF LNNESFPAIP ASFEEFLEAL REYCKGCNCL FEEDGEVKWL GENGCVMPEL DKDVGVAPFM YKKRVTEWAV EGFLKQYGSS AKRRVYIVYR KPSGPEARAT PEELLSKQNE WIYLEGGRLE IEEVQKGISV SVDGVETVSV ERPRGATILV EVESSYDLKS IEYTLNGVKK VFDVKGKRHA FNVKVPGEPG RYVLKVRAVF ADDTFDERDV AIIVRGKCKR KITVLSVSVG EEIVGLKADT AQDGEILLRY FRDRGVPFKA TVSTEYSYGD EEMIVNVRKK VNSPDDADKL LKILKAIQAL TPNAEVTFEF MEPQKVDEDM EKRFRGLKVV FSVEREEEC
|
| |
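The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a sequence might be cleaned, translated, and checked for GC content is given below. It uses only the Python standard library, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-residue assignments translation table 11 shares; table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases ordered TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence; trailing stop codons are dropped."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein.rstrip("*")


# First three blocks and last twelve bases of the gene sequence in this record.
print(translate("ATGAGTCTCG TGAATTTGCT TGCTGAGGGT"))
print(translate("GAGGAATGCTGA"))
```

The first thirty bases translate to MSLVNLLAEG and the final twelve to EEC, which agree with the start and end of the listed protein sequence. Applying `gc_content` to the full gene sequence should give a value comparable to the GC content field, although that figure is not recomputed here.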