Gene Tpen_0983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0983 
Symbol 
ID4600457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp931492 
End bp934551 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content52% 
IMG OID639773761 
Producthypothetical protein 
Protein accessionYP_920386 
Protein GI119719891 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.397538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTCG TGAATTTGCT TGCTGAGGGT AAGGTTAGCC CCGCGGTAGA CATCTACGAG 
GTGTATAAGA GCCTCTTCAA GGGGGAAAAG CCTGAGAAGA TCTACGAGCC TTACTGTGAC
CCCCGCTTGT TCTTCCAGCT GACGTTTGTG ACGGACGGGT TTAAGCAGTA CCTCGGCGAT
TTTCTCTCGA AGCTCGCGTC CGGAGAGTCT GAAGTGTACG TTATGCCGGC TCTTCTGGGG
GCGGGTAAAT CGCATTTCTT GGCTTTCGTA CTGCATATAC TCAGACTTTA CAGGGATTGC
AGGGGCGCTG GGGAGTGTGT CGAGAAAGCT TTGGAGGAGC TGGGAGTGAA GCTTAAAGTT
CCCTCTCTCG AGAAGGTTCC CGAGGTTTTA GTGTTCCACG GGGAGCACAA CGTTGACCTT
AAGCCGTTGG ACTTTTCAAG CAAGGATACT CTTAAAGCCT CCTTGAAGCC CCCCGTGGTC
TTGATATTCG ACGAGACACA GCACTTCGAG GCAAAGATTC GTGACTTCCC GTTGCTCATG
CAGATGCTCG CAGAGGCTGT CGAGGAGCGG AGGGGGGTCT TTCTCTTCGT GTCCTTCTCC
CTATTCTCCG GTGAAAGACC GGATCTTGCG GCTCCCAAAT CCTTGGATGC TGTGCGCCGT
GTCCACTACG TGACCGTCTC GCTGGATGTA ACGCGGAACA TAGTCGAAGT ATTCAGGAGG
TGGGCAGGGC TTAGTGGCGC GAGAAGCGTG GAGCTAGCCG GGCTCAAGGG CATTGTAACC
GATGAAAGGC TCAGGGAGTT TGAGAACAGG CTTCGAGGCT CCTACCCATT CAACCCGTAC
CTGCTGGATG CTGTTTTACA GCTTGCAGAC GAGTCCCTAG TTGAGAAGAC AAGGGTTCAG
CTGACTAGAG GGCTTCTGAG GATACTGGCC TCTGCCTACG TTAACAGGAG AGGCGAATTA
GTGATATTCG CAGATCTACC AGAGCCAAAA GAGGTAGTTA TTGCCGGCGA TGTTTTTGCC
GGGCAACTGA ACGTCATCTT GAGGCTCTAC GAGGACGACG CTAGGAAGGT TTCTGGAAGC
ATAGCTGCTC TTTCCGTGCT ACGCCACATT CTCCTTGCAA CCTTCTTCGC CAGGCTTCTT
CCACATCGTC GAATGTATCC AACCGAGGAG GAACTTATAC TCGGTAGCTA CGACCCGGCG
AGGGTGAAGC CTCTTGACGT TAAGATGTTC CTCGAGGATG CCGCGAGGCA GGGCTTGCAT
ATAGAGAAAG TCAACGGTCG CTACATGTAC TGGTTCATCG GAGGCATAGA AGAGAAAGTC
AGGGACGCCA TGTACAGGTT CGGCGATGAT GACGGACTTG AAGTGGCTAC GGACGAGGTC
GCAAGCCTTG CCAGGGAGAG GGCAGGACCT TTCTCGAGTG TAGTAATCGC GGGCGTCGGA
GGTACTAAGG CTCTCGGCAA GGTTAAAGTC GTGTCGAGTA GGGACGAGTG GGAGAAAGAG
CTAAAGGATC AGGATAAGGC TATACTCGCT ATAGACCTCC TTAACTTCGG AGTACCGGTT
AAGCGGAATA ATCTCATCGT TGTGAGAAGA TACGATGAGG GAGAACCTCC GCAGACTACC
TTAGAGCTGT TAAAAAGAGT AGGCGAGGGA CCCAGAACTG TAAGGGAGGC TGTCGTGGAT
CTCGGACGCC TAGTAAAGGG GGTAGACGAG GTCTACGCGA ACCTTATTGA CTACTTCCCG
GAACTTCTAG AGGAGGAGAT GGAGGATATT CTTCGAAGGG AGTTGGAGCA ACTTATTCGA
GGAAGGCTTG AAAACCTGAA AAGCCGTGCA AAAGCGTACC TTAGGGAAAG CGTGGGGCTA
TGGTTGCGGC GTGGCGTTGT GGGCTTTAAA GACGTAGAGA AGCGCGGCTT CGACGAGTTG
GTAGGAGAGC TTGTCAAAGA TAAGAGAGAC AGGCTTCGCG GAGTAGTCAA GGAGATATTC
ACGGGCGACC TTATAAACTG GGATAGCTTC AAGAAGGTTG GAGACCTTTG GAGCCTTTTC
CTAAACAATG AATCATTCCC AGCGATTCCG GCGTCCTTCG AGGAGTTCCT AGAAGCACTG
AGGGAGTACT GCAAGGGTTG TAACTGTTTG TTCGAGGAGG ATGGAGAGGT TAAGTGGCTC
GGCGAGAATG GATGTGTCAT GCCGGAGCTC GATAAAGACG TGGGTGTAGC GCCGTTCATG
TACAAGAAGA GGGTTACAGA GTGGGCTGTC GAAGGTTTCT TGAAGCAATA CGGGTCCTCG
GCGAAAAGAA GGGTTTACAT TGTGTATAGG AAGCCTAGCG GTCCGGAGGC TAGAGCGACC
CCAGAGGAAC TTTTGTCGAA GCAGAATGAA TGGATTTACC TTGAGGGCGG GAGGCTTGAA
ATCGAAGAAG TCCAGAAAGG CATCTCGGTA TCCGTGGATG GCGTGGAAAC GGTGAGCGTG
GAGAGGCCTA GAGGCGCCAC AATACTGGTC GAGGTGGAGT CTTCCTATGA TTTGAAGAGC
ATCGAGTACA CCTTGAATGG CGTGAAGAAA GTTTTCGACG TGAAGGGGAA GAGGCACGCC
TTCAACGTGA AGGTTCCAGG AGAACCGGGT AGGTATGTTC TCAAAGTCAG AGCTGTTTTC
GCCGACGATA CCTTCGATGA GAGAGATGTG GCCATCATAG TGAGGGGGAA GTGTAAAAGA
AAGATCACTG TCTTAAGCGT GAGCGTCGGA GAGGAAATAG TCGGGCTTAA AGCTGATACG
GCTCAAGACG GGGAGATTCT TTTGAGGTAC TTCAGGGATA GAGGGGTCCC GTTTAAGGCT
ACTGTATCCA CTGAGTATAG CTATGGAGAC GAGGAGATGA TCGTTAACGT GAGGAAAAAG
GTAAATAGTC CCGACGATGC AGACAAACTG CTCAAGATTC TTAAGGCAAT TCAAGCGTTA
ACGCCTAACG CAGAGGTTAC ATTCGAGTTT ATGGAGCCGC AGAAAGTGGA CGAAGATATG
GAGAAGAGGT TTAGGGGCCT TAAGGTTGTC TTTAGCGTAG AGCGGGAGGA GGAATGCTGA
 
Protein sequence
MSLVNLLAEG KVSPAVDIYE VYKSLFKGEK PEKIYEPYCD PRLFFQLTFV TDGFKQYLGD 
FLSKLASGES EVYVMPALLG AGKSHFLAFV LHILRLYRDC RGAGECVEKA LEELGVKLKV
PSLEKVPEVL VFHGEHNVDL KPLDFSSKDT LKASLKPPVV LIFDETQHFE AKIRDFPLLM
QMLAEAVEER RGVFLFVSFS LFSGERPDLA APKSLDAVRR VHYVTVSLDV TRNIVEVFRR
WAGLSGARSV ELAGLKGIVT DERLREFENR LRGSYPFNPY LLDAVLQLAD ESLVEKTRVQ
LTRGLLRILA SAYVNRRGEL VIFADLPEPK EVVIAGDVFA GQLNVILRLY EDDARKVSGS
IAALSVLRHI LLATFFARLL PHRRMYPTEE ELILGSYDPA RVKPLDVKMF LEDAARQGLH
IEKVNGRYMY WFIGGIEEKV RDAMYRFGDD DGLEVATDEV ASLARERAGP FSSVVIAGVG
GTKALGKVKV VSSRDEWEKE LKDQDKAILA IDLLNFGVPV KRNNLIVVRR YDEGEPPQTT
LELLKRVGEG PRTVREAVVD LGRLVKGVDE VYANLIDYFP ELLEEEMEDI LRRELEQLIR
GRLENLKSRA KAYLRESVGL WLRRGVVGFK DVEKRGFDEL VGELVKDKRD RLRGVVKEIF
TGDLINWDSF KKVGDLWSLF LNNESFPAIP ASFEEFLEAL REYCKGCNCL FEEDGEVKWL
GENGCVMPEL DKDVGVAPFM YKKRVTEWAV EGFLKQYGSS AKRRVYIVYR KPSGPEARAT
PEELLSKQNE WIYLEGGRLE IEEVQKGISV SVDGVETVSV ERPRGATILV EVESSYDLKS
IEYTLNGVKK VFDVKGKRHA FNVKVPGEPG RYVLKVRAVF ADDTFDERDV AIIVRGKCKR
KITVLSVSVG EEIVGLKADT AQDGEILLRY FRDRGVPFKA TVSTEYSYGD EEMIVNVRKK
VNSPDDADKL LKILKAIQAL TPNAEVTFEF MEPQKVDEDM EKRFRGLKVV FSVEREEEC