Gene Tpen_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1840 
Symbol 
ID4600341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008696 
Strand
Start bp47 
End bp1714 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content62% 
IMG OID639772438 
Producthypothetical protein 
Protein accessionYP_919098 
Protein GI119709758 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGAGG AGAGGAAGAA GCCTGTAGAG GTCAAGGAGT ACGAGGCTTC CGAGTACCTC 
AAACAGCTGG GCTACTCCGA GGAGGAGCTT AGGCGGATGG GCGTTCTGCC CGCGGAGGAG
GTGCCGGAGC CCGTCGAGGT CGAAATCGAG GAGATACCCG AGGCGGAAAT CGAGGAGGCG
CCCGAGGTGG AGGTGGAGAT AGAGGAGTAC CCGGGCGAGG AGGAGGTTGC AAAGGCTAAG
GAGGAGCACG AGAGACAGGT CGCCGAGGTC ATGGACAAGA TTAGGGACGC CGTCCGCGGC
ATGATGAAGA CGCGCGCTTG GGTCATGGTT CACGGTAAGC CCTTGGAGAC ACTGCCCCCG
CTTAAAAAGG AGGACATAGC CGCGGTGGCG AAGACTCCGG AGGGCGACGT CTCGATAATC
ACGAAGGAGG GTAAGGAGTA CGTAGTCGTA GCCGGCGAAA AGGTATACGA GGTGGAGCTA
CCCCAAGAGG ACAAGCTCGC GGTTCAGAGG CTAACGGCTG AGCTCAGCAA GAGGTACGGA
GCCGCCGCCG TCACGGAGGC TGTGGACAGG ATGCTGAAAG AGTACGCGCC GAGGCTGTAC
CCCGCGCTCA AGGAGAAGGC TCCCGCCGCG GTCGCCGCGG TCGGCAAGGC TAAGGAGGCT
GTTAGCGGTG CCACGAAGGG CTGGAAGCTC GCGCTCTACC GGGCGTTGTA CCCGGGCGTG
CTGAGCACGT ACGACTACTT CTCGCGGATG ACGGAGGCAA CGCTGAACAT GGTGGCGGCT
CTGGGCTTGA CGGGGCTCCT AGCCTACCTG ATTACGTGGG CTAGGGGCTT CGCCGCGTCG
GGGGCGGGCG GGCTTTGGGG GTTGATGGAG GAGAAGGACG TGGTCGTCGC GCTGTTCAAC
TTCCTGCTGG ACTTGGCTAG CTACACGCTG ATACCCGCCT TGCTAGCCGT CTTCGTATTC
CTCTTGGTGT ACTTGGGCGG GGCAATCACG GGGGCGGTTC GCCTGATAAG GGTGGGGGGA
GAGTCTAGGA GCGTCGTTGT ACCCCTGTTC AGCTTCGTCT CCGCCGTCCT AAACGCGATG
TTCAGGCAGA TGCTCGTAGG CTACTTGGTG GGGGCTGTGG CGGAGTACGT GGCTATAGTA
ATCGTGGCGC TCTTGTCTAG CGTGGTTCTG CTGTGGCTCG GACTGCTAGC CGTCTCCGCT
TTGACGGGCT TGTTCTTGGG CGGCATACAG GGAGCCTTGG TCTTCGCGCT CCTAGTATTC
CTCCCCGTGG CTAGCGTGCT CGCGGGGGCG CTGACGCTCG TACTGCTGGG CAGGAGTAGC
AGGGCTATGC TGAGGCTCAC CGATTTGCCC ACGCTGTTGC TAGCCACGGT GGTAGCAGTT
AGCTACAAGT TCGCCTTCAT ACAGCCCCTG CTCTGGCTGG TCTTGGCCGC CGTGGTCGTC
ATCCTAGGCA TAATAGCGGC GTCTAGACCG GTCGAGAGGC TCATGGTCTT GCTGAGGGGA
GTCGCGGTGA TAGCCGGGGC GATATTCGCG GTTCACTTGG GCTACTCGGG GATAGAGGAC
TTCGTGGTGC CCGCCTTCCG CCTGTACTGC CAGATTCTAG GGCTCCCCGA GGGGGTAGTC
AACACCGCGG TGGACTGGGC TAGGAAGATT CTCTACGTAA TACCGTAA
 
Protein sequence
MAEERKKPVE VKEYEASEYL KQLGYSEEEL RRMGVLPAEE VPEPVEVEIE EIPEAEIEEA 
PEVEVEIEEY PGEEEVAKAK EEHERQVAEV MDKIRDAVRG MMKTRAWVMV HGKPLETLPP
LKKEDIAAVA KTPEGDVSII TKEGKEYVVV AGEKVYEVEL PQEDKLAVQR LTAELSKRYG
AAAVTEAVDR MLKEYAPRLY PALKEKAPAA VAAVGKAKEA VSGATKGWKL ALYRALYPGV
LSTYDYFSRM TEATLNMVAA LGLTGLLAYL ITWARGFAAS GAGGLWGLME EKDVVVALFN
FLLDLASYTL IPALLAVFVF LLVYLGGAIT GAVRLIRVGG ESRSVVVPLF SFVSAVLNAM
FRQMLVGYLV GAVAEYVAIV IVALLSSVVL LWLGLLAVSA LTGLFLGGIQ GALVFALLVF
LPVASVLAGA LTLVLLGRSS RAMLRLTDLP TLLLATVVAV SYKFAFIQPL LWLVLAAVVV
ILGIIAASRP VERLMVLLRG VAVIAGAIFA VHLGYSGIED FVVPAFRLYC QILGLPEGVV
NTAVDWARKI LYVIP