Gene Tpen_0785 details


Gene Information

Locus tag: Tpen_0785
Symbol:
ID: 4601133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 735267
End bp: 737153
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 59%
IMG OID: 639773561
Product: hypothetical protein
Protein accession: YP_920190
Protein GI: 119719695
COG category:
COG ID:
TIGRFAM ID:
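
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(735267, 737153)
print(length_bp, protein_length(length_bp))  # prints "1887 628"
```

Both results agree with the Gene Length and Protein Length fields listed for Tpen_0785.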


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGGAGAG AGCACTGGTC GACGTTTCTG CTGTGTTTTC TCTTGCTGGT TCCAGCGCTC 
CGCGCTGCGC CCATCGCAGA GCACTACTAC GTGGAGGTCC TAGAGTACGT GGCTGGAGCT
TGGAAGAGTA GGTACGTGCC CGTCAACGTC AATTCCCCCT CGCTGACGGT TCAAACAGTT
GCAAGCGTGC TCGTCGTGCG CTCGGATCCC TCGTCCGGCC TGCAACCCGT ATCCGTAAGT
GTCGACGGCT CCAGGTACAG CGTTGTGAAC GGGACGGGCG TTCTGTGGTT CGCGGCGTCC
GTGGCGCTAG ACGGGAAAAT GCACGTGGTG GAGGTAAGGT TCCAGAAGGT CAGCCCGGGG
CCCGTAGTTT CCGGAGTTAT AGGGGTGCAG TCAAGCCTTC CGTCGTCGCC TAGCTTCAAC
GTGTCGGTAC CGCCGCTTCC CGGCTTCGTC GCCGCCGGCG TGAGGTTGGA GCTTTTACTC
CTGTCGCCCG GCGACGTGTT CAAAGTCCTC GACAAGCCAT TCTTCGTGCT AAATTCCTCT
TCGATAAGAG TCTTAGGGCA GGATATATTC GTCGCGGACG TAGTCGTGCC TTTCCTCAAC
GTGTCGCTAC GACCCGGTGC TATCAGCGTG AAGGCGTGGT ACCTCTACTT TATACCTCCG
AGCGACGACG AGGTGGTGTA CCCGCCTTAC AGCTTCAGGC TATTCTTCGC AAACCACCCC
TCGCTCCCGG GAGTCGAGGA AAACGCTCTA GCCGGGCGGC CACCCCACGT ATTGCTACGT
TTCGCGAGGG ACCTCTCGGA GAAGGCTGGG CTACGGAGCT ACAACGTTAG CGTGACCGTA
GCTAAGCCTG AGAGCCTCTG TGGGGTCAAG GACTACTCGT ACAGGGTTAT TCCCCCCGAG
GGAGGCGTGT TGGAAGGCTC TAGAGTAATC CTGGGCGCCA ACACTACCAT CACTGTGAGG
TTTTTCTCCG CCGGCATCTC CCTGGGAGAC GTCGTCGTCT ACACGCCGCC CCCCGAGCTA
CTCGTACAGC CGCCTATCTA CAGCCTCTCG TTGAAATTCA CTGATATCGC GGGCTACCCG
TTAAACAACA CATACTTCGT AGTGTACAGG GCCGGCGTCC CCGTGTACTC GGGCATCGCC
AGGGGAGGGG AAGCCGTGGT CTGCCCCCTA GCCGCGGGTA CCTACGACGT CGTAGCATAC
GTAGCGTCTA GGGTCGTGGG GAGGGGGCGC GTGACGCTCC TAGGCGACTC GGCCGCAGAA
ATACTCACGA ACACCACGAC TGTGAGCTTC CAGTTCGTGC GGCAAGGAGC CGGCGAAGTG
CTCACCTCTT ACAAGGCTGT CCTCAAAGGA GCTGTTGAGC TCGTCGCGAA CAGTTCCGCG
GAGGGCCTAG CCGTGTTCCA CGGAGTCCCC CCGGGTACCT ACTCGCTCGA GGTGTACTGG
AACAACACTA GGCTTGCGAG GTACTCAGTC GAGGTAGACT TGAAGGGCGG CAGGAGCGTT
CTCTCGATAC AGGCATACAG GCTTCAAGTG TTGGTGAGAA ACCTCCTGGA CCAACCGGTG
AAGGGTGCCG TGGTTTTCCT CGAGGGAGGA GGCTTCTCGT CTACCAGGCT CACAGACGAG
GCTGGAAGGG CGGACTTCGG GCTAGTACCG GCGGGGAACT ACACGTTGAT AGTAGAGGGG
GCCCAGCCCC AAACCGTCCG GCTGATCTCC GACACGTTCA GAGTCGTACA GGTAGACGAG
ATCGTGAAAA TAGGCGGTTT CACGGTGACC GGTAAAGTCG CGCTTTACGC ATTGGTGGCA
ACAGTCTTCC TCGCAGCAAT CGTTGCGGTT AGGCGAGCTT TGAAACGGAG GGAAAAGGGT
ATAGAGGAGG TAGACTTTGC GCGATGA
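
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the space-separated 10-base blocks used on this page; `clean_sequence` and `gc_fraction` are hypothetical helper names. Only the first sequence line is used as sample input, so the printed value describes that fragment and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (a 60-base fragment only).
first_line = "TTGAGGAGAG AGCACTGGTC GACGTTTCTG CTGTGTTTTC TCTTGCTGGT TCCAGCGCTC"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1887-base sequence to `gc_fraction` should allow a comparison with the reported GC content.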
 
Protein sequence
MRREHWSTFL LCFLLLVPAL RAAPIAEHYY VEVLEYVAGA WKSRYVPVNV NSPSLTVQTV 
ASVLVVRSDP SSGLQPVSVS VDGSRYSVVN GTGVLWFAAS VALDGKMHVV EVRFQKVSPG
PVVSGVIGVQ SSLPSSPSFN VSVPPLPGFV AAGVRLELLL LSPGDVFKVL DKPFFVLNSS
SIRVLGQDIF VADVVVPFLN VSLRPGAISV KAWYLYFIPP SDDEVVYPPY SFRLFFANHP
SLPGVEENAL AGRPPHVLLR FARDLSEKAG LRSYNVSVTV AKPESLCGVK DYSYRVIPPE
GGVLEGSRVI LGANTTITVR FFSAGISLGD VVVYTPPPEL LVQPPIYSLS LKFTDIAGYP
LNNTYFVVYR AGVPVYSGIA RGGEAVVCPL AAGTYDVVAY VASRVVGRGR VTLLGDSAAE
ILTNTTTVSF QFVRQGAGEV LTSYKAVLKG AVELVANSSA EGLAVFHGVP PGTYSLEVYW
NNTRLARYSV EVDLKGGRSV LSIQAYRLQV LVRNLLDQPV KGAVVFLEGG GFSSTRLTDE
AGRADFGLVP AGNYTLIVEG AQPQTVRLIS DTFRVVQVDE IVKIGGFTVT GKVALYALVA
TVFLAAIVAV RRALKRREKG IEEVDFAR
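
The gene sequence starts with TTG, yet the protein starts with M. This is consistent with translation table 11, where TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with only the standard library. It assumes the standard codon-to-amino-acid mapping (which table 11 shares) and treats only ATG, GTG and TTG as start codons, which is a subset of the initiation codons in table 11. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the table 11 initiation codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment, reading a leading start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "TTGAGGAGAG AGCACTGGTC GACGTTTCTG CTGTGTTTTC TCTTGCTGGT TCCAGCGCTC"
print(translate(first_line))  # prints "MRREHWSTFLLCFLLLVPAL"
```

The output matches the first 20 residues of the protein sequence listed above.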