Gene Tpen_0427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0427 
Symbol 
ID4602106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp387754 
End bp389349 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content57% 
IMG OID639773192 
ProductDNA ligase I, ATP-dependent Dnl1 
Protein accessionYP_919839 
Protein GI119719344 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR00574] DNA ligase I, ATP-dependent (dnl1) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGCTCT TCCGAGAGCT CGTCGAAGCC GCCGAAGCCG CTGGGAGGGT TTCCGGCAGA 
AAGGAGAAAG TACGCATTCT CTCAGAGTTC CTTGGGAGGC TTTCCCCGGA GGAGGCTTCC
ATTGCGGCTA GGTTCCTCGC GGGGCACGTC TTCCCTGAAT ACGATAACCG AGAACTGGGC
GTCGGCTACT CACTTGTAAG AGAAGCGCTA AAGGTTCTCG ATCAAAAGCC TCTTTTTCTC
GTAGAACCTC AACCACCCGC GATCACCGAG GTTTACCGGC TACTCGAGCG GATTGCCGGC
GTGAGCGGTG AGGGTAGCAG GGAAAAGAAG CTGAAGCTTC TTGAGGGGCT ATTCTCGAGG
TTAAGCAGGA AAGAGGTTGA GTACTTGCTT CGCATACTGT TCGGGGAGGT TAGGATAGGT
GCCAGCACAG GGCTACTCCT AGAAGCCATA TCTAGGCTAT CGGGTGTGCC GTACGGAGAC
GTGCTTCACG CATACATGGT GCTGAGCGAC GTGGGGGACG TCGCCTTCAA AGCACTGGAA
TCGCCTGCCG CGCTTGGAAA GGTGGACGTA TCGCTGTTCC ACCCTGTCAA GCCGATGCTT
GCAGACATGG CTTACAATGT CGACGAGCTC TTCCAGGAGC ATGGTAGCCC TCTCCACCTC
GAGTACAAGT ATGACGGGCT ACGCGTACAG ATTCACATGG GGAGGGGGAA GGTCAGAGTC
TTTTCGAGGC GGCTCAGCGA CATAACTGAG TACGTCCCAG ACGTCGTGGA GCGCGCGGTG
GAGGGGCTAC ACGTAGAGGA GGCGGTTGTT GACGGAGAAG CTGTAGCCGT AATTGGCGGT
GCCCCAGTCC CTTTCCAGGA GTTGTTGCGA AGGGTTAGGC GGAAGAACGA GAGGGAAGAC
TTCCTCAAAA GCCTCCCCTT CGAGCTCCAC CTCTTCGACG TGATATACCT TAACGGAAAG
TCGCTAGTCA GAGAGCCTTA CTCTACTAGG AGCCGGCTAC TCCGCGAGAT CGTCGTGTCA
GAGGAAATCC TGGCGAAGAA GACCGTGGCG TATACACGCC AGGAAGCCTT GGAGTTCTAC
GAGTCCGCTG TAAGGAGCGG TAACGAGGGG GTGATGTCTA AGCGGCACTC CTCGGTGTAT
AAACCCGGGA TACGTGGGAG CGACTGGCTG AAGTTGAAGT CCTTCGACAC CATCGACTGC
GTCATTATAG CCGCGGAGTG GGGGCACGGC AGGAGGAGCG GGTGGCTAAG CGACTACCAC
CTTGGAGTGT ACGACGAGGA GAGCGGGAGG TTCCTCTCTG TAGGCAAAAC GTTCAAAGGG
CTGAGTGATG CGGAGTTCGA AGAGATGACG AAGCGGCTCT TACAGCTGAA GGTTAGGGAG
GAGGGTTACG TCGTCTACGT GAAGCCCGAG ATAGTGGTGG AGGTGGATTA CAGCGAGATT
CAGAGGAGTA AGCGGTACCC CTCGGGGTTC GCATTGAGAT TCGCGCGCAT TAGGCGTATA
CGCTTCGATA AGAGCCCCTA CGAGGTTACA ACCTTAAGGG AGCTCAGGGA AAGGTACTTG
CGCTCCATAC GCGCGAAGCG GTGGTCTCCG CAGTGA
 
Protein sequence
MALFRELVEA AEAAGRVSGR KEKVRILSEF LGRLSPEEAS IAARFLAGHV FPEYDNRELG 
VGYSLVREAL KVLDQKPLFL VEPQPPAITE VYRLLERIAG VSGEGSREKK LKLLEGLFSR
LSRKEVEYLL RILFGEVRIG ASTGLLLEAI SRLSGVPYGD VLHAYMVLSD VGDVAFKALE
SPAALGKVDV SLFHPVKPML ADMAYNVDEL FQEHGSPLHL EYKYDGLRVQ IHMGRGKVRV
FSRRLSDITE YVPDVVERAV EGLHVEEAVV DGEAVAVIGG APVPFQELLR RVRRKNERED
FLKSLPFELH LFDVIYLNGK SLVREPYSTR SRLLREIVVS EEILAKKTVA YTRQEALEFY
ESAVRSGNEG VMSKRHSSVY KPGIRGSDWL KLKSFDTIDC VIIAAEWGHG RRSGWLSDYH
LGVYDEESGR FLSVGKTFKG LSDAEFEEMT KRLLQLKVRE EGYVVYVKPE IVVEVDYSEI
QRSKRYPSGF ALRFARIRRI RFDKSPYEVT TLRELRERYL RSIRAKRWSP Q