Gene Tpen_0667 details


Gene Information

Locus tag: Tpen_0667
Symbol:
ID: 4601625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 617952
End bp: 619091
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 50%
IMG OID: 639773440
Product: DNA topoisomerase VI subunit A
Protein accession: YP_920072
Protein GI: 119719577
COG category: [L] Replication, recombination and repair
COG ID: [COG1697] DNA topoisomerase VI, subunit A
TIGRFAM ID:
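
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The record does not state these conventions, and the function name check_cds_lengths is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    # Assumption: the final codon is a stop codon that is not translated.
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


# Values copied from the record for Tpen_0667.
print(check_cds_lengths(617952, 619091, 1140, 379))  # True
```

Under these assumptions, 619091 - 617952 + 1 gives 1140 bp, which is 380 codons, or 379 amino acids plus one stop codon.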


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTCTC AGAGTCCTAA AGGAAAGGTG GCGGCGCCCT CCGACTCCGA GGTTCTTGCG 
AAGCTTGAGT CGCTGGCGAA GAAACTAGTG GAGCAGATGG AATCAGGAGT GCCACCTTAC
CTTGTCATTC CAGTGAGAAC CATGGCGAAC ACGATCTGGG ATCGGAAACG TAAACTGCTA
GTCCTAGGGC CCAAGACTGC CAGGAGAGAG TTTTTCGACA TAGGAGAATC CAAGCGGTTT
ATGCAGACAG TACTCATGCT TTCACTCATC GTTCAGGCTC GGCGGGAGGG CGATTACCCG
ACCATAAGGG AACTCTACTA TAGGGGTAAG CGCACAATCA AGTATACGGA CGAGCGCGGC
AATAAGGCCA GCGAGGAGAC CTGGGATGAC CAGAGAGAAT CCAACGCCGT GATACAAGAC
ATAGAGGTCG CCACAGGGCT ACTAAGAGAG CACATGGGTG TCACGCACGA CGCTAAGGGG
AGGATAGTTG GAAACATGAT TATACGCTCG AAGGGCGACG AGATAGATTT ATCAAAGATG
GGTAGCGGTG CCTGGGGGAT TCCGAGCTTC GTCGATAAAA TTGAAATACT TAGGGTTGAA
GCCGACTACG TGCTAGTTGT GGAGAAAGGA GCTGTTTTCG AGAAGCTAAA CGAGGAGGAG
TTCTGGAAGA AGAACAACTG TATCCTGGTA ACGGGCAAGG GGCAACCAGA CAGGAGTACG
AGAAGGATGG TTAGGAGGCT ATGGGAGGAG TTCGGCCTTC CGGTATACAT CTTGACCGAC
GGAGATAGCT ATGGCTTTTA CATTTACAGC GTCTACAGAA GCGGCTCAAT CTCGCTGAGC
TACGAAAGCG AGAGGCTAGC TACTCCCGAG GCGAGGTTCC TGGGAGTCTC GCCTTCTGAC
ATAGAACGCT ACGAACTACA GGGATTCACT ATAAAAGCCA CGGAGAGGGA TCTTAAACGC
GCGAAAGAGC TGATGAATTA CCCGTGGTTT AAGGAGTCTA AGCGCTGGAT GGAAGAACTG
GAACTCTTCT TAGAGAAGAA GGAAAAGGTC GAAATAGAAG CTTTCGCTAA GCACGGCCTT
AAATACCTCA GCTCAGAGTA TATACCTAAA AAGATAAAGA ATAAGCAGTG GATAATCTAA
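
The GC content reported in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases. The function name gc_content is hypothetical, and the demonstration input is only the first ten bases of the gene, not the full sequence.

```python
def gc_content(sequence):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Short demonstration input: the first ten bases of the gene sequence.
# Pass the full gene sequence instead to compare with the record's value.
print(gc_content("ATGTCGTCTC"))
```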
 
Protein sequence
MSSQSPKGKV AAPSDSEVLA KLESLAKKLV EQMESGVPPY LVIPVRTMAN TIWDRKRKLL 
VLGPKTARRE FFDIGESKRF MQTVLMLSLI VQARREGDYP TIRELYYRGK RTIKYTDERG
NKASEETWDD QRESNAVIQD IEVATGLLRE HMGVTHDAKG RIVGNMIIRS KGDEIDLSKM
GSGAWGIPSF VDKIEILRVE ADYVLVVEKG AVFEKLNEEE FWKKNNCILV TGKGQPDRST
RRMVRRLWEE FGLPVYILTD GDSYGFYIYS VYRSGSISLS YESERLATPE ARFLGVSPSD
IERYELQGFT IKATERDLKR AKELMNYPWF KESKRWMEEL ELFLEKKEKV EIEAFAKHGL
KYLSSEYIPK KIKNKQWII
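
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code. It does not handle the alternative start codons that table 11 permits; this gene begins with ATG, so that does not matter here. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCGTCTC AGAGTCCTAA AGGAAAGGTG"))  # MSSQSPKGKV
```

The ten residues printed match the first block of the protein sequence above.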