Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0915 |
Symbol | |
ID | 4602128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 861919 |
End bp | 862983 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639773694 |
Product | hypothetical protein |
Protein accession | YP_920319 |
Protein GI | 119719824 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03605] SagB-type dehydrogenase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGTCA AGGAGAAACC CTCGTTTAGA CCTCGCCCTC TACTCAGGGA GGCTTTGTGG TGCCTAGCGA ACGGGCAACC TTTAAGGCCC GGGGAGGGCG ACGGGGATGC TTTGAGGAGG ATGCTTTTCC ACGTTCAGGG GTGCCGCGAG GGCAGGTTTA GGACTGTCCC CTCGGCTGGC GCCACTTACC CCTTGGAGGT CTACGTGTCG TCTAGGGGGG AGGTCTTCCG GCTGGAGAGA GGGCTCTACA GGTATGTTCC CTGCGGGGGC GGTTTGTCGA GGGTTGGGGG TGACGCCGGC TTCGAGGGCT TCGTCTTGAC GGCTTTCCCG GGCCGCACGA CGACGTACTA CGGCGAGAGG GGGTACAGGT ACGTCCGCAT GGAGCTCGGG CACGCCCTTC AGAACCTCTT CCTCTCTATC TACAGCCACG GCTTGGCTGG CTCGGTGGAG CTCGTGGACT ACGAGTTCAG GGCTGGGTCC GCCGAGTACG TGCTCGCTAG GGTAGCTGTT TCCAGGCCTG CGGGCTACTG CGCCGGCTTC GCCTTGGAGA AGGGGCTCCC CCTTGACGAC GTCGTAGCTG CGAGGCGTAG CGTGAGGAAC TACGCGAGGG CGTCGCTGGG CCTCGAGGCG CTGGACTCCA TAATGAAGTG GTCCATGGGG GAGGTCGTCG AGGGAGCGCG TCCTTACCCC AAGCTGGGGG GCGGCTACGC GGTCGAGGGC TACGTGGTTG CTACGAGCGT GAAGGGGGTT GAGAAGGGGT TGTACCGCTT CAACTCCTCC GAGCTCGAGC TAGAGCTAGT CCGGGTCGGC GAGTTCGCGG AGAAGCTGTG GAGGGCGGCG CTCATGCAGG GCTCCGTGAG GAAGGCGCCG GCCGTCGTGG TGCTGACGGG TAGGGGGCCG CTCGCCGAGG TAGAGGTCGG CGCGGTTGGG CAGAACATTT ACCTAAACGC GGTCCACGAG GGGCTTGGCA CCGTGGCTAT AGGGGCGTTC GAGGACGAGG AGGTTAGCAG TATACTCGAA GTCGAGGACC CGCTCTACCT GATGCCCGTT GGCAAGCCGG CTTAA
|
Protein sequence | MWVKEKPSFR PRPLLREALW CLANGQPLRP GEGDGDALRR MLFHVQGCRE GRFRTVPSAG ATYPLEVYVS SRGEVFRLER GLYRYVPCGG GLSRVGGDAG FEGFVLTAFP GRTTTYYGER GYRYVRMELG HALQNLFLSI YSHGLAGSVE LVDYEFRAGS AEYVLARVAV SRPAGYCAGF ALEKGLPLDD VVAARRSVRN YARASLGLEA LDSIMKWSMG EVVEGARPYP KLGGGYAVEG YVVATSVKGV EKGLYRFNSS ELELELVRVG EFAEKLWRAA LMQGSVRKAP AVVVLTGRGP LAEVEVGAVG QNIYLNAVHE GLGTVAIGAF EDEEVSSILE VEDPLYLMPV GKPA
|
| |