Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0745 |
Symbol | |
ID | 4600396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 696409 |
End bp | 698241 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639773521 |
Product | hypothetical protein |
Protein accession | YP_920150 |
Protein GI | 119719655 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGTG GCGCGGGTAG TTCGAGGAAG GTCAGCGTCC TAGACCTGGC TAAGATGTGG AGGCTCCTGG CGGGCTTCGA GGGCGAGAGG GTTTCGCGGG TCGCGCAGAG CGGCGACGTC TTCCTTTTAA GGTTTAGGCG CGGCGCCCTG GTCTTCTCCG CGGCTAGGGG GGTCGCCCCG TGCGTGGAGG AGTGCACCCT CCCGGGTAGC TGGGAGAAGC CGAGGTGGGC GGGGGAGGTC GAGGGTAGGC GCGTGGAGGC AGTTGAGCAG GTGTCGGGCG ACAGGGTCGT AGCCTTCGAG CTCGGCCCGC GGAGGCTGGT CTTGGAGTGG GTGCGCGAGG GCAACCTCTT GTTGCTGGAC GGCGGGGGAA GGATCCTCCG TGCTCTGAGG CAGAGGGAGA TGAGGGACAG GGCGCTGAAG CCGGGGCACC AGTACGTCCC CCCGCCGAGG GTAGGCGACG CCTTCTCGGA CGACCCGGGC TACCTCTACG CCAAGCTGGG GGAGTATTCG GGGCGCGCGG CCGTTACTGC AGTCTCTCTC GTCGCATCGC TCCCGGCGGA GCTCGTGTAC GAGGCTATGT ACCGCCTAGG GCTCGACCCG TCCGCGAAGG CCCGGGCCCT CGGCGAGGAC GCGTTCAGGC GGGTCCTCTC GAAGAGCGTG GAGATATTCG AGGAGGCCCT CGCGGACCCG GACAGGGGCT TCAGGGTCTC CGGGGAGGTC TACGCGTTCA ATCCGGAGCA CCTGGGCGCG GGCGCCGAGG AGGTGTCCTT CGCCAGGGAG TTCCCGGCGT ACGTGCTCGG CCTAATCAAG CGGGACCTCC AGCCCGAGGA GGGAGCCGGG GCGCAGGAGG CCATCAGGAG GGCTGTCGAG GAGCTTTCCA GGAAGGCGGA GCTCCTGTCG AGGCACTCGG CAACCGTGGA CGAGGTTCTG GCGGCGTACA GGGGGCTCGT CGCGTCGAGG CTCCAGTGGA GCCTGGTGGA GGCGCGCCTC AAGGAGGCCT ACCCGATCGT CAAGTCCGTG GACCCGGCGC GGTCCAGGCT CGTCTTGGAG CTGGAGGGCG TGGAGGTAGA GGTCGACGCG TCCAGGAGCG CCCTCTCGAA CGCCGCGAGC TACTTCGAGA AGGCGAAGTC GGCGAAGAGG AAGCTCGCAG AGGCCTCCGC GGCGGTTGAG CGGAGCGCCG AGCCGGCGCC GGCGAGGCCC GCCAAGCCGG CGGCGTGGTA CGCGCAGTTC CGCTTCTTCT TCACCTCCAA CGGCTTCCTC GTAGTCGCGG GGAGGTCTGC TGGGCAGAAC GAGTTGCTCG TGCGCAGGTA CATGGAGCCC GGGGACATAT TCCTCCACGC CGACATCCAC GGGGCGGCGG CCGTCGTGCT CAAGACCGGC GGAAAGCAGC CCGGGGAGGC AGACATAGCC GAGGCGGCCC AGTTCGCCGC GTGCTTCTCC AGTGCGTGGA AGGGCGGGCT CTACGCGGTC GACGTCTTCT GGGTCCCCGC GGAGCAGGTG TCGAAGAAGC CGCCGAGCGG CGAGTACCTG GCGAAGGGTA GCTTCATGGT CTACGGGAAG AAGAACTACG TGAGGGGGGT GAAGCTGGAG CTCCTCGTGG GGCTGTGCCC GGGCGGCGAG CTCTGCGTCC TCCCGGCGCT GGCGAACCCC CGCGGGGGGT GCTTCCTGAA GGTAACCCCG GGCGTGTACA GGAAGGACCT GGCGGCCAGG AAGATAGCGG AGTTCGTCAG GAGGGAGTGC GGCGCGAAGG TCGGCGAAGA CGAGGTCGCC AGGCTCCTCC CGGACGGGGG GTTCCACCTG GAGAGGTGGC GTCCGTGGCC GGGGAGTACC TAG
|
Protein sequence | MSRGAGSSRK VSVLDLAKMW RLLAGFEGER VSRVAQSGDV FLLRFRRGAL VFSAARGVAP CVEECTLPGS WEKPRWAGEV EGRRVEAVEQ VSGDRVVAFE LGPRRLVLEW VREGNLLLLD GGGRILRALR QREMRDRALK PGHQYVPPPR VGDAFSDDPG YLYAKLGEYS GRAAVTAVSL VASLPAELVY EAMYRLGLDP SAKARALGED AFRRVLSKSV EIFEEALADP DRGFRVSGEV YAFNPEHLGA GAEEVSFARE FPAYVLGLIK RDLQPEEGAG AQEAIRRAVE ELSRKAELLS RHSATVDEVL AAYRGLVASR LQWSLVEARL KEAYPIVKSV DPARSRLVLE LEGVEVEVDA SRSALSNAAS YFEKAKSAKR KLAEASAAVE RSAEPAPARP AKPAAWYAQF RFFFTSNGFL VVAGRSAGQN ELLVRRYMEP GDIFLHADIH GAAAVVLKTG GKQPGEADIA EAAQFAACFS SAWKGGLYAV DVFWVPAEQV SKKPPSGEYL AKGSFMVYGK KNYVRGVKLE LLVGLCPGGE LCVLPALANP RGGCFLKVTP GVYRKDLAAR KIAEFVRREC GAKVGEDEVA RLLPDGGFHL ERWRPWPGST
|
| |