Gene Information |
Locus tag | Tpen_1250 |
Symbol | |
ID | 4600545 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1186233 |
End bp | 1188059 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639774026 |
Product | nucleotidyl transferase |
Protein accession | YP_920651 |
Protein GI | 119720156 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1213] Predicted sugar nucleotidyltransferases |
TIGRFAM ID | |
|
|
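The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(1186233, 1188059)
    print(length_bp)                  # 1827
    print(protein_length(length_bp))  # 608
```

Both values agree with the Gene Length (1827 bp) and Protein Length (608 aa) reported in the record.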
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGTAG TTATTGTAGC GGCAGGCGAG GCTTCGAGGC TTAGGCCTTA CTCCGAGGAA ATGCCGAAAA CACTCATGGA ACTGAGGCCC GGCCTTCCTA TAATCGACTT CATACTGAAA AGGGTTATGG CTCTCTCTCC TAGGAAGGTG GTGGTTGTTA CGAGAAGGAC GTGGAAGGAT ATTCTCGCCT CGCATCTCGC CGGCTTAGCC GAAGTCGTTA CAGTAGACCT CGAGGGCGGG TTTGGGAACC TCTACAGCGT GTACACGGCG CTCACAAGAG TCGGGGACGA GTTCCTGATA TTGATGTCTG ACCACATATT CGAGGAGGAA GTACTGATGA GGCTTGCAGA ACACGCGAGC AAAGCGTCGT TTACCGTTTG TCTCGACAGG AATCCCTCCC GCTCCGAGGC TGTGGAAGGT TTGAAAGTTA TAATCGAAGG AGGACTGGTT AGGGACGCTG GCAAAGATGC GGCGCCCCGG TACGGGATCG ATACAGGGGC GATAATGGCG CGCGGGAGAG CCAAAGAGTT TATCGAGGAA GTGATCAGAC AGAAAGGTCC AGGGGCATCT ATTGCCGACG TGCTGAGATT CGCCGCCTCT AAGGGAGAGG AGGTCGACTA CGTAGACGTT ACGGGGCTTT TATGGAAAGA TATCGACACC CCCGAGGACC TCGTTAAGGC TAGGAAACTC GTCGAGAAGG TGGAGCTCAG GGAGCTAGCG AAGAGGACTG GGGATCCCCT CTCGCGAGCT CTTTTGAAAC CCCTAACGAC GAGGCTTTCG ATGCTCGCCT TAAAGAACTC TCTGCTGAAA CCCGCCATGC TTACTAGCTG GGCTTCGGCG TTTTTAGCAT CGTGGGCTCT GCTTCTCCCA CAGGTGGAGA GCAACGCGCT TCTAATGTTT CCCCTATTAG CCGTGCACGT AGTAGCGAGC GAACTTCTAC GCGAGCTAGC AGTGCTAGCA TCGTGGGACG TCACGGCGAT ATCGCAGGTA GATTTCCTCG CTACACTACC CTTAGCGGCT TTTGTCTCAA GGCTTGCCGA TACATGGCCT TTGTCTGTGC TTGCGGGGGC TGGGATACTT TTCACTTCGC TTGCCGGGCT CCCCAGAATG GCAAGCGCGG AGGGTTACTC AGGGTTACTG GTCTCCAAGC TTCTCGCTAC TATCGTGCTT TCGCTCTCTC TTTTCGTGGG TGGGGCGATC TACGGGGTAG CCTTTTGCGC ACTAATGCCC TGGTTGCACA TAGCTTTAAG GCTTAAGCCG CGTAGATCTG GGCTTGCCAG AGGGAAAGAG CCTTCCCATG TACCCATCCC CGCGGTTCGT GTTGACGAGA TCGCCGAGAA AATCGAGTCC CTTGTAACGA ATGTGGTGAA GCTAGCGGTG GTTCTAGCTG TGCTTACGAT GCTCTCCCCC CTCTCATCGG ACGTCCTCTT CCAAGTCGAT GAATATTCGC TACGCGTGGG GACTCTTCTA ACGGTCGCGC AGTTAACCGC CGTTGTGTAC TATGGCTATA AGATCCTGGA ACCTCTCTTT AGCCTGCTAG ACGCCCTGGC GACAAGACTC GTCGAGAAAC TCGGGGTAAC AAGGTACACT GCTAGGCGTA TACTGGTAGA GTCCGTGTAC TTGGTGCTAA TAGTGCTTAC ATGGTTCCTC CTCCCCAGCC TCAAGAACAT ACCCGGTATT GGCGGTGTGC TCTACAGGGT TCTAACGATA GTTATCGTCG CGCTTTTTGC CCTTATCTCC TACGACTTGG TGAGGCTTCT CTTCAGGGTA TTCGAGGACA CGCTTAAGCG CATAGTACAC 
GCTATAGCCC GTTTTGTCTC CGGGTGA
|
Protein sequence | MDVVIVAAGE ASRLRPYSEE MPKTLMELRP GLPIIDFILK RVMALSPRKV VVVTRRTWKD ILASHLAGLA EVVTVDLEGG FGNLYSVYTA LTRVGDEFLI LMSDHIFEEE VLMRLAEHAS KASFTVCLDR NPSRSEAVEG LKVIIEGGLV RDAGKDAAPR YGIDTGAIMA RGRAKEFIEE VIRQKGPGAS IADVLRFAAS KGEEVDYVDV TGLLWKDIDT PEDLVKARKL VEKVELRELA KRTGDPLSRA LLKPLTTRLS MLALKNSLLK PAMLTSWASA FLASWALLLP QVESNALLMF PLLAVHVVAS ELLRELAVLA SWDVTAISQV DFLATLPLAA FVSRLADTWP LSVLAGAGIL FTSLAGLPRM ASAEGYSGLL VSKLLATIVL SLSLFVGGAI YGVAFCALMP WLHIALRLKP RRSGLARGKE PSHVPIPAVR VDEIAEKIES LVTNVVKLAV VLAVLTMLSP LSSDVLFQVD EYSLRVGTLL TVAQLTAVVY YGYKILEPLF SLLDALATRL VEKLGVTRYT ARRILVESVY LVLIVLTWFL LPSLKNIPGI GGVLYRVLTI VIVALFALIS YDLVRLLFRV FEDTLKRIVH AIARFVSG
|
| |
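The gene and protein sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the Python below strips that spacing, computes a GC fraction, and translates a nucleotide string codon by codon. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs mainly in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only short excerpts of the gene sequence are used here rather than the full record.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    first_30 = "ATGGACGTAG TTATTGTAGC GGCAGGCGAG"   # start of the gene sequence
    last_27 = "GCTATAGCCC GTTTTGTCTC CGGGTGA"       # end of the gene sequence
    print(translate(first_30))  # MDVVIVAAGE
    print(translate(last_27))   # AIARFVSG
```

The two excerpts translate to the first ten and last eight residues of the protein sequence listed in the record. Applying `gc_fraction` to the full gene sequence would be one way to check the reported GC content of 54%.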