Gene Information |
Locus tag | Tneu_0459 |
Symbol | |
ID | 6165839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 414578 |
End bp | 415702 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641667616 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001793852 |
Protein GI | 171184933 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
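The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 414578
END_BP = 415702


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1125, matches "Gene Length"
print(expected_protein_length(length_bp))  # 374, matches "Protein Length"
```

Because the gene is reported as not spliced, the whole span between the two coordinates is coding, so no intron lengths need to be subtracted.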
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000538339 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGAAGGTCG CCGTTTTTTC CGAATCCCTC TGGCCGCTGG GGGAGGGGGG CGCGGAGCTC GCCACGTACC TCTACGCGAG GTTGCTCGCT GAGCTGGGGG TCAGAGTTAG GGTCTACGTG AGGCGGGGGG GCGCGAGGTG GGAGGGGCTT GATGTGGCGG AGCTGGGGGG CGCGGGGACG AGGAAGTTCC ACGCCCCGCC GCTTGGGGCT AGGAGGGCGC TGGAGTGGTG CGACGTGGCC TACTTCGCCT CGGCCTACTG GGAGCTGGTG CCTCTGGCGA AGAGGATGGG TAGGAGGGCG GTTGTCCACC TACACAGCTA CGACCCAGCC TGTCCCGTGG GGACGCTTTA CAACGCGAGG GGGGGCTCCG TCTGCCGCCC TGAGACGAGG AGTTGCTGGG GGTGTATACA CCTCCAGGAG AGGTGGCTGG GGCGGCCCCT CTGGAGGGCG GTTGCCTCGC AGGTGTTAAA CGGCTTGTTC AACCCGCTGT TTGCCGAGGC CGTGAGGCGG GCGGATGCGC TGGTGTTCGT ATCTGAGGCG CAGCGTAGGC TCTTCGCCGA ACACTTCGGC GACGTGCCCA GGAGCCACGT GGTGTACAAC CCGGCGCCTC CTCTGCAGTA TCTCCCGCCG GGGGGTAGGG CGGTGGGCTA CTTCGGCGGG CTGAGCCCAT GGAAGGGGGT CTACGTGCTG TTGCGCGCGT GGGCGCGCCT CGGCGGGGGG GCGAGGCTGT ATATGACGCG GGCCTCCGCT CTGAGAAGCC GCCCCCCTGG CGTGGTGGCG CTTGGGGACC TAACGCCGTG GGAGCTGGAG GAGGTGCACA GGTCAGTCTC CGTGGTGGCT GTCCCCTCCC TCTGGTGGGA GCCCTTTGGC TACGCCGCTC TGGAGGGCTT GGTGAGGGGG CGGGTGGTCG TGGCTTCAGA CGTCGGAGGT CTTCCCGAGG TGGTGGGGGG CGCGCCCGGG GCTAGGCTTG TCCCTCCGGG GGACGCCGAC GCGCTGGCGG AGGCTCTGGA GTGGGCTCTG GCGGCGGACG CGGCGGAGCT TGGGGCTAGA AATAGGGAGT ACGCCCTTAG GAGGTTCGAC GGGGTTAGGC TTGCTGAAAG GCTTCTAAAG GTTTTGGAGG GCTAG
Protein sequence | MKVAVFSESL WPLGEGGAEL ATYLYARLLA ELGVRVRVYV RRGGARWEGL DVAELGGAGT RKFHAPPLGA RRALEWCDVA YFASAYWELV PLAKRMGRRA VVHLHSYDPA CPVGTLYNAR GGSVCRPETR SCWGCIHLQE RWLGRPLWRA VASQVLNGLF NPLFAEAVRR ADALVFVSEA QRRLFAEHFG DVPRSHVVYN PAPPLQYLPP GGRAVGYFGG LSPWKGVYVL LRAWARLGGG ARLYMTRASA LRSRPPGVVA LGDLTPWELE EVHRSVSVVA VPSLWWEPFG YAALEGLVRG RVVVASDVGG LPEVVGGAPG ARLVPPGDAD ALAEALEWAL AADAAELGAR NREYALRRFD GVRLAERLLK VLEG
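The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below show one way to join those blocks and to check two properties reported in the record: the GC content, and the presence of a start and a stop codon. The helper names are hypothetical, and the start-codon set is only the commonly used subset (ATG, GTG, TTG) of the starts allowed by translation table 11, not the full list.

```python
def clean_sequence(blocks: str) -> str:
    """Join space-separated blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Assumption: a subset of the table 11 start codons; stop codons are standard.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def has_start_and_stop(seq: str) -> bool:
    """True if seq is a whole number of codons with a start and a stop."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# First block of the gene sequence, used here only as a small example.
first_block = clean_sequence("GTGAAGGTCG")
print(gc_fraction(first_block))  # 0.6
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` should give a value close to the reported 68%. The gene begins with GTG and ends with TAG, so `has_start_and_stop` is expected to accept it; under table 11 an initiating GTG is translated as methionine, which agrees with the leading M of the protein sequence.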