Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0462 |
Symbol | |
ID | 6166075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 417462 |
End bp | 418628 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641667619 |
Product | glycosyl transferase family protein |
Protein accession | YP_001793855 |
Protein GI | 171184936 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.215189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000000773119 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGTTTTC GGCACGACGT GTGTATAACC GTTGTGCTTC CAGTTTTAAA CGAGGCCGAG GCCCTCCCCC GGGTAGTTGA GGAGCTTAGG GCGGCGGGGT TCAGCAACAT CTTGGTGGTA GACGGCGGCT CCACAGACGG TAGCGTCGAG GTGGCGAAGA GGCTTGGCGT TAGAGTGGTC CCCCAGATGG GCAGGGGGAA GGGGATGGCC GTGAGGACGG CCCTCATGTA CGTCGACACC CCCTACGTGG CTTTTCTAGA CGCCGATTAT ACATATCCGG CTGAGGACCT CAAAAAGCTG TTGCCCCTCC TCCGCCACTA CGACGTCGTC CTCGGCGCTA GAAGGGGCGA GATGCCCCTT GTGTACAAGC TGGGCAACGG GGCCTTGGGC TGGCTCTTCA GGCTTCTCTT CGGCGTAGAC ATAAGGGACC CCCTCACTGG GATGTACGCC GCCAAGACGG AGGTGCTTAG AGACGCCGCT CTCGAGGCCA GGGGCTTCGA CCTAGAGGTT GACCTCCTCG CGAAGGCTCT GGCGGCCGGG GCTAGGGTGG CCGAGGTGGA GATCGGGTAC AGGAGGAGGG TCGGAAAGAA GAAGCTGAGG CCTTGGCACG GCCTATCCAT CGCGTTGAAG TCCCTCTCGC TTGCATACCG CCTCAACCCC ACCCTTTCCC TATCCCTCCT CGGCGCGCTT CTCCTCGTGC CTGGCGTCGC GCTGGGGTCG TGGGTGGCGT ATAGGTTCTT CTACCAAGGC GTCCCCCACT ACATGCTGGG CCTCCTCTCC CTGATCCTGT TGATGCTCGG CGCCCTATCT GTGGCGTTGC TCCCGCTGGC CACGGCGGTG TTGAGGCTTC AAGCCGCCGT GAGGAGGAGG GATCTACGCC TCCCCACCGA CTGCCTCCCT CCCATGCCCG AGCCAGCGCC GCCGGCGGCG CCGGCCTCGG CTGAGGGGAG CGAGGCGGAG ACGCCGCTTC TCCACGTGGG GAGGGGGCTT GTCTTGTCCT TCATGGCGCT TCTAGCCGTA GCGGCGTACT ACCTGGGGGT CGGCGACGCC GCAACAGCTA ACAAGCTGGC CGAGTGGGCC TACTACGCCT TGGCGGGCGC CGTCGTTGCG CTACTCGTCG ACGCCGCGGT GGTCAGCCGC CGTGGACAGC GAGGAAGTCA GACATAG
|
Protein sequence | MSFRHDVCIT VVLPVLNEAE ALPRVVEELR AAGFSNILVV DGGSTDGSVE VAKRLGVRVV PQMGRGKGMA VRTALMYVDT PYVAFLDADY TYPAEDLKKL LPLLRHYDVV LGARRGEMPL VYKLGNGALG WLFRLLFGVD IRDPLTGMYA AKTEVLRDAA LEARGFDLEV DLLAKALAAG ARVAEVEIGY RRRVGKKKLR PWHGLSIALK SLSLAYRLNP TLSLSLLGAL LLVPGVALGS WVAYRFFYQG VPHYMLGLLS LILLMLGALS VALLPLATAV LRLQAAVRRR DLRLPTDCLP PMPEPAPPAA PASAEGSEAE TPLLHVGRGL VLSFMALLAV AAYYLGVGDA ATANKLAEWA YYALAGAVVA LLVDAAVVSR RGQRGSQT
|
| |