Gene Information |
Locus tag | Tneu_0523 |
Symbol | |
ID | 6165816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 474273 |
End bp | 475343 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641667676 |
Product | glycosyl transferase family protein |
Protein accession | YP_001793912 |
Protein GI | 171184993 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
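The coordinates and lengths in the Gene Information fields agree with each other under the usual 1-based, inclusive coordinate convention. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the sketch assumes the CDS includes its stop codon, which the record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS.

    Assumption: the CDS length includes one terminal stop codon,
    which does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp, End bp fields of this record.
length_bp = gene_length(474273, 475343)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result matches the Gene Length (1071 bp) and Protein Length (356 aa) fields.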
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.343071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGTGC TTGCGGCTGC TCTGGCGGCT GTACACTTCG GCGCGCCGGC CCTCTACCTG CTATACCTAC GCGCCGCCCC GAAGAAGCCC CTCCAGACGG CGGCTATATA CCCCAAGGTG GCGGTGGTCG TGCCCACCTA CAACGAGGCG CGGAACATAG AGGCTAAGCT CGAGGACGTA TACAGCCAGA GCTACCCCAG AGATAGGATG TCTATATACG TCGTCGACTC GGCCTCCACC GACGGCACAG CCGAGGCGGC GGAGCGGTGG GCCGCCGGCA GGAGAGACGT CAGAGTCGTG GTGCTGAGGG AGCCCAAGAG GCGGGGGAAG GCCCACGCCT TAAACACGGC CCTTGCCCAC CTCGCCGACG AGGAGGTGGT GGTGATCACC GACGCCGACT CCCGCTGGCT AGACCGAGAC ACGCTGAGGA GGGCCGTGGC CTACCTCGCC GCCGCCGACG CGGTCTCCTG CCTAAAAAGG CCGGCGGGGG GAGGCCCCAC GGAGGAGGCC TACCGCACGT GGTACAACAG GCTGAGACTC GCCGAGAGCC TAGTCCACTC CACCCCGGTC TTCCACGGCG AACTCGCCGC CTTTAGACGG GAGGCCATCG CCGGGGGGTT CCCGGAAGAC GTCGGCGCAG ACGACAGCTA CGCCGCCATT AGGATAGCCG CAGCGGGGGG ACGCGCCGTC ACGCCGCCGG ACGTGTGGTG CATAGAGGCG GTGCCCCAGA GGGGCTACCC CACGTGGCGC CTAAGGCGGG CGCAACACCT GATACAAGCC TTCGCGCGGG CGCTTCCAAA CGTCGCCAAG GCCCCGCCGC CCTACAGAGT AATCCTCGCC GCCGAGGCCT ACCTACACCT GTTTAACCCA TGGCTCCTCC CAGCCGCCGC CGCCCTAGCC GCCGCCTCCG GACCCCCCGG CCTGGCCCTC CTCGCCGCAG GCGCCGCCGC GTTGCTGTAC AAGCCCTACA GAGCCTGGGC GGCGGGCCAG ATATACCTAA TGGCAGCCGC CCTGAGAAAC ATATGGAACA AGGAACTCAT ATGGCAAAAA CAAGAAAAGC CGCCGCCGTA A
|
Protein sequence | MLVLAAALAA VHFGAPALYL LYLRAAPKKP LQTAAIYPKV AVVVPTYNEA RNIEAKLEDV YSQSYPRDRM SIYVVDSAST DGTAEAAERW AAGRRDVRVV VLREPKRRGK AHALNTALAH LADEEVVVIT DADSRWLDRD TLRRAVAYLA AADAVSCLKR PAGGGPTEEA YRTWYNRLRL AESLVHSTPV FHGELAAFRR EAIAGGFPED VGADDSYAAI RIAAAGGRAV TPPDVWCIEA VPQRGYPTWR LRRAQHLIQA FARALPNVAK APPPYRVILA AEAYLHLFNP WLLPAAAALA AASGPPGLAL LAAGAAALLY KPYRAWAAGQ IYLMAAALRN IWNKELIWQK QEKPPP
|
| |
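The Gene sequence begins with GTG while the Protein sequence begins with M, which is what translation table 11 gives for an alternative start codon. The relationship between the two sequence fields can be sketched in Python as below. This is an illustration only: the names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and the sketch assumes the Gene sequence is already shown on the coding strand (the record lists the gene on the `-` strand).

```python
# Codon table in the conventional TCAG ordering; the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS, ignoring whitespace between blocks.

    A permitted start codon in the first position is read as M.
    Translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 15 bases of the Gene sequence field, with its block spacing kept.
print(translate_cds("GTGCTTGTGC TTGCG"))
```

The first five residues produced from the opening 15 bases are MLVLA, matching the start of the Protein sequence field. Passing the full Gene sequence should reproduce the full Protein sequence if the record is internally consistent.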