Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0516 |
Symbol | |
ID | 6165896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 467680 |
End bp | 468945 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667669 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001793905 |
Protein GI | 171184986 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.628757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.125688 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGGG GCGCCCCCCT ATTAAAAGAT ATATGCCAGC CGGTGAGCTA CGCCATGGAG CGCTACGCCG TCGTCGCCCA CCACTACTGG GGCACCCCAG GAGGGGGCCA GCTCGTCTGC GCCGCCGCCG CCAAGGCGCT GGAGGAGGCG GGCTACAGGC CGGCCCTAGC CGGCACCTTC AAATTCGACC CCCGGAGATA CGTCGAGTGG TACGGCATAG ACATATCTAG GTACCCCGTC GAAACCCTCC CCATAGCCCC CAGGGCCTTC GGCCTCTGGA GCAGGCTCTA CGTCTGGCTC CCCGCCAAGA AAGCCGCCGA GAGGTACAAG CCGGAGCTCC TCTTCATCGA CGAAGTCGCC TACAAGCCCC TCGCCAGGGG GAGGAGGTTC AGGCTAGTGG AGTACATCCA CTTCCCCTTC GAGGTTGTGG TGGACCCAAG GTACAGGGGG ACCGGCCTCG CCTACGGGGA GGACCCCTAC ATCATGGAGC GGTACGGCAG ATTCCCCATG AGCCTCTACT GGAGGGCCTT CGTCTGGGGG CTCAAGAGAT ACGCCAGAGA AAACCCCTTC CACTACGCCG ACGCCGTGTT GGTCAACTCC CGCTGGACCG CCCAGGTGGC CAAGATGGTG TACGGCCAGG AGCCCCAGGT CCTCAACCCG CCCCTGCCCC CAAACGTAGA GGTGGTCGAG AAGCCGAGGC CCTTCGAGGA GAGGGAGCCC ACCGTCGTGA TGCTAGGCCG CTTCTCCCAG GAGAAGCGCT ACCACTGGGT CGTCACAGAG GTCGCGCCCA GGCTGTTGAA GGAGGTCCCC GGCGCCAAGA TCATAATCTT CGGCGGCGCC GCCACCCCCA CCCTACAGGC CTACAGAGAC AGGGTGAGGA AGATGGCGGA GGACGCCGGC CTAAAGACGG CAGAGACGCT AGACGCCCAC GCCCACATCT ACCTAATAGC CAACGCCCCC CGCCGCGTCA TAAACGACGC CATGGACAAG GCCAGGGCCT TCCTCCACGC CACCATAAAC GAACACTGGG GCATAGCGGT GGCGGAGGCC ATGGCCCGAG GCCTCACGCC GGCGGTCCAC AGGTCCGGAG GCGCCTGGAC AGACCTCGTC ATGGAGGGCA GATACGGCCT AGGCTACACA ACCGCCGAAG AGGCCGTAGA GGCGCTGGCG AAGCTCCTCA CCCAGAAGGC CAGCTACGCC CCCCAGGAGA GGGCCCGGGA GCTGGTCTTC CAGAACTTCG CCAGCGCCCT CCGGAGGTAC ATATGA
|
Protein sequence | MRRGAPLLKD ICQPVSYAME RYAVVAHHYW GTPGGGQLVC AAAAKALEEA GYRPALAGTF KFDPRRYVEW YGIDISRYPV ETLPIAPRAF GLWSRLYVWL PAKKAAERYK PELLFIDEVA YKPLARGRRF RLVEYIHFPF EVVVDPRYRG TGLAYGEDPY IMERYGRFPM SLYWRAFVWG LKRYARENPF HYADAVLVNS RWTAQVAKMV YGQEPQVLNP PLPPNVEVVE KPRPFEEREP TVVMLGRFSQ EKRYHWVVTE VAPRLLKEVP GAKIIIFGGA ATPTLQAYRD RVRKMAEDAG LKTAETLDAH AHIYLIANAP RRVINDAMDK ARAFLHATIN EHWGIAVAEA MARGLTPAVH RSGGAWTDLV MEGRYGLGYT TAEEAVEALA KLLTQKASYA PQERARELVF QNFASALRRY I
|
| |