Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0513 |
Symbol | |
ID | 6165864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 464884 |
End bp | 465951 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641667666 |
Product | glycosyl transferase family protein |
Protein accession | YP_001793902 |
Protein GI | 171184983 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.149546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0426124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCTGGC CGCGGGTGTC TATCCTCTGG CTTAACTACA ACAGCGGCAG ATTCTTGGAC GTCGTTCTCG ACTCTCTCAG GGGGGTGGCG GAGCTCGACT ACCCGGACTA CGAGCTGGTT GTCGTGGACA ACGGCTCCAC CGACGGGAGC AACAGGGCGG TTAGGGAGTT CGTAGAGAGG TGGAGGGGGA GGGGCGGCAG GGCTAAGTTC ATCCAGCTGG ATAGGAACCT GGGCTTTACG GGCGGCAACA ACGTCGCCTT TAGGGCGAGG GACAGGGAGA GCAAATACGT CGTTCTGCTG AACAACGACG CGGTTCCCGA GCCCGGGAGC TTGAGGACCC TGGTCGAGTA CCTGGAGAGG GACGGGAGGC TGGGGGCGTG TCAGGGGGTG GTGGTGAAGT ACGGCGACCC CTCCGTGGTA GACACCGCGG GTGATTTCCT CGACGAGTTG CTCCGCCCCA TGGCTCTTTT CGAGGGGAGG AGGGGGCAGC CCCTCTCCAG GCCCATCTAC ATCACCTACC CCGACGGCTC CTACTCTATC TACCGGGTGG AGGCCGTGAG GAGGGCGGTG GGTGGAGAGA GGCTTTTTGA CGACTGGGCT TTTGCCTATT TTGACGACAA CGTGCTTGGG CTTAGGCTCT GGAACGCCGG CTACAGGGTC ATCTCCGTCC CCGTGGTGGC CGGGAGACAC AGGAGGAGCG CCACCTTCGG CTGGGCTAGC CCCTTCCAGC TGTACCACGC CTTCAAGGGG AAGATAGCCC TCCTCAGAAT CACCAACCTC CGGCGTAGGC GGCTGGTGTG GGCGTTCTAC GCCAAGGTGT TGGCTAGGCA CACGCTGGTG CCTCAGTACG CCAGGCTGGC GTGGAAGGCC TACATAGACG GGTGGCGCCT GGGCGGCAGA CTGGCAAAGG CCGGCGCGGT GCTTGACATC TACAAGGCCC CTGTGGTGAA GCTCGACGCT GGCGACATCT ACATGGCGCT GTTTAGGCGG GCGAGCCTCC TCCGCGTGTT GGGGGAGGAG AGGCTTCTAA GGCTTATTAG AGGTGGCGTG TTGACCTACG TGGGTTGA
|
Protein sequence | MGWPRVSILW LNYNSGRFLD VVLDSLRGVA ELDYPDYELV VVDNGSTDGS NRAVREFVER WRGRGGRAKF IQLDRNLGFT GGNNVAFRAR DRESKYVVLL NNDAVPEPGS LRTLVEYLER DGRLGACQGV VVKYGDPSVV DTAGDFLDEL LRPMALFEGR RGQPLSRPIY ITYPDGSYSI YRVEAVRRAV GGERLFDDWA FAYFDDNVLG LRLWNAGYRV ISVPVVAGRH RRSATFGWAS PFQLYHAFKG KIALLRITNL RRRRLVWAFY AKVLARHTLV PQYARLAWKA YIDGWRLGGR LAKAGAVLDI YKAPVVKLDA GDIYMALFRR ASLLRVLGEE RLLRLIRGGV LTYVG
|
| |