Gene Information |
Locus tag | Tneu_1094 |
Symbol | |
ID | 6165518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 979202 |
End bp | 980446 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641668246 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001794471 |
Protein GI | 171185552 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
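
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record: Start bp 979202, End bp 980446.
length_bp = gene_length(979202, 980446)
print(length_bp)                  # 1245
print(protein_length(length_bp))  # 414
```

Both results agree with the Gene Length (1245 bp) and Protein Length (414 aa) fields.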
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000467698 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00190333 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGTCGGC GGGGGGGCTC GCGGCCCGTG TTGCGGCGTC GGGGGTTGGG GATTGGCGTT GCCAAGTTTA TATATGGCGT GGTTTGGGGG TGCGTGTTGC TGGATAACGG GGTGAGGCTT GTGTTGGATA GGTTTGCCGC GCCTACGGCG GCTGTGGTGG TGGGGGTCGG CGTTGGGTCG CTGTTTGAGG AGAGGGGGCG GAGGGGGATT ACCCACCTGC TGGAGCATAT GCTGTTCCGG GTGCCTGGGT TCGACGTGGA TGAGGCTGTG GAGTCGCTGG GGGGGTCCAA CAACGCCTAT ACGGAGCGGG ATGTCCTCCT CCTGGTTTTG GAGGGGGTTT CCGAGTCGGC GGCTGGGCTG GTGGAGCTGG CCTTCCGGCT GTACGCCAAC GAGCGGTTTG ATGAGGCGGA TCTGGAGCGG GAGAGGGACG TGGTGCTTTC TGAGCTTAGG CAGGTTAGGG AGGACCCCTC GGACTGGGTT GGGGAGCTGG GGGTTAGGGC CCTCTTCGGC GACTCTGACT GGGGGGACCC GGTGGGGGGC ACGCCGGAGG CTGTGGAGTC CATCTCCCTG GGGGACCTCC TGGAGTTTAA GCGGAGGTGG TTCACGCCGG GGAACACCTT CGTGGTGCTG TCGGGGGGGT TTGGGGAGGA GGCCGTCGCG AAGGCGGTGG AGCTCTTTGG GGGGCTTGAG GGGGAGGCCC CGCCTAGGCC GAGGCCCACG GCGGGGTCGG GCCCGGGGAG GATCGTGGAG AGGCGGGAGG TCGACGGGGT GTACTACGCC AGGGCTGTGA GGGTGGCGGT TGGGGACCCC GCGGCGGCTT TCGCCGCGCT TCACGGGGCG GCTTTCCACC TGGAGTCTGG GACCAAGTCG GTCCTCTTTA ACCTCCTGAG GACGCCGGGC ATCGCCTACT CCTACTACGT GGACTACGAC GTGGTGGGGG ACGTGGCTTA TCTCGAGGTG GTGGTGGAGT CGGCGAGGTC TCTGGAGGAG GCGAGGAGGG CCGTGTCGGA GGCTTTGAAG CCCAGGTCTC CTCCCCCCTA CAGGCTTAGG TACTTCGACT ACGTCTGGAG GGTGGCGTGG AGGAGCCCGG CGAATAGGGC CGTCTCCATC GGGGAGTACG TGGCGAAGGG GGGGAGGCCG GAGGATGCCG AGGCCGCGTT TAGGAGGGCG GCGGAGGGGG GCACCCTGTG GGTTACGCCG CTGGCTGAGG CCGAGGCGGT GGTGGGGCCC GAAGAGCTTA TATAG
|
Protein sequence | MGRRGGSRPV LRRRGLGIGV AKFIYGVVWG CVLLDNGVRL VLDRFAAPTA AVVVGVGVGS LFEERGRRGI THLLEHMLFR VPGFDVDEAV ESLGGSNNAY TERDVLLLVL EGVSESAAGL VELAFRLYAN ERFDEADLER ERDVVLSELR QVREDPSDWV GELGVRALFG DSDWGDPVGG TPEAVESISL GDLLEFKRRW FTPGNTFVVL SGGFGEEAVA KAVELFGGLE GEAPPRPRPT AGSGPGRIVE RREVDGVYYA RAVRVAVGDP AAAFAALHGA AFHLESGTKS VLFNLLRTPG IAYSYYVDYD VVGDVAYLEV VVESARSLEE ARRAVSEALK PRSPPPYRLR YFDYVWRVAW RSPANRAVSI GEYVAKGGRP EDAEAAFRRA AEGGTLWVTP LAEAEAVVGP EELI
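
The sequence fields are printed in space-separated blocks of ten characters, and the gene sequence is listed in coding orientation (it begins with the GTG start codon and ends with TAG, even though the gene lies on the minus strand). The sketch below shows one way to strip the block formatting, compute a GC fraction, and translate the CDS under translation table 11. It assumes the NCBI table 11 codon assignments and its set of alternative initiator codons, each read as methionine in the first position; the helper names `clean_sequence`, `gc_content`, and `translate_cds` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for table 11; any of them codes for Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for block formatting; upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(raw: str) -> str:
    """Translate a CDS given in coding orientation, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First five codons of the gene sequence above, followed by its stop codon.
print(translate_cds("GTGGGTCGGC GGGGG TAG"))
```

Applied to the full Gene sequence field, `translate_cds` gives a string that can be compared with the Protein sequence field after `clean_sequence`, and `gc_content` gives a fraction that can be compared with the GC content field.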