Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0873 |
Symbol | |
ID | 6164525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 778466 |
End bp | 780118 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641668029 |
Product | peptidase S16 lon domain-containing protein |
Protein accession | YP_001794256 |
Protein GI | 171185337 |
COG category | [R] General function prediction only |
COG ID | [COG1750] Archaeal serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAGGA AGGCTCTGGC GTTGATCCTC CTAGCCGCCC TGGCGCTGGC CGCGTGGCAG ACCTACACGG TGAAGGTGGC CTCGGCGGAG ATAAACGCGC TGGCCGTTGG GCCCTCCGGA GGCGCCGTCT TGCCCATCAA GATCACCCTC ATCACGCCGG GGGACGGGAG GGCGTACGTG GCAGGGGTCC CAGAGGCTGG CCAGGGCTTC GGCCCCTCGG CGCAGATTGC GCTCTACGTC GCGTCTAGGT ACTCGGGCAA GCCCTACACG AACTACACCG CCCTGCTGAG GGTGTTGGCC AGCGACGCCC AGGTGGGAGG CCCCTCGGCC AGCGGCTACA TAACAGTGGC CATGTACGCC CTCATGAACG GCCTGGAGCT TAGAAACGAC ACAGCGATGA CGGGTATAAT ACTGCCCGAC GGGCTTATAG GGCCGGTGGG CGGGGTCTCC CAGAAGGTGT CGGCGGCCGC GGAGAGGGGG ATAAAGACCG TGTTGGTCCC CATTGGCGAG AAGCCGAGCG CCGTGCGCGG CGTTAAGGTG GTGGAGGTGG GCACGGTGGA AGACGCCATC TACTACCTCA CTGGGCATAG GGTCGAGACG CCGCGGCCTT CGGCGGTGGA CGACACGGCG TTTAGGGAAA TATCGCGGGA CCTCTTCAAC GCCGTCTACA GCTACTACAA CGCGACGGTG GGGAGGGGGT ACGTAGACGA GGCCTTGATC GATAGGCTGA AGAGCCGGGG CGACTACTAC GCCGCCGCCT CGTTGATATA CCAGGGGATC GTGAGGTACT ACAGCGACCA AGCCTCCTCG TCTAGGAGAG CCGCCAGGGA GCTCTACGAC AAGGCCCTCC AGCTGGCGAA GGAGGCCGAG GCGGAGCTCT CCAAGATCCC CACGACCGTC AACAACCTAG ACCTCGTGGT GGCGGCCTAC ACCAGGGTAT ACGAGGTCTA TCTCCAGGCC AACTCCACCT CGGCCAACCC AGGCGCTATG TACGCCCGGG CCGTCACCCT CAAGCCTTGG GTCGACGAGG CGAGGAGGAT GGCCTACGGC CCCGCCGTAA ACGAGAGCAA GCTGGCGGAG ATCGCGAGGA TGTACCTAGA CTACGCCAAG GCCATGTACG CCTATCTGGA GACCACCTCT GACGTTCCCC TAGGCGATTA CTCCACAGCA GTGCAGCTGG CGGAGGACCT CTACGGGAGA GGTCTCTACC TGGCCTCCAT TGCCAACTCC ATAGAGATAA TCGCAGAGTC CGCCGCATCT CTAATGTCGG CTGCCCCCGA GAAGTACCTG GAGGTGGCCA GGGAGAGGGC TCTCACCAAC ATGGCCCGCG CCGCCCAGTG CGGCTACACG AACACGTTGC CGCTGAGCTA TCTACAGTTC GGCGACTACT ACAGCCAGCA GCCCGACGGC GTCAAGTACG CGTTGATGTA CTACATCACA GCCTCCGTCT ACTCGACGGC GATGGGCGAC GCGGCTTGCT TCGCGAAAAG CGGCGTGGTC TACCAAAAGC CAAGCTTCGC CCCGCCTGAG CCGGCGGCGC CGGCGGCTCA TACCGCCGCC GTTGCTAGGC AGGAGGGGGG AAAAAGCCTG TGGTTGCCGC TTGTCCTCGC ACTACTGGCG GCCCTCGCCC TGGTCTACAG CGCTAGACGG TAA
|
Protein sequence | MVRKALALIL LAALALAAWQ TYTVKVASAE INALAVGPSG GAVLPIKITL ITPGDGRAYV AGVPEAGQGF GPSAQIALYV ASRYSGKPYT NYTALLRVLA SDAQVGGPSA SGYITVAMYA LMNGLELRND TAMTGIILPD GLIGPVGGVS QKVSAAAERG IKTVLVPIGE KPSAVRGVKV VEVGTVEDAI YYLTGHRVET PRPSAVDDTA FREISRDLFN AVYSYYNATV GRGYVDEALI DRLKSRGDYY AAASLIYQGI VRYYSDQASS SRRAARELYD KALQLAKEAE AELSKIPTTV NNLDLVVAAY TRVYEVYLQA NSTSANPGAM YARAVTLKPW VDEARRMAYG PAVNESKLAE IARMYLDYAK AMYAYLETTS DVPLGDYSTA VQLAEDLYGR GLYLASIANS IEIIAESAAS LMSAAPEKYL EVARERALTN MARAAQCGYT NTLPLSYLQF GDYYSQQPDG VKYALMYYIT ASVYSTAMGD AACFAKSGVV YQKPSFAPPE PAAPAAHTAA VARQEGGKSL WLPLVLALLA ALALVYSARR
|
| |