Gene Information |
Locus tag | Tneu_1921 |
Symbol | |
ID | 6164869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1693149 |
End bp | 1694330 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641669084 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_001795282 |
Protein GI | 171186363 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
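The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table.
start_bp = 1693149
end_bp = 1694330
gene_length_bp = 1182
protein_length_aa = 393


def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the table is internally consistent.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the listed gene length and protein length agree with the coordinates.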
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.705743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTGT GGGCTCTGCT ACCGGTTACG TTGGCTTTGG TAGTGTTGCT GATCTTCATC GTGTCTCTTA AGGAGGCGTC CACGCACTCT CCTCCCACGT CTCACACATC TGCTCCTGCG CCTCTGGAGA GCGGTAGCTA TGGCGACTTC GCCGTCAGTT TCTATAAGAG AATAGCCTTG GAGAGACTTC GCGAAAACCT TGTCCTCTCG CCGTATTCCG TGTATAAAGC CTTTGCCATG GCCTACGCGG GCGCGGCTGG GGAGACTAGG GAGGAGATCG GGAGGGTGTT CGGCTTCGGC GACGACGTCT GCGCCTTGGC TCAGGCTGGG CGGGGGGTCG AGGAGGCGGT GGCCGCGTGG GTGCAACTGG GATTCCCGCT GAGGGAGGAG TACGAGCGGG AGCTGTCTTG TCTAGGGGCG GAGCTGAGGC GGGTGGATTT CGCGGAGCGC TCCGCGCTGG TCGAGATAAA CAGATGGATA GAGGAAAAGA CCAGGGGATA CGTAAAGGAC TTGATCCCTA CTGACTACCC GCGTAGCCAG GATATACGCG TTGTATTGAC TTCAGCTCTG TTTTTCAACG GTAGTTGGTG GCCGCTCCAG TTTGGGCGTA TTGGGAAGAG AGAATTCCAG GGGGTAGGCC CGGTGGAGTT TATGAAGCTT GACCTTGGGT CTTGCGTCCC CTCGCTCAGG GGGCGCGTCT CTGCCGATCT CACGGTGGTG GAGCTCCGGT TTGAAAATAC AGATGTAGCT ATGTACGTCA TAATGCCAAA GTCGCTTGAG GACTATGTAA AAGGCTTGAC CTATGAGAAG TTGAAGAGAG ACATCACCGA ATTGCCGGAC GAGATCGTCG CCGTAACGAT GCCTCTATTT ACCGCAGAGT TTAAAGACTC TCTCAAAAAG GTATTAATCG ACATAGGCAT ACGCTCAGCA TTTGACAAAA CCAGAGCAAA CTTCACCCGG ATGTCGCCTA TCAGGATTTA TATAGACGAA GTTTTCCACG GCGCGTACAT AAAAGCTGAT GAAAAAGGTG TTGTAGCAAC AGCCGCCACT GCCGTCGTAT TTGTGCCCGT TTGTGCTAAG GTTGGAGGCA TGGAGGTGGT AATAGACAGG CCCTTCCTCT TCGTCTTGGC GGATCGGAGA GACGGCACGA TCTACTTCAT AGGGCATGTG GTTAAGCCCT AG
Protein sequence | MRLWALLPVT LALVVLLIFI VSLKEASTHS PPTSHTSAPA PLESGSYGDF AVSFYKRIAL ERLRENLVLS PYSVYKAFAM AYAGAAGETR EEIGRVFGFG DDVCALAQAG RGVEEAVAAW VQLGFPLREE YERELSCLGA ELRRVDFAER SALVEINRWI EEKTRGYVKD LIPTDYPRSQ DIRVVLTSAL FFNGSWWPLQ FGRIGKREFQ GVGPVEFMKL DLGSCVPSLR GRVSADLTVV ELRFENTDVA MYVIMPKSLE DYVKGLTYEK LKRDITELPD EIVAVTMPLF TAEFKDSLKK VLIDIGIRSA FDKTRANFTR MSPIRIYIDE VFHGAYIKAD EKGVVATAAT AVVFVPVCAK VGGMEVVIDR PFLFVLADRR DGTIYFIGHV VKP
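The relationship between the two sequence rows can be sketched in code. This is a minimal, hedged example, assuming the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the blanks every ten characters are display formatting only. The codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons), so a standard codon table is built here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the record are used to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the display whitespace and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base groups of the gene sequence row.
gene_prefix = "ATGAGGCTGT GGGCTCTGCT ACCGGTTACG"
# First ten residues of the protein sequence row.
protein_prefix = "MRLWALLPVT"

print(translate(gene_prefix) == protein_prefix)
```

Passing the full gene sequence row to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence row and the listed GC content.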