Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1879 |
Symbol | |
ID | 6165971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1655978 |
End bp | 1656847 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641669041 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001795240 |
Protein GI | 171186321 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000550299 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.675265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTGA GCGGCTTAGT GGAAAAGGTT GCCCGCTCTG TCGTGGGCGT TGCGGCGAGG GGGGTTGGGG CTGTCGGCGA GGGCTTCGGC TCCGCCTTTG CCATAGACCG GGGGGTCTAC GCCACTGCAT ACCACGTCGT GGCGCAGGCG GGTGAGGTGG CGTTGATCAC CCCCGAGGGG GAGGTGGCTG ACGCCGTGGT GGCGGCGGCG GACCCCGCCG AGGATCTGGC CATACTCTAC TCCGACCTCT ACGCCGTTCC GCTGTCACTA GGGAGCGCGC TGAGGCTGAG GGTCGGGCAG GGGGTAGTCG CCGTGGGCTT CCCCCTAGCC CTCCTTGACA AGCCCACTGC GACCTTCGGC ATCGTAAGCG CTGTGGGGAG GAGCTTGAGG GCTGGCGATA GGTTTTTCGA ATACCTCGTC CAGACAGACG CGGCGATCAA CCCCGGCAAC TCGGGCGGCC CGCTCGTGAA CCTCTCCGGG GAGGCGGTGG GGGTCTGCTC GGCCGTAATC GCCGGGGCCC AGGGCCTGGG CTTCGCGGTG CCTATAGACC TAGTCAGAAT CATGTACCAG ATGGTGAAGA GATACGGGAG ATACGTAAGG CCGGCGCTCG GGGTATACGT CGTTGCGTTG AACAAAGCTC TGAAGGCCCT ATACGGCCTC CCCACAGACA GAGGGCTTCT CGTCGTTGAC GTCGTGCCAA GCTCGCCGGC CGAAGAGATG GGCATCGCCC GAGGCGACAT CTTAACCAAG GTCGACGGCC GCGAGGTGGC CAACGTCTTC GAACTCCGCC TGTTGATAGG CGAAGCGCTG ATCCAGGGCA GAACCCCCAG GATAGAGGTC ATCAGAGGCG GAAGGAGGAT AGAGCTCTAA
|
Protein sequence | MDLSGLVEKV ARSVVGVAAR GVGAVGEGFG SAFAIDRGVY ATAYHVVAQA GEVALITPEG EVADAVVAAA DPAEDLAILY SDLYAVPLSL GSALRLRVGQ GVVAVGFPLA LLDKPTATFG IVSAVGRSLR AGDRFFEYLV QTDAAINPGN SGGPLVNLSG EAVGVCSAVI AGAQGLGFAV PIDLVRIMYQ MVKRYGRYVR PALGVYVVAL NKALKALYGL PTDRGLLVVD VVPSSPAEEM GIARGDILTK VDGREVANVF ELRLLIGEAL IQGRTPRIEV IRGGRRIEL
|
| |