Gene Information |
Locus tag | Tneu_0602 |
Symbol | |
ID | 6165530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 548111 |
End bp | 549061 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641667753 |
Product | peptidase M42 family protein |
Protein accession | YP_001793987 |
Protein GI | 171185068 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
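
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(548111, 549061)
print(length_bp, protein_length(length_bp))  # 951 316
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.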
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCTAG AGGAGCTGAC CAACGCCCTG GGGGTCTCCG GCTTTGAGGA GGAGGTGCGC CGTAGGATAC TCTCCGCCGT CCCCAACGCC GAGGCGGACG ACTTCGGAAA CCTCCTAGCC GTGGACAAGT CCAGGGTTGC CTTCGTGGCG CACATGGACG AGGTGGGGCT CCTCGTGACG TCCATAGAGG AAGACGGGAG GATGAGGTTT AGGAAGGTGG GAGGCGTTGA CGACAGGATC CTACCCGGCT CCTCGGTGGT CCTATACGGA GATGGCTTCA AGGTGGAGGG GGTTATAGGC ATCGCGCCCC CCCACTTCCA GCAACAACAG AGCCAGATCT CCTGGCAGGA CCTGTATATA GACGTGGGAG CCGCCGGGAG GGCGGAGGTG GAGTCCATGG GGATCGGCCC CATGACCCCC GCCGCCTTCT CCAGGCGGTA CGCCGAGGTG GGCAGGTATA TATCGGCAAC GGCGTTAGAC GACAGAGTGG GGTGCTGGGC CTTGCTGGAG GCCTACCGGA GGGGAGCCCA GGCGACATAC GTATGGAGCG TCCAAGAAGA GCTGGGCCTC CTCGGCGCGC GCGCCCTCTC GAAGAGGCTG GAAGGGAAGT ACGCCGTCGT GGTAGACACG ACCTCCTGTT GCCACCCCAA CTTCACAGGC TCCGCCAAGC CCGGACAGGG GCCCGTGCTG AGGATCTTCG ACAACTACGG CGCCTACAAC AACAAGCTGG CGAAGAGGAT ACTCGAGATA GCCAAGAGGC GGGGTATACC GATCCAGATA AGCGGAGGCG GAGGGGGGAC AGACGCAGCC GCGTTTTTCG TCTCAGGCAT ACCAGCGGTC GCCATCGGGA TACTCAGCAA GTACTCCCAC TCGCCCGTAG AAATGGTTCA CAAAGACGAC CTCAAACACG CCGTGGAGCT CCTCGTGGCA ATATCCGAGG ACCTACGCTG A
Protein sequence | MSLEELTNAL GVSGFEEEVR RRILSAVPNA EADDFGNLLA VDKSRVAFVA HMDEVGLLVT SIEEDGRMRF RKVGGVDDRI LPGSSVVLYG DGFKVEGVIG IAPPHFQQQQ SQISWQDLYI DVGAAGRAEV ESMGIGPMTP AAFSRRYAEV GRYISATALD DRVGCWALLE AYRRGAQATY VWSVQEELGL LGARALSKRL EGKYAVVVDT TSCCHPNFTG SAKPGQGPVL RIFDNYGAYN NKLAKRILEI AKRRGIPIQI SGGGGGTDAA AFFVSGIPAV AIGILSKYSH SPVEMVHKDD LKHAVELLVA ISEDLR
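
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is already given in the coding (sense) orientation, which is consistent with it starting with ATG and ending with TGA even though the gene lies on the minus strand, and it uses the standard amino-acid assignments, which translation table 11 shares with the standard code. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for all 64 codons, listed in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGAGCCTAG AGGAGCTGAC"))  # MSLEEL
```

The first six codons translate to MSLEEL, matching the start of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked the same way.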