Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0666 |
Symbol | |
ID | 6165303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 597775 |
End bp | 600249 |
Gene Length | 2475 bp |
Protein Length | 824 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641667819 |
Product | peptidase M1 membrane alanine aminopeptidase |
Protein accession | YP_001794051 |
Protein GI | 171185132 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACC TAGTGGGGAG GGACTTCGCC TTTCCCGAGT ACACGCCGAG GTTCCCCCCG TTTTACGAGT TCGAGGTTCT CCACATGCGG CTAGATGTGT CAATAGACGT GGCCAACAGA TCAGTCGACG GCCTCGTAAA ATACAGGCTG AGACCTAGGA AAGACGGCGC TAGAGTTGTG CTAGACGCCG TTGAGATGGA CGTGTTAAGC GTAAGCCACG ACTTCTACTA CGACGGGGAG AAGATAGAGA TTAAGCCGCA GTGGAAAGCC GGCGAGGTTG TTGAAGTGGC GGTGAAATAC AGAGCTAGGC CTCGGGCTGG GATATACTTC GTCACCCCCC ACGGCCAGAG GAGGGGCGTC TACGTGTGGA CCCAGGGGGA GAGCGAGTAC AACCGCTACT GGGTCCCGTT GCCCGACTCC CCCAACATAA AGTTTCCTTG GACCGTGGCG GTGACGGTGC CTAAGCCCCT CGTCGCTGGG AGCAACGGGC TGTTGGTTGA GGTCAAGGAG GGGGAGGACA GCCGGACCTA CGTCTGGGAG ATGAGACACC CCATGTCTCC ATATCTCTTG GCCATAGCCG CCGGCGACTT CGAGATATAT AGCGAGAAGT GCGGCGAGGT TTTGCTGGAG TACTACATCC CGAGGTACGT CGGCGGCGAG TGGCGCCACA GCTTCTACAA CACCTGCGAG ATAATGCGGT TCTTCTCTGA GTACCTCGGC GTTCCCTACC CCTACGAGCG ATATGCCCAG GTGGTGGTGC CGGAGTTCAT ATACGGAGGG ATGGAGAACA CGACGTTTAC CATACTGACG GACTGGACTA TACACGACAA ACACGCCCAC TGCCCCTACG GCGAGTTCCC CTGCCCCGGC CAGGAGGACT TCTCCTCCGA TCCGCTGGTT GCCCACGAGA TGGCCCACAT GTGGTTTGGC GATTTAGTCA CGGCAAAAGA CTGGGGGCAC ATAGCGATAA ACGAGTCCTT CGCGACCTTC ATAGAGGCGC TGTGGACCGA GAGGTCCAAG GGCCGGGAGG AGTACCTCTA CGAGATCTAC ACAAACTTCA GGACGTATCT GGGCGAGTAC TCCCGCCGCT ATTCGAGACC CATCGTCACC AACGTCTACA AGATCCCCGA CGAGGTGTTC GATAGACATG CCTACGAGAA GGGGTCCGTC GTCCTCCACA TGCTGAGGAG CCTCCTCGGC GACGACGACT TTAGGAGGGG GCTGAAGGTG TTTCTAGAGC GGAATAGGTA TAGGGCTGTC GATATTGAGG ATCTTCGGAA GGCCCTGGAG GAGGCGTCGG GGAGAGATCT GGAGTGGTTC TGGAGGCAGT TCTTCTACGC AGCGGGCCAC CCGGTGTTGA AGATCTCGTG GAGCTATTCA GACGGCGTTC TGAGGTTGCA GATCAAGCAG AGCCAGGGAG AGGATAGCTA CCCCGTGTAC ACCATCCCGC TTGAGGTTAA GGTCGTGTAT GAAGACGGGA GGAGGGAGGT GAGGGAGCTT CAGCTGGGCG AGAGGGAGGT GGCTTTACAG ATCCCGGGGG GCAAGCCCAG GTACGTATGT GTGGACCCCG CCTTCAAGGT CATGAAGGCG CTTGACCTAC AGTATCCTCT GGAGTCCGCT ATAGCTATGT TAAACGACGA GGATCTCTAC TGCAGGTTGC AGGCCGTAGA GGCGCTTAAG AGAAACGGGA GCCCTAAGGC TGTGGAGGCG CTGGCGAAGG CGCTTGGCGA TGGGTTCTGG GGGGTGGCTG CTGAGGCCGC GCGGGCTCTG GGCGAGATAG GCACGGGGGA GGCCGTGTAT AAGCTCATTG AGGGGTACTG GAGGGCGCGG CACCCCAGGG TTAGGCGCGC CATTGTGGAG GCTCTTGGCA ATGCCAGACG GAAGGAGGCC GCCGAGTTCC TCGACAAAGT TCTCCACAGC GCCGAGGAGA GCTACTACGT GAGGGCTGAG GCCGCGCGGG CGCTTGGCAG AATCAGGTGG GAGTACGCCG AACACAGCCT AAGGAAGGCG CTGGAGTACA GTAGCCATCT AGATGTGATA AAGAGGGGGG CGCTGGAGGG GTTGGCCGAG CTCGGCTCTG ACGACGCGCT AAGGGTGGTC CTTAGGCATA CCGAGTCCGA CATGCCGACG TATGTGAGAG CCTCCGCCGT GCAGTCTCTC GCAAAGTTCG GCCCACGTAG GGAGGTTCTC GACGCCGTTA AGGCCGCCCT CCGCGACGAG AACTTCAGGG TGAGATACGC GGCTGTGACC GCGGCTCTTG AGCTCCTGGA CCACCGCCTC ATACCTGACC TCCAGGAGCG CATGGAGAGA GACGTAGACG GCAGGATTAG GCGCGTAGCC AGGGAGGTGG TCGAGAGGAT TAGGAGGGCA ATGGAGAGGG GCGCCGAATA CCAGAAGCTT AGGGAGGAGG TGGAGAAGCT ACGGGAGGAG TACAGGAAGC TGCTCGACCG CGTAGGGAAA TCCTATGCCG GTTAA
|
Protein sequence | MKYLVGRDFA FPEYTPRFPP FYEFEVLHMR LDVSIDVANR SVDGLVKYRL RPRKDGARVV LDAVEMDVLS VSHDFYYDGE KIEIKPQWKA GEVVEVAVKY RARPRAGIYF VTPHGQRRGV YVWTQGESEY NRYWVPLPDS PNIKFPWTVA VTVPKPLVAG SNGLLVEVKE GEDSRTYVWE MRHPMSPYLL AIAAGDFEIY SEKCGEVLLE YYIPRYVGGE WRHSFYNTCE IMRFFSEYLG VPYPYERYAQ VVVPEFIYGG MENTTFTILT DWTIHDKHAH CPYGEFPCPG QEDFSSDPLV AHEMAHMWFG DLVTAKDWGH IAINESFATF IEALWTERSK GREEYLYEIY TNFRTYLGEY SRRYSRPIVT NVYKIPDEVF DRHAYEKGSV VLHMLRSLLG DDDFRRGLKV FLERNRYRAV DIEDLRKALE EASGRDLEWF WRQFFYAAGH PVLKISWSYS DGVLRLQIKQ SQGEDSYPVY TIPLEVKVVY EDGRREVREL QLGEREVALQ IPGGKPRYVC VDPAFKVMKA LDLQYPLESA IAMLNDEDLY CRLQAVEALK RNGSPKAVEA LAKALGDGFW GVAAEAARAL GEIGTGEAVY KLIEGYWRAR HPRVRRAIVE ALGNARRKEA AEFLDKVLHS AEESYYVRAE AARALGRIRW EYAEHSLRKA LEYSSHLDVI KRGALEGLAE LGSDDALRVV LRHTESDMPT YVRASAVQSL AKFGPRREVL DAVKAALRDE NFRVRYAAVT AALELLDHRL IPDLQERMER DVDGRIRRVA REVVERIRRA MERGAEYQKL REEVEKLREE YRKLLDRVGK SYAG
|
| |