Gene Information |
Locus tag | Tpen_1225 |
Symbol | |
ID | 4601722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1161958 |
End bp | 1164429 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639774001 |
Product | peptidase M1, membrane alanine aminopeptidase |
Protein accession | YP_920626 |
Protein GI | 119720131 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
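The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon. Both assumptions are consistent with the values listed in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Residue count for an unspliced CDS of the given length in bp."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is part of the gene but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1161958, 1164429)
print(length)                     # 2472
print(protein_length_aa(length))  # 823
```

Under these assumptions the result matches the Gene Length (2472 bp) and Protein Length (823 aa) fields of this record.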
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.921665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTACCG TTAAGACTGG GAGGGGGTTC GCCTACCCCG AGTACGAGCC GGTGTGGCCC CCCGAGCCAC CCTTCGAGCT TCTCGAGGTG AGGGCGGAGG TCGACGTCGA CTTCGAGTCG AGGACTCTGA GAGGGGTCTG CGTTAACAGG CTAAAGGTTA CCGCGCCTGC AGAGAGGGTA GTGTTCCACG CGGTGGACAT GAAGATTGAA GAGGTCCTGG TGAACGGCCG GAAGGCCGCC TACACCTACG ACGGCAGGGA GCTATCCGTC GGCGTTGAGG GAGTCGGCGC GGGCGGAGAG GTAGAAGTCT ACGTGAGGTA CGCGACCGTG GAGCCCAAGG CTGGAGCATG GTTCGTGCCT GTCGACGGGG GCAAGCCTTA CATGGTGTAC ACGCAGGGGC AACCCGAGGA CACGAGGTAC TGGCTCCCCA CGTACGACTA CCCTAACAGG AAGGCCAAGG TTTCCCTGGC GGTCACCGTG CCCAAGGGGA TAGTCGTCGT CGCCAACGGG GTCCTGGTGG GCCGAGAGGA GCTCGGGGAC AAGGAGAGGT GGAGCTTCAG GCTGGACTCG AGGATCCCGA CCTACCTCAT CGCGTTCGCG GCTGGAGACT TTTCGGTGGT AGAGGAGGAG TACGGCGGGG TGAAGCTTCA GTACGTCGTG CCGAGGGGGA GGGAGGGGGA TATCCCGAGG AGCTTCTCGC AGACGAAGGA GATGGTAAGG TTCTTCGAGG AGTTCACGGG GGTGAAGTAC CCCTACCCCA AGTACGCGCA GGTGTGCGTC GACGAGTTCG TCGCGGGAGG GATGGAGAAC GCCTCCGTGA CAATACTGAC TAGCGCGACC CTGCACGACG AGAAGGCCCA CGCCGACTTC AGGAGCGAGC CCCTGGTATC GCACGAGCTC GCCCACCAGT GGTTCGGGGA TCTGGTGACG TGCAGGGACT GGTCCCACCT CTGGCTCAAC GAGAGCTTCG CGACGCTTAT GGAGGCGCTG TGGAGGAGGC GGGAGCTCGG CGAGGAGGAG TTCGTGTACG ACCTCATAGG GATGCTGGAC TCTTACCTCT CGGAGTACGG GAAGTACGCG AGGCCCATAG TGACGCGGCT CTACAAGTAC CCCGACGAGG TCTTCGACGC ACACAGCTAC CCGAAGGGGG CGCTCGTCCT CTGGACGCTC ATGAACATAG TCGGAGAGGA GGCGTTCCGG AGGGGGGTTA AGAAGTACCT CGAGTCGAGG AGGGAGGACA ACGCAGTCAC GGACGACCTG AGGAGGGCTC TCGAGGAAGC CTCGGGGGCT AGGCTGGACT GGTTCTTCGA GCAGTACGTG TTCAACGCCG GGCACCCGGC ACTTTCCGTG TCCTACAAGT GGGTCGAGAA GGAGGGAGTC CTGGAGCTGA GGGTCTCCCA GACCCAGGGG GACGACTCGC TCCCGAGGTA CAGGGTTCCC TTGGAGGTCG AGTTCCTGGG CGAGGGGTTC CGCGAGAGGC GCACAGTGTG GGTCGAGGAG AGGCAGTCCG TCTTCTCGTT CAGGTTGCCT TCCAAGCCCA CCGCCGTCTG CGTAGACCCC TCCTTCAAGG CCTTCAAGGC CCTTAGCCTC GACCTCGGCG TGGAGGAGCT ACTATCGATA GTGAAGCACT GCCCCTACCT CTACCCGAGG GTCGCCGCCG TCAGGGAGCT AGCCAAGAAG GCCTCGCCCA CGGCTGTAGA GGAGCTGAAG GGCCTGTTGC TGTCCGAGGA CGAGTTCTGG GGGCTTAGGA GCGAGGTCGC CTCGGCAATC GGGAGCATAG GCGGGCAGGC GGCGCTCAAC 
GCCCTACTCG AAGCGCTCGA CAAGGTCAGG CACCCGAAGG TTAGGAGGGC TATCGTCAGG GCCCTGGGAG GCTACAGGGA GAAGGCCGTC GCGGAGAGGC TATCGAAGGT GCTGAAAGAC GAGTCCGAGA GCTACTACGT AAGGGCGGAG GCCGCCCTCT CAATAGCGAA GACGGGCTTC CGCGAGTACT CCGACGCCTT GAAGGAGGCC TTGAACGTTC CCTCCCACAA CCACGTGATA GCTGCCTCCG CGCTGGAGGC GCTCGCAATG CTCCTAGGCG ACGAAGTTCT CGACCTGCTG GAGAAGTACG CGTCCCCCTC GACGCCTATG CCCCTCAGGA GGGCGGCTAT AGCGTCCCTG GGCTACCTCC CGCCCAGCCA GAGGGTCCTC TCGCTACTGG AGTACTCCTC CAGGAGCAGG CACCCGCACG TGAAGCTCTC AGTGATATCC GCCTGTACGC GGCTACTCTC CCCGAGGGTG CTCCCGATAC TCGAGCGGCT ACAGGGGGAC ACCTCCGGGA GGGTCGCGAG GAGCGCGAGG GACGCGGCGG AGCAAATAAA GAAGCACATG GAGAGGGGCG AGGAGTACAG GAAGCTCAAG GAGGAGCTCG ACAAGCTGAG GGAAGAGGAG AGAAGGCTGA GCGAGCGGGT CGAGAGGCTC GAGAAGAGGT AG
|
Protein sequence | MVTVKTGRGF AYPEYEPVWP PEPPFELLEV RAEVDVDFES RTLRGVCVNR LKVTAPAERV VFHAVDMKIE EVLVNGRKAA YTYDGRELSV GVEGVGAGGE VEVYVRYATV EPKAGAWFVP VDGGKPYMVY TQGQPEDTRY WLPTYDYPNR KAKVSLAVTV PKGIVVVANG VLVGREELGD KERWSFRLDS RIPTYLIAFA AGDFSVVEEE YGGVKLQYVV PRGREGDIPR SFSQTKEMVR FFEEFTGVKY PYPKYAQVCV DEFVAGGMEN ASVTILTSAT LHDEKAHADF RSEPLVSHEL AHQWFGDLVT CRDWSHLWLN ESFATLMEAL WRRRELGEEE FVYDLIGMLD SYLSEYGKYA RPIVTRLYKY PDEVFDAHSY PKGALVLWTL MNIVGEEAFR RGVKKYLESR REDNAVTDDL RRALEEASGA RLDWFFEQYV FNAGHPALSV SYKWVEKEGV LELRVSQTQG DDSLPRYRVP LEVEFLGEGF RERRTVWVEE RQSVFSFRLP SKPTAVCVDP SFKAFKALSL DLGVEELLSI VKHCPYLYPR VAAVRELAKK ASPTAVEELK GLLLSEDEFW GLRSEVASAI GSIGGQAALN ALLEALDKVR HPKVRRAIVR ALGGYREKAV AERLSKVLKD ESESYYVRAE AALSIAKTGF REYSDALKEA LNVPSHNHVI AASALEALAM LLGDEVLDLL EKYASPSTPM PLRRAAIASL GYLPPSQRVL SLLEYSSRSR HPHVKLSVIS ACTRLLSPRV LPILERLQGD TSGRVARSAR DAAEQIKKHM ERGEEYRKLK EELDKLREEE RRLSERVERL EKR
|
| |
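The relationship between the gene sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, the difference does not matter here. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the code assumes the input contains only A, C, G, and T once whitespace is removed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two displayed blocks of the gene sequence in this record.
print(translate("ATGGTTACCG TTAAGACTGG"))  # MVTVKT
# Last two displayed blocks, which end in the TAG stop codon.
print(translate("GAGAAGAGGT AG"))          # EKR
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces), or passing it to `gc_percent` and comparing with the GC content field, gives a quick consistency check on the record.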