Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1937 |
Symbol | |
ID | 6164706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1709033 |
End bp | 1710073 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641669100 |
Product | peptidase M24 |
Protein accession | YP_001795298 |
Protein GI | 171186379 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.469872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAACG TGGGCAAGCT CGTAAAAGCC TTCGCAGGCC GGTACAGATA CCTCATCTTA ACCAGGTCGC CAAATCTGGC ATACGCGGTG GGGATGCCCG ACGCCCTAGG CCTAGTCCTA GATCTTGAGA CAGGCCACTC TACCCTCTAC GTATCTCGGC TGGACTACGC ACGTGCAAGC GCCTTCGCCG AGGTTGACAA GGTTGTGGCG GTGGCCTCTG CCGAGATACC GCCCCGGAGA CCGGGCGAGG AGCTCGTCGT CGCCTCTAGC CTATCAGACG TCCTAAGACA GGTCTCGCAG GGGGGCGCCG CCGCGTCGGA TAACAAAGAG CTTGGGGTGG ACGTAAGCGC CGAGATCGCG GAGCTTAGGG CGGCCAAGGA GGAGTGGGAG GTGGAGATGA TGAGGGAGGC GCTTAAAATC GCCGAAGGCG CATACGTCAA ACTAGCCGAG CTGAGGCTCA TAGGCATGAG GGAGCGGGAC GTGGCCGCCC TCATATATAA ATGGTTCCTA GAGGAGGGGG CAGACGGAGT CGCCTTCGAC CCAATCGTCG CGTCGGGGCC AAACGGCGCG TATCCGCACT ACAGATTCGG CGACAGGAAG ATAGCCTACG GCGACTACGT TGTGGTAGAC ATCGGCGCGA AGAGGGGGGT CTACTGCTCA GACATCACCA GGACCTTGGC GGTGGGGCAG GGAGGCGCGT TGAGAGATGC CGTGTACGCC GTATATGAGG CCGTCAAAGC CGCTGAAAAA GTTGCGAGGG AGGGGGCGGC TGCCGCCGAG GTGGACAAGG CGGCGCGGGA CGTCATCGCC GAGTACGGCT TCGGCCAATA CTTCATACAC TCGACGGGAC ACGGCGTCGG GGTGGAGGTG CACGAGCCGC CTAGGCTATA CGCGGCCTCG CGAGATGTGC TCAAGAGGGG GCACGTGGTG ACGATAGAGC CGGGCGTCTA CATAGAGGGG GTGGGGGGCG TGAGGATAGA GGACATGGTA TACATCAACG GCGGAGCCGC GGTCCTAAAC AGGCTACCCC ACATCCTCTA G
|
Protein sequence | MNNVGKLVKA FAGRYRYLIL TRSPNLAYAV GMPDALGLVL DLETGHSTLY VSRLDYARAS AFAEVDKVVA VASAEIPPRR PGEELVVASS LSDVLRQVSQ GGAAASDNKE LGVDVSAEIA ELRAAKEEWE VEMMREALKI AEGAYVKLAE LRLIGMRERD VAALIYKWFL EEGADGVAFD PIVASGPNGA YPHYRFGDRK IAYGDYVVVD IGAKRGVYCS DITRTLAVGQ GGALRDAVYA VYEAVKAAEK VAREGAAAAE VDKAARDVIA EYGFGQYFIH STGHGVGVEV HEPPRLYAAS RDVLKRGHVV TIEPGVYIEG VGGVRIEDMV YINGGAAVLN RLPHIL
|
| |