Gene Information |
Locus tag | Tneu_0240 |
Symbol | |
ID | 6166045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 212351 |
End bp | 214069 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641667403 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001793639 |
Protein GI | 171184720 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
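The coordinate, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the helper name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the final codon is a stop codon that is not counted
    in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = cds_lengths(212351, 214069)
print(gene_len, protein_len)  # 1719 572
```

The strand field does not affect the lengths; it only determines which strand of the replicon carries the coding sequence.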
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.393632 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTTC TTGCGAAAAG GGCTCTCTCG GTAAGATCCG CCTATGCGCC CCGCCTGGGG CCCCGCAGGG AGATCTACTA CATCAGCGAT ATTACCGGCG TTCCCCAGCT CTGGAGGTTT GATGGACATG TCCACGACAT GGTGCTCCCC TGGGAGGAAA GGGTGTCGGA GTACCGGGTT GCAGATGACG GCACTCTCGC GTTCACCAGC GACGTGGCGG GCGACGAAAG GTGGCGTCTC TACGTGCTTG AGGGCGAGGA GGCGGCGCCC GTCTCCGCTG AGGGGGTTAA CAGCCTGGGG GCTTGGTCGC CCGACTCGCA GAAGCTGGCT TTCACCTCCA CCAGGGATGA CCAGCAGAAC TTCCACCTAT ACCTCTACGA CAGAGCTACG AGGTCTATCG CCAAGCTCGC CGAGATACCC GGCATCAACG TCGTGGAGGA GTGGTCGGAG GTGGGTCTCT TCGTCACGCA CTATGAGACA AACCTCGAAA GCGCCATACT GCTCTACAGA GGCGGGGAGG TGCGGGAGTT GACCAAGAGG AGCCCGGACA CCATGAGCCT CTCCCCTAGA TACATCGGAG GGGGGAAGCT ACTCTATCTA ACAAACGAGG GGTGGGAACA CATGGGGGTG GCCCAGATGG ATCTGACGAC CGGCTCCTGG AAATACCTAA TTCAGCTGGA CAGAGACGTC GAGTTCTTCG ACGTGTGGGG GAGCTACCTC GTCTTTTCTC TAAACGAGGA GGGGTCGTCG GGTCTCTACA TGATGCACAT GCCCTCGGGC CTAACCCACA AGATAGCTAC GCCGAGGGGC GTCGTGACAT CTCTACAGTA CAGAGAGGGG CTCATCCTTT TCTCCCTCTC CAGCATAAAC AGAGGCCATG AGGTCTACAT CCACCAGGGA GGGGCACTGA GGCAACTCAC GAGATCCCCC AGGTTCGGCC TACAGCTGGA GTCCCTACCC GACCCGGAGT CCGTGTGGTA CGTGAGCCAC GACGGTAGGA AGATCCAGGC TAACATCTAC AAGCCGCCGG GCGCGGCTAG GGGAGTCGTC GTCTACCTCC ACGGAGGCCC CGAGAGCCAA GATAGGCCGG AGCTCAAGCC GCTGGTGCTG GCCTTGCTCA TGTCGGGCTT CGTCGTGGCG GCGCCTAACT ACAGAGGGAG CGCCGGCTTC GGAAAAAGCT TCCTCCGCCT CGACGACTTA GACAAGAGGT GGGACGCGAT AAAGGACGTC GAAGCCTTCG CCAGGTGGCT CACCGCTGAG GGCATCGCTA AGGCGAAGCC CTGCGTAATG GGGGGGTCCT ACGGCGGCTA CCTCACGTTG ATGGCCCTCG CCACAGCCCC CGACCTCTGG GCATGCGGCG TGGAGATAGC CGGCATCTTC AACCTGGTGA CGTTTCTGGA GAGGACCGCG CCTTGGAGGA GGAGGTACAG AGAGGCGGAA TACGGATCTC TCGACAGACA TCGCGATCTC TTGCTCCAGC TGAGCCCAGC GACGTACGTG GACAAAATCA CGGCCCCCCT CCTAGCCGTC CACGGCGCAA ACGACATACG CGTGCCCATC CACGAAGCCG AGCAGCTGGC CAAGAGGCTG GGGGAGCTGG GGAGAGAGGT TAAACTCTTG GTGCTCCCCG ACGAGGGCCA CGTCATTACA AAGGTGGAAA ACCGGGTCAA GGTCTACACG GAGGTTTTGA AGTTCGTAGA ACGGCACCAG GTTTATTAA
|
Protein sequence | MSLLAKRALS VRSAYAPRLG PRREIYYISD ITGVPQLWRF DGHVHDMVLP WEERVSEYRV ADDGTLAFTS DVAGDERWRL YVLEGEEAAP VSAEGVNSLG AWSPDSQKLA FTSTRDDQQN FHLYLYDRAT RSIAKLAEIP GINVVEEWSE VGLFVTHYET NLESAILLYR GGEVRELTKR SPDTMSLSPR YIGGGKLLYL TNEGWEHMGV AQMDLTTGSW KYLIQLDRDV EFFDVWGSYL VFSLNEEGSS GLYMMHMPSG LTHKIATPRG VVTSLQYREG LILFSLSSIN RGHEVYIHQG GALRQLTRSP RFGLQLESLP DPESVWYVSH DGRKIQANIY KPPGAARGVV VYLHGGPESQ DRPELKPLVL ALLMSGFVVA APNYRGSAGF GKSFLRLDDL DKRWDAIKDV EAFARWLTAE GIAKAKPCVM GGSYGGYLTL MALATAPDLW ACGVEIAGIF NLVTFLERTA PWRRRYREAE YGSLDRHRDL LLQLSPATYV DKITAPLLAV HGANDIRVPI HEAEQLAKRL GELGREVKLL VLPDEGHVIT KVENRVKVYT EVLKFVERHQ VY
|
| |
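The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces are display grouping only. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons; alternative start codons are not handled. The names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino acid assignments as the standard code; it differs only in
# the set of permitted start codons, which this sketch does not model.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 9 bases of the gene sequence in this record.
head = "ATGAGCCTTC TTGCGAAAAG GGCTCTCTCG"
tail = "GTTTATTAA"
print(translate(head))  # MSLLAKRALS
print(translate(tail))  # VY
```

The two fragments reproduce the first ten and last two residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the protein length and GC content fields to be checked as well.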