Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0096 |
Symbol | |
ID | 6164580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 82312 |
End bp | 83787 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641667263 |
Product | carboxypeptidase Taq |
Protein accession | YP_001793500 |
Protein GI | 171184581 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.116202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00735546 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAGGT CTCAAACCGT CAAGGAGATC CTGGACCACT ACAGGGTTAT ATGGGCGTTG GGGCACGCCC AAGCGCTGAT GGGCTGGGAC TCGGAGACGT ACATGCCGGA GGAGGGGGTC AAGGGGCGGG CCGCCGCCAG GGCCGAGATC GCGCAGCTTA TTCAGAGGTT TATGCTTGAT GAGAAGTTCG TCAAGCTCTT GGATAAGGCC GAGGAGGAGA GGGATCTCAC AGACGTGGAG AGGGGCATCG TCCGGGTGCT CAAGAGAGAT CTGAGGTTCT ACACCAGGGT GCCCCCCGAG GTGGTGAAGG AGCTCGCTAA GGTCACCTCC GAGGCTTTTG TGGCGTGGAG GGGGGCCAAG GAGAAGGCCA GGTTCGACCT CTTCGCGCCT TATCTGGAGA AGATAGTTGA GCTCTCCAGG GTGGTCGCCG AGAAGCTCGG CTACGAGGAG CATCCCTACG ACGCGCTCCT CGACCTATAC GAGGAGGGGC TTACGTCGAG AGACGTGGAG TCGATATTCT CCACCCTGGA GCCCGGCATA AGGTCGCTCC TCAACAAGCT GGAGGCGAGG GGGTGGCCCA GAAGCCACCC GCTTGAGGAG GCTCCCTACG ACAGGCCTGC CCTCGAGGCC GCCATTGCGG AGGTGCTTGA CCTCCTCGGC TACCCCAGGG GGAGGTTCAG GGTGGACGTG TCGCCGCACC CCTTCACCAT CGGCATCGCG ACGCCGTACG ACGTGAGGAT CACGGTTAGA TACAGGGGGG TCGACTTCAG AGAGCCGCTC TTCTCGGCGC TCCACGAATA CGGCCATGCT CTGTATGAGC TAAACGTGGC GGAGGAGCTG GCCATGACGC CCGTCGGAAC CGGCGTATCC CTCGGAGTGC ACGAGAGCCA GTCGAGGTTC GTGGAGAACG TGGTGGGGAG GAGCCGGGAG TTTATACAGA GGATCTCCCC GATATTGAGG AGGCGTCTCC CCCTCCTCTC GAAGTACGGC GACGAGGATC TGTTCTACTA CTTCAACCTG GTTAGGCCGA GCCTGATACG TACAGAGGCC GACGAGGTGA CCTACAACCT ACACATACTC CTGCGGTATA GGCTGGAGCG CCTCATGATA ACGGGCGAGG TGAAGGTGAG CCAGCTCCCC GAGCTTTGGA ACAGCGAGAT GGAGCGGTTG CTTGGGGTAA AGCCGCGTAA CGACGCGGAG GGGGTTCTGC AGGACGTCCA CTGGTCCCAC GGCTCGATAG GGTACTTCCC CACCTACACC CTGGGGAACG TCGTGGCGGC TATGATCTAC TACAAACACG GCAACGTACG CGGCCTCGTC TCAGAGGGCA ACTTCGCCGC GGTTAAGGAG TACCTCCGGG AGAAGATACA CAGGTGGGGT AGCGTCTACC CGCCGAAGGA GCTGCTCAGG AGGAGCTTCG GCGAGGCGTA CAACGCCGGC TACCTCGTGA AATACCTAGA GGAGAAGTAC CGCTAG
|
Protein sequence | MIRSQTVKEI LDHYRVIWAL GHAQALMGWD SETYMPEEGV KGRAAARAEI AQLIQRFMLD EKFVKLLDKA EEERDLTDVE RGIVRVLKRD LRFYTRVPPE VVKELAKVTS EAFVAWRGAK EKARFDLFAP YLEKIVELSR VVAEKLGYEE HPYDALLDLY EEGLTSRDVE SIFSTLEPGI RSLLNKLEAR GWPRSHPLEE APYDRPALEA AIAEVLDLLG YPRGRFRVDV SPHPFTIGIA TPYDVRITVR YRGVDFREPL FSALHEYGHA LYELNVAEEL AMTPVGTGVS LGVHESQSRF VENVVGRSRE FIQRISPILR RRLPLLSKYG DEDLFYYFNL VRPSLIRTEA DEVTYNLHIL LRYRLERLMI TGEVKVSQLP ELWNSEMERL LGVKPRNDAE GVLQDVHWSH GSIGYFPTYT LGNVVAAMIY YKHGNVRGLV SEGNFAAVKE YLREKIHRWG SVYPPKELLR RSFGEAYNAG YLVKYLEEKY R
|
| |