Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0466 |
Symbol | |
ID | 6165691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 421769 |
End bp | 423718 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641667623 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001793859 |
Protein GI | 171184940 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.162226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000000366174 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGCGGC GGATCGTGGC GGTGGACGTC GGGGGGACCT TCACGGACTT CGTCGCAGTG GACGAGACGG GGGCGGTGAA AACCCTCAAG ATGTTGTCCA CGCCGAGGGA GCCGGAGAGG GCCGTGGCGG AGGGGCTGTC CAGGCTGGAC TTCGGCGAGG TTCTCCACGC CTCCACCATC GGCACAAACG CCCTCCTCGG CCAGATGGGG CTGGAGCTTC CCAAGGTTGC GCTCTTCACC ACGAGGGGCT TCCGCGACGT GATAGAGATA GGGCGGCAGA ACAGGCCTAG GCTGTACGAC CTCTACTTCG CCAAGCCACG CCAGCTTGTG CCAAGGGAGC TCAGGTTCGA GGTAGACGAG AGAACGCTGG CAGACGGCAC AGTGGAGAAG GCCGTCGAGC CCACTGAGGT GGAGGCGCTG GCCGCCCGCG CCCGATCCAT GGGCGCCGTA AGCGCCGCGG TGGCCTTCCT CCACTCCTAC GCAAACCCCA GCAACGAGAA AGCCGCCAAG GAGGTGCTCC AGAGGTGGTT CAGCTACGTC ACCGCGTCGC ACGAGGCGGC GCAGGAGCCT AGGGAGTACG AGCGCTTCTC CACAGCCGTC GTCAACGCGG CTTTGATGCC GCTGGTAGGA CGCTACCTGT CCCGCCTGAG GGAGTTCGTA GAGGGCAGAG GCGGGTCGCT CTACGTCATG TCCAGCTCCG GCGGACTTGT GACAGTTGAG GAGGCCGCCA GGAGGCCTGT GCAACTGGTA GAGTCGGGCC CCGCGGCCGG CGTCGTCGCC GCGGCTGAGC TGGCCAAGCT GGCGGGTGAG GCCAGAGTAA TCTCCTTCGA CATGGGAGGA ACGACGGCGA AGGCCGGCAC GGTGGTGGAC TTCCAGCCGT CCATCACCAC AGAGTACGAG GTAGGCGGGG AGAGCCACAG AGGGAGGCTT GTCAAAGGCT CGGGCTACCC TGTGAGGTTC CCCTTCGTAG ATCTGGCCGA GGTGTCGGCG GGCGGGGGGA CGGTCATATG GAGAGACGCG GGAGGCGCCC TCAGGGTGGG GCCCATAAGC GCGGGGGCGG ACCCAGGGCC CGTCTCCTAC GGCAGGGGAG GCACCCAGCC AACGGTCACA GACGCAAACC TAGCCCTGGG CAGAATCCCG GAGGCCTTGG CCGGCGGCGC GCTGAAGCTC AACGCCAAAG CCGCATTAGA GGCGTTGGGC AAGCTGGGCG ACCCAGTCGA CGTAGCTGCA GACGCGGTAA AGCTTGTGGA TGTGGAGATG GCGAGAGCCA TAAGGCTGGT CACGGTGGAG CGGGGTCTAA ACCCGGGGGA CTTCGCGCTT ATGGCCTTCG GCGGGGCGGG GCCGCAACAC GCCGCGGAGC TCGCCGAGGA GGTCGGCATA AGCCGCGTGT TGGTGCCCCC CCTCCCGGGG GTATTCACCT CGCTGGGCAT GCTCATGGCC GACTTCCGGT TCGAGGCCAG GATGGCCTAC CCCAGAGACC TGGAGGGCGG GTTTAGACGG CTGGAGGAGG AGCTGGCCCG GTACCGCCCC GACTACTACG TGAGATATGC CGACGTTAGA TATGTGGGGC AGGGCTGGGA GCTCACGATC CCACTAGGCC AAGATCTCAC GCCACAGGCC GTCAGGCGGG CCTTCGAGGA GAAACACAAG GCGACCTACG GATTTGTCCT AGACAGGCCG GTGGAGGTTG TGACGATACG CGTGTTCGCC GTCGTGAAGA GGCAGAAGCC GAAGCTCCCA CAGCCGCCGG AGGGGGGAGA CCCCGCGCCG CAGGAGAAGG AGGTCTACTT CGACGGCTGG ACTAAAGCCG CAGTTTACAA CAGGTCGGAG CTACCGCTGG GCTACCGGGT GAGGGGCCCA GCGCTGATCG TCGAAGACTA CTCCACCACG GTGGTGCCGC CGAGGTGGGA GGCGGAGGTG AAGAGGTACG GACTTGAGCT GAGGCTATGA
|
Protein sequence | MPRRIVAVDV GGTFTDFVAV DETGAVKTLK MLSTPREPER AVAEGLSRLD FGEVLHASTI GTNALLGQMG LELPKVALFT TRGFRDVIEI GRQNRPRLYD LYFAKPRQLV PRELRFEVDE RTLADGTVEK AVEPTEVEAL AARARSMGAV SAAVAFLHSY ANPSNEKAAK EVLQRWFSYV TASHEAAQEP REYERFSTAV VNAALMPLVG RYLSRLREFV EGRGGSLYVM SSSGGLVTVE EAARRPVQLV ESGPAAGVVA AAELAKLAGE ARVISFDMGG TTAKAGTVVD FQPSITTEYE VGGESHRGRL VKGSGYPVRF PFVDLAEVSA GGGTVIWRDA GGALRVGPIS AGADPGPVSY GRGGTQPTVT DANLALGRIP EALAGGALKL NAKAALEALG KLGDPVDVAA DAVKLVDVEM ARAIRLVTVE RGLNPGDFAL MAFGGAGPQH AAELAEEVGI SRVLVPPLPG VFTSLGMLMA DFRFEARMAY PRDLEGGFRR LEEELARYRP DYYVRYADVR YVGQGWELTI PLGQDLTPQA VRRAFEEKHK ATYGFVLDRP VEVVTIRVFA VVKRQKPKLP QPPEGGDPAP QEKEVYFDGW TKAAVYNRSE LPLGYRVRGP ALIVEDYSTT VVPPRWEAEV KRYGLELRL
|
| |