Gene Information |
Locus tag | Tneu_0467 |
Symbol | |
ID | 6166096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 423715 |
End bp | 425259 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641667624 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001793860 |
Protein GI | 171184941 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
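The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the CDS ends with exactly one stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(423715, 425259)
print(gene_len, protein_len)  # 1545 514
```

The result agrees with the listed Gene Length (1545 bp) and Protein Length (514 aa).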
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.011841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000000457573 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGTGGG AGATCATCTA CAAAGCCACT GAGTACATCG CCGAGGAGGC CGGGATCGCG CTGAGGAACT CCGCATTCTC GCCCAACATC AGAGAGCGCA TGGACCACAG CGTCGCTGTG GTAGACGCAG AAGGCCGCGT CGTGGCGCAG GCCGAGCACA TACCGGTCCA CCTGGGGTCC CTCCACGTCG GAGTTAGAAA CCTGCTGGGG GCCGTCGAGG AGCTGGGGGA GGGCGACGCC GTTTTAACCA ACGACCCGTA CATCGCCGGG ACACACCTCA ACGATGTCCT CCTGTTGTAC CCAGTCCACT GGCGGGGGAG GCTCGTCGCC TACATAGCCT CGAAGGCGCA CTACGTAGAT GTGGGAGGGC CCCTCCCCGC GTCGCTTAAC CCACGGGCGG AGACCATATA CGAGGAGGGG GTCGTCATAC CGCCCGTGAA AATACTGAGG CGGGGGGAGC TAAATAGGGA GGCCCTGTCT TTCATCCTGG AGAACTTCAA AACGCCGGAG GCCGCGCGGG GCGACCTAGA GGCTCAGCTC GCAGCCTCGC GCGTCGGCGC GGCTAAGGTG AGAGAGCTCT TCGAGCGCTA CGGCGAGAGG GTGCTAGACG CCTGGAGAGA GGCCGTGGAA TACGGCAGGA GGCTGGCCCT AGGCGAGATT TCCAGGTGGC CCCGGGGGAG GTACGAGGCC GTCGACTATC TCGACTGGAG GGGGGAGCTA CTGCCCATTA GAGTTGCGCT CGAGGTGGGA GAGGGCGGGG TGAGGGCGGA TTTCACGGGG ACGGCGAGGC AGGCCGAGGC CCCCATAAAC GCCGTCTTCG GGGTGACCTA CTCCGCGGCG GCTTTCGCGG TTAGGGCGGC GCTGGCCGCC GACGTGCCGA CAAACCACGG CTTCTACAGC GTTGTGGAGG TGGAGGCCCC CCTGGGCACC ATAGTCAACC CCGTGAAACC CGCGGCGGTG GGCGCCGGCA ACCTGGAGAC AAGCCAGAGG GTGGCGGACG CCGTCTTCAT GGCCCTCTCC AGGGCACTGC CGGGGAGGAT CCCCGCGGCG GGCTCCGGCA CAATGATGAA CGTGATGATC GGAGGCGTCT GGAGAGGCAG ATACTGGTCC TACTACGAGA CAATAGGCGG CGGGACAGGC GGGCGGCCCA ACAGCCCAGG CGTCTCCGGC GTCCACGTCA ACATGACAAA CACCCTAAAC ACCCCCATAG AAATAGCGGA GAGGACATAC CCCATCAAGT TCACCGCCTA CAAGATACGC GAGGGGAGCG GAGGCCCCGG CAGATACAGA GGCGGAGACG GAATAGTCCG GGCCTTCAAG ACGCTGGCCC CCGCCACCCT CTCGATAATA GCCAGCCGCT TCGACACCCG GCCGTGGGGG CTAGAGGGCG GCTGCCCCGG AAAGCCGGCA AAAGCCCTAG TCAAGAGGCG GGGCGGAGAA TTCGCCATAA GGTCAGACAC GGTGTACCTC GACGAAGGCG ACGAAGTCGT GATAGAGACG CCAGGCGGAG GAGGCTACGG CCCCCCAGAC CAGCCCTGCG ATTAA
|
Protein sequence | MRWEIIYKAT EYIAEEAGIA LRNSAFSPNI RERMDHSVAV VDAEGRVVAQ AEHIPVHLGS LHVGVRNLLG AVEELGEGDA VLTNDPYIAG THLNDVLLLY PVHWRGRLVA YIASKAHYVD VGGPLPASLN PRAETIYEEG VVIPPVKILR RGELNREALS FILENFKTPE AARGDLEAQL AASRVGAAKV RELFERYGER VLDAWREAVE YGRRLALGEI SRWPRGRYEA VDYLDWRGEL LPIRVALEVG EGGVRADFTG TARQAEAPIN AVFGVTYSAA AFAVRAALAA DVPTNHGFYS VVEVEAPLGT IVNPVKPAAV GAGNLETSQR VADAVFMALS RALPGRIPAA GSGTMMNVMI GGVWRGRYWS YYETIGGGTG GRPNSPGVSG VHVNMTNTLN TPIEIAERTY PIKFTAYKIR EGSGGPGRYR GGDGIVRAFK TLAPATLSII ASRFDTRPWG LEGGCPGKPA KALVKRRGGE FAIRSDTVYL DEGDEVVIET PGGGGYGPPD QPCD
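The gene and protein sequences above are related by codon translation, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as space-separated blocks as shown here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, so the standard codon assignments are used; alternative start codons are not handled.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAGGTGGG AGATCATCTA CAAAGCCACT"
print(translate(first_blocks))  # MRWEIIYKAT
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields.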
|
| |