Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1005 |
Symbol | |
ID | 6165080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 895986 |
End bp | 897881 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641668158 |
Product | metallophosphoesterase |
Protein accession | YP_001794383 |
Protein GI | 171185464 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000460676 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000515762 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGCAGC TGTACCTCCT CCTCGCCGCG GCCCTCGCCC TAGCCGCCTC CATCGTCGAC CCCAGGTGGG CCGTCCCCGC CTACGTAACC CCCGGCGGCG CCTTCAACAT CACGCTGGAT GCGCCAACCG CTGTGAAAAG CGTAGCCCTA GCCGCCCCCG GCGTGGAGAA GCCGCTGGAG CTCAGCTTCA ACGCCTCTGG AGCCGTCATC ACGGCTCGCG TCCCGGCGGA CGTCCCCCCC GGCCTCTACG ACCTCGTAAT CAACGGCGGT GAGATATACG AGCCCAAGGC CGTCTGGGTC GGCAACGTGA CGGGGCCCCT TAGGATAATC CAGCTCACCG ACATACACGT GGGAGTTGAG CTAGACATGG CCTCCATATA CCGCCTAACC CACGCCGCTC TCTACGCCAG CTCCAGCCCC TACGACGTCG TGTTCCTCAC CGGCGACCTC GCAGACGTGG GAGGCCAGCC CTGGCAGTAC GCCCTGCTGG TCAGATACAC CTCCACAATT ACAAAGCCCA TCTTCGCCGT GCCGGGGAAC CACGACCACG CCGGCGACGA CCCCCTCAAT AACTACAGGA GGTATGTGGG GCCCCCCTAC TGGTACAGAG TCGTCGGGCC CTACCTAATA ATCGGCCTAG ACAGCGGCCA CGACGGCTAT CTAACAGAGG AGCAGGTCAA GTTCTACGGA GACGTCCTGA GGCGCTACCC AGACAAGGTG AAGATAGTCC TAATACACCA CCCGCCCTTC TACATACGCG ACGCCTACGT CGCCGAGATC TACCGCGGCC CCCAGGACAT AGACAGACTC AGCAGAGACC CCACCGGGAG GAGGAACTAC TACATCGTCT ACACCAGCTA CCTCTACAAC CGCCCCACCT ACGAGAGGTT CCTCAACCTA ACGATTAGGT ACAGAGTAGC CCTGGTGATG GCCGGCCACG TACACCCCGG CAACTCCACC GTGGTCATAA ACGGCACCTA CTTCGTCACC ACCAGGACGT TGGGAGGGTC TGTGGACACC TCCCACGGCT TTAGAACCTA CGTCGTCTAC CCAGACGGCC GCGTCCAGAT AAACCCAGAA ACCCTCACCT ACAAAAACTA CGCCGTCGTG GCCCAAGGCG CCAAAGCCGC CCAGATATAC GCCGACAGCG ACCTCCTCCC CGGCACAATA GCCATAGACC TACCCGGCCA GTACCAAGGC CTCAAAGCCC TAAACGGCAC AGCCCAGCTA GTAGAAGCGA AGAAACACCC ACTAGCCAAA TACACGCGGT ACTACATCTC TACAGCAGGC AAGCCCATCT GGATAGCCAT CGGAGACTAC GCCCCAGCCC CCACCCTATC GGTAGAAAAG ATAATGCCCA GGTCCCCCAC CCCCGGCGAC GTGGTGACAG TCACGATAAA GGCAGAGGAC CCCAACGTCG GCATACCCTT CCTAACGGTA GACGGCAAGA AGATACTCGC CTCCTACCCA GGAGAGCAGC CGGTATACCA ATACAGATTC AGATACGACA AGCCAACCAC ACTACAGATA CAAGCCCCAG GAGGCCAACC CACAACAGTA CAGATAGGCC AAACCACGCA GCCGCCAACA ACTACGCCGA CACCCACCCC GACACCCACA GCCACCCCAA CCCCAAGCCC CACGCCAAGT AAAACACCAA CGCCCACGCC AACTACACCA CCCCCAACCA CCCCGACGGC CACCTCTCCA ACACCTACGC CCACCCCCAC CCGCACGCCA ACCCCCACCG CAGCTCCGGC GCCAGCCCCA TTCCCCACTG AAGCCGCCGT GCTAATCGCA ATAGCCGCGG TCGGCGCCGC GCTACTGGCG GCCCTCGCCA AAACAGGCAA AAAGAAGGCA GAAACCGGCG GAACCAGGGT ATACGGAGAA AGATAA
|
Protein sequence | MRQLYLLLAA ALALAASIVD PRWAVPAYVT PGGAFNITLD APTAVKSVAL AAPGVEKPLE LSFNASGAVI TARVPADVPP GLYDLVINGG EIYEPKAVWV GNVTGPLRII QLTDIHVGVE LDMASIYRLT HAALYASSSP YDVVFLTGDL ADVGGQPWQY ALLVRYTSTI TKPIFAVPGN HDHAGDDPLN NYRRYVGPPY WYRVVGPYLI IGLDSGHDGY LTEEQVKFYG DVLRRYPDKV KIVLIHHPPF YIRDAYVAEI YRGPQDIDRL SRDPTGRRNY YIVYTSYLYN RPTYERFLNL TIRYRVALVM AGHVHPGNST VVINGTYFVT TRTLGGSVDT SHGFRTYVVY PDGRVQINPE TLTYKNYAVV AQGAKAAQIY ADSDLLPGTI AIDLPGQYQG LKALNGTAQL VEAKKHPLAK YTRYYISTAG KPIWIAIGDY APAPTLSVEK IMPRSPTPGD VVTVTIKAED PNVGIPFLTV DGKKILASYP GEQPVYQYRF RYDKPTTLQI QAPGGQPTTV QIGQTTQPPT TTPTPTPTPT ATPTPSPTPS KTPTPTPTTP PPTTPTATSP TPTPTPTRTP TPTAAPAPAP FPTEAAVLIA IAAVGAALLA ALAKTGKKKA ETGGTRVYGE R
|
| |