Gene Information |
Locus tag | Tneu_1820 |
Symbol | |
ID | 6164604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 1599294 |
End bp | 1600454 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641668983 |
Product | metallophosphoesterase |
Protein accession | YP_001795183 |
Protein GI | 171186264 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00514924 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCC TACACATCTC GGATGCGCAC CTCGGCAGAG CCCAGTACCA CCTCCCGGAG AGGGAGGAGG ACTACTTCAG AGCGTTTGAG GAGGCGCTGA GGAGGGGGAG GGGGGCAGAC GCCGTCTTGA TAACGGGCGA CCTCTTCGAC CTCAAGAGGC CCTCCACGAG GGCGCTCGTG AAGTTTGTGG AGGCCGTGGA GGCGGCGGGC GCGCCTGTCT ACCTAATCGG GGGAAACCAC GACTTCAGCT ACGTCCGGTA TAGGGCTGAG GCGGAGAGGT GTCCGCGGCC GGCGGAGTGC CTCTACGACA CGGCGCTGAG GCTTCTAGAT AGGCTGAGGC TGGCGAAGCT CCTCTGTTGG GAGTCCGTAG ACGCCGGGGG CGTTCACATA TTTGGGGCAT GCGCAACGCC TAGGGACTAC GCGGCGGAGT ACCGCCGGGC TCTCCAGAGG ATGCCGCCGG GGGCTGTCCT CGCCATACAT CAGGCGGTGG AGGGGGTCAA GGCCAGGTAC CCCGCCGAAG ACGACGAATA CACCATGCCC CAGGAGGTTT TCCAAGGCCT GCCGTATGTA CACATCGCCG CTGGCCACAT ACACGACCAT CTGGCGAGGC ATCCGGTAGG CGCCGTGTGG GCGGGGTCGC TTGAGGTCTG GGACGTCGGG GAGTTCGAGA CCTGGGACTA CAGAGGGGGC TTTGAGAAGG CGCAGGACAG AGCTGAGAAA GGCGCTGTCT TAATAGACGT AGCCGGTAGG GCGGTCTCCC TCAGAGCTAT CCCCATCCCC CCTGGGAGGC CTCTGTACAG GGTCAGGCTC TATGTCAGGG AGAGGAGGGA GGCCTACGGG GCTGCGGAGG AGGCGGCGAA GCTTTTTGAC AAACCGGGGG CGGTGGTTAG AGTCGAGGTG TGGGGTACGT TGGAGGAGGC TCTAAGGCCT AGGCAGATGG CTACCTTGTT TACAAAAGCC CTCTACGTCG ACGTTGTTGA CAGAACCGCC GCCCCGCAGA GGGCCGTGTC TCTAAGGGGG TCCGCCATGG AGGAGCTGTG GCGGCTGATG AGGGAGAAGC TTGGGCAACA CGCCGAGGTG GTGCTCAGGG CTATGGAGCT CCTTAGAGAG GGGGAGAAGG AGGCGGCGTA CAAGCTCATC CTCAAGGCGC TTTATGATTA G
|
Protein sequence | MKLLHISDAH LGRAQYHLPE REEDYFRAFE EALRRGRGAD AVLITGDLFD LKRPSTRALV KFVEAVEAAG APVYLIGGNH DFSYVRYRAE AERCPRPAEC LYDTALRLLD RLRLAKLLCW ESVDAGGVHI FGACATPRDY AAEYRRALQR MPPGAVLAIH QAVEGVKARY PAEDDEYTMP QEVFQGLPYV HIAAGHIHDH LARHPVGAVW AGSLEVWDVG EFETWDYRGG FEKAQDRAEK GAVLIDVAGR AVSLRAIPIP PGRPLYRVRL YVRERREAYG AAEEAAKLFD KPGAVVRVEV WGTLEEALRP RQMATLFTKA LYVDVVDRTA APQRAVSLRG SAMEELWRLM REKLGQHAEV VLRAMELLRE GEKEAAYKLI LKALYD
|
| |
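The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a stop codon, which is consistent with the values listed above. The function names are hypothetical helpers introduced for this example.

```python
def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_groups(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Values taken from the Gene Information table above.
    length = gene_length(1599294, 1600454)
    print(length)                  # 1161, matching "Gene Length"
    print(protein_length(length))  # 386, matching "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` should give a value that can be compared with the GC content field; `strip_groups` handles the space-separated blocks, so the field can be pasted in as shown.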