Gene Information |
Locus tag | Tneu_0103 |
Symbol | |
ID | 6164805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 88512 |
End bp | 90701 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641667270 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001793507 |
Protein GI | 171184588 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
|
|
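The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS is unspliced and ends in a stop codon (the record lists "Is gene spliced | No"). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(88512, 90701)
print(length)                  # 2190
print(protein_length(length))  # 729
```

Both values match the Gene Length (2190 bp) and Protein Length (729 aa) rows of the record.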
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.297331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00614894 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGAAAA TCATGGCGGG GGTTTTCTCA CAGCCGAAGA GGAAAAATAC CCCCGCCCCG GCGCCCCATG TGGAGGTTTC TGAGCTTCCC CTAGATGGGC GTTTGATCTC CGTGCTGAGG GGGAGGGGGG TCCGGGAGCT CTTCCCCCCT CAGGTGGAGG CTGTCAAGGC GGGTCTCTTC GACGGGAGGA GCGTGCTCCT CTGCACCGCC ACGGCCTCGG GCAAATCCCT CCTGGCCGAG GTGGCGTCTG TGAAGGCGGC GTTGGAGGGG GGGATGGCGC TGTATGCCGT CCCGCTGAAG GCGCTTGCCT ACGAGAAGCT GGTGCACTTC TCCCACTACG GCGGTCTGGC TAAGGTGGGC GTCTCCACGG GGGACTTCGA CTCCGAGGAC AGGAGGTTGC ACGAGTACGA CGTGTTGGTT GTGACATACG AGAAGCTGGA CAGCCTCCTG AGGCATAGGC CGGGGTGGCT GAGCTCTGTA AAAACGGTGG TGGTGGACGA GATCCACTAC CTCGGCGACC CGAGGAGGGG GCCGGTGCTT GAGTCTATCA TCGCCAAGCT TAAACACCTG GGCCTCAGGG CGCAGTTCAT CGGCTTAAGC GCCACCGTGG GGAACGCCGC CGAGGTGGCG GAGTGGCTGG GGGCTAGGCT TGTGGCGTCC AGCTGGAGGC CCGTCCCGCT GAGGGAGGGG GTCTACCACG GGGGGAGGAT ATACTACCCA GACGGCACAT ATAGGCGGGT TGAGGCGGCG GGCGACGCCG AGGTTGCCCT GGCGGTGGAC GCCGTGGCGG GCGGGGGACA GGCGCTTGTG TTCGCCAGTA GCAGATCCTC CACGGTGAGG ATCGCCAAGG CGGTGGCTAA GGCCATCGCG GCCCACCCCG CCAGGCTGAT CGACGTGGCC GAGGCGTCGA GGCTGGCCGG GGAGGTCGCC AGGGCCTCCT CCAGCAAGAT CATAGGGAGG GAGCTCGCGG AGCTTGTGGC GCGGGGCGTG GCTTTCCACA ACGCCGGCTT GGAGCTGGAG GTGAGGAGGC TGGTGGAGGA GGGCTTCAGG AGGGGGGTCG TCAAGGTGGT GGTGTCCACG ACGACGCTGG CCGCGGGGGT GAACCTGCCG GCTAGGAGGG TCGTGGTGGC GGACTTCGAG CGTTTTGACC CCGTGGTGGG GAGGGAGGAG ATCCCGGTTC TCGAGTATAA GCAGATGGCC GGCCGCGCCG GCAGGCCGGG CCTCGACGCC GTGGGGGAGG CCGTCTTGGT GGCGAGGAGT AAAAACGAGG TGGGCTACCT CATGGAGAGG TACGTCAGGG GGCAGGTGGA GGAGGTGAGG TCCCACATCT TGGCCGGCCC CAACCTCAGG GCGCACGTCT TGGGGGCGGT GGGGGGCGGC TACGCCCGGT CTATAGACGA GCTGGTGGAC TTCTTCTCCA ACACCCTGGG GTACCACCAG GCGAGGACCG CCGTCAAGTC GGCTCTACTT CGGTCCAGGG TGGGGGCGGC CGTGGAGGAG CTGGAGGAGT GGGGGTTCCT GGAGCGAGAC GGCGAGCGCG TATACGCGAC GGAGCTGGGG AGGCAGGTGG CGAGGCTGTA TCTAGACCCC GAGACGGCCG CCAGATACAT CGAGCTTATA AAGAGCTTGA GGGGGGGCAA CACGTCGGCG TATCTCTACG TCGTGCTCAC GGCGCGCGAC TTCCCCAGGG TGAGGAGGGG GAGGGCGGAT AGGCGCGTGG CCGAGGAGGT CCTCTCCGTG CTCGACGTGG AGGAGGACGA GGAGTTTGAG GAGGTGGTTA AGACCGCCTC GGTGCTGACG 
GCGTGGATCG AGGAGGAGGA CGAAGACGCC ATATTCAACA GGTTCGAGGT GGCGCCGGGG GATCTGCGGG TGTACATAGA CCTATTCGAG TGGCTTGGGA ACGCGGCGGC TAAGCTAGCC GGTATGCTGG GCTTGGAGGA GCACAGGAGG GGGCTGGAGG TGGTGACGGC GAGGGTGATC TACGGCGTTA GGGAGGAGTT GCTGGAGCTC GTGACTGCGC TGAGGGGCGT GGGGAGAGTT AGAGCTAGGG CTCTCTACAA CTTCGGCTAC AGGACGGTTA AGGACTTGGC CAAGGCCTCC GTCAGGGAGA TAGCCTCTCT GCCCGGCTTC GGCGAGAGGC TGGCCGAGTC CATCATCGAG CAGGCCCGGC AGCTGGCTGG GCTATCGTGA
|
Protein sequence | MSKIMAGVFS QPKRKNTPAP APHVEVSELP LDGRLISVLR GRGVRELFPP QVEAVKAGLF DGRSVLLCTA TASGKSLLAE VASVKAALEG GMALYAVPLK ALAYEKLVHF SHYGGLAKVG VSTGDFDSED RRLHEYDVLV VTYEKLDSLL RHRPGWLSSV KTVVVDEIHY LGDPRRGPVL ESIIAKLKHL GLRAQFIGLS ATVGNAAEVA EWLGARLVAS SWRPVPLREG VYHGGRIYYP DGTYRRVEAA GDAEVALAVD AVAGGGQALV FASSRSSTVR IAKAVAKAIA AHPARLIDVA EASRLAGEVA RASSSKIIGR ELAELVARGV AFHNAGLELE VRRLVEEGFR RGVVKVVVST TTLAAGVNLP ARRVVVADFE RFDPVVGREE IPVLEYKQMA GRAGRPGLDA VGEAVLVARS KNEVGYLMER YVRGQVEEVR SHILAGPNLR AHVLGAVGGG YARSIDELVD FFSNTLGYHQ ARTAVKSALL RSRVGAAVEE LEEWGFLERD GERVYATELG RQVARLYLDP ETAARYIELI KSLRGGNTSA YLYVVLTARD FPRVRRGRAD RRVAEEVLSV LDVEEDEEFE EVVKTASVLT AWIEEEDEDA IFNRFEVAPG DLRVYIDLFE WLGNAAAKLA GMLGLEEHRR GLEVVTARVI YGVREELLEL VTALRGVGRV RARALYNFGY RTVKDLAKAS VREIASLPGF GERLAESIIE QARQLAGLS
|
| |
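The sequence fields are printed in blocks of ten characters separated by spaces, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: `clean`, `gc_percent`, and `translate` are hypothetical helper names. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code.

```python
import re

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return re.sub(r"\s+", "", seq).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two display blocks of the gene sequence in this record.
print(translate("ATGTCGAAAA TCATGGCGGG"))  # MSKIMA
```

Applied to the full gene sequence, `translate` would be expected to reproduce the listed protein sequence once its display spaces are removed, and `gc_percent` could be compared against the GC content row.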