Gene Tneu_0103 details


Gene Information

Locus tag: Tneu_0103
Symbol:
ID: 6164805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 88512
End bp: 90701
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 66%
IMG OID: 641667270
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_001793507
Protein GI: 171184588
COG category: [R] General function prediction only
COG ID: [COG1204] Superfamily II helicase
TIGRFAM ID:
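
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(88512, 90701)
print(length, protein_length_aa(length))  # 2190 729
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.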


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.297331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00614894
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGAAAA TCATGGCGGG GGTTTTCTCA CAGCCGAAGA GGAAAAATAC CCCCGCCCCG 
GCGCCCCATG TGGAGGTTTC TGAGCTTCCC CTAGATGGGC GTTTGATCTC CGTGCTGAGG
GGGAGGGGGG TCCGGGAGCT CTTCCCCCCT CAGGTGGAGG CTGTCAAGGC GGGTCTCTTC
GACGGGAGGA GCGTGCTCCT CTGCACCGCC ACGGCCTCGG GCAAATCCCT CCTGGCCGAG
GTGGCGTCTG TGAAGGCGGC GTTGGAGGGG GGGATGGCGC TGTATGCCGT CCCGCTGAAG
GCGCTTGCCT ACGAGAAGCT GGTGCACTTC TCCCACTACG GCGGTCTGGC TAAGGTGGGC
GTCTCCACGG GGGACTTCGA CTCCGAGGAC AGGAGGTTGC ACGAGTACGA CGTGTTGGTT
GTGACATACG AGAAGCTGGA CAGCCTCCTG AGGCATAGGC CGGGGTGGCT GAGCTCTGTA
AAAACGGTGG TGGTGGACGA GATCCACTAC CTCGGCGACC CGAGGAGGGG GCCGGTGCTT
GAGTCTATCA TCGCCAAGCT TAAACACCTG GGCCTCAGGG CGCAGTTCAT CGGCTTAAGC
GCCACCGTGG GGAACGCCGC CGAGGTGGCG GAGTGGCTGG GGGCTAGGCT TGTGGCGTCC
AGCTGGAGGC CCGTCCCGCT GAGGGAGGGG GTCTACCACG GGGGGAGGAT ATACTACCCA
GACGGCACAT ATAGGCGGGT TGAGGCGGCG GGCGACGCCG AGGTTGCCCT GGCGGTGGAC
GCCGTGGCGG GCGGGGGACA GGCGCTTGTG TTCGCCAGTA GCAGATCCTC CACGGTGAGG
ATCGCCAAGG CGGTGGCTAA GGCCATCGCG GCCCACCCCG CCAGGCTGAT CGACGTGGCC
GAGGCGTCGA GGCTGGCCGG GGAGGTCGCC AGGGCCTCCT CCAGCAAGAT CATAGGGAGG
GAGCTCGCGG AGCTTGTGGC GCGGGGCGTG GCTTTCCACA ACGCCGGCTT GGAGCTGGAG
GTGAGGAGGC TGGTGGAGGA GGGCTTCAGG AGGGGGGTCG TCAAGGTGGT GGTGTCCACG
ACGACGCTGG CCGCGGGGGT GAACCTGCCG GCTAGGAGGG TCGTGGTGGC GGACTTCGAG
CGTTTTGACC CCGTGGTGGG GAGGGAGGAG ATCCCGGTTC TCGAGTATAA GCAGATGGCC
GGCCGCGCCG GCAGGCCGGG CCTCGACGCC GTGGGGGAGG CCGTCTTGGT GGCGAGGAGT
AAAAACGAGG TGGGCTACCT CATGGAGAGG TACGTCAGGG GGCAGGTGGA GGAGGTGAGG
TCCCACATCT TGGCCGGCCC CAACCTCAGG GCGCACGTCT TGGGGGCGGT GGGGGGCGGC
TACGCCCGGT CTATAGACGA GCTGGTGGAC TTCTTCTCCA ACACCCTGGG GTACCACCAG
GCGAGGACCG CCGTCAAGTC GGCTCTACTT CGGTCCAGGG TGGGGGCGGC CGTGGAGGAG
CTGGAGGAGT GGGGGTTCCT GGAGCGAGAC GGCGAGCGCG TATACGCGAC GGAGCTGGGG
AGGCAGGTGG CGAGGCTGTA TCTAGACCCC GAGACGGCCG CCAGATACAT CGAGCTTATA
AAGAGCTTGA GGGGGGGCAA CACGTCGGCG TATCTCTACG TCGTGCTCAC GGCGCGCGAC
TTCCCCAGGG TGAGGAGGGG GAGGGCGGAT AGGCGCGTGG CCGAGGAGGT CCTCTCCGTG
CTCGACGTGG AGGAGGACGA GGAGTTTGAG GAGGTGGTTA AGACCGCCTC GGTGCTGACG
GCGTGGATCG AGGAGGAGGA CGAAGACGCC ATATTCAACA GGTTCGAGGT GGCGCCGGGG
GATCTGCGGG TGTACATAGA CCTATTCGAG TGGCTTGGGA ACGCGGCGGC TAAGCTAGCC
GGTATGCTGG GCTTGGAGGA GCACAGGAGG GGGCTGGAGG TGGTGACGGC GAGGGTGATC
TACGGCGTTA GGGAGGAGTT GCTGGAGCTC GTGACTGCGC TGAGGGGCGT GGGGAGAGTT
AGAGCTAGGG CTCTCTACAA CTTCGGCTAC AGGACGGTTA AGGACTTGGC CAAGGCCTCC
GTCAGGGAGA TAGCCTCTCT GCCCGGCTTC GGCGAGAGGC TGGCCGAGTC CATCATCGAG
CAGGCCCGGC AGCTGGCTGG GCTATCGTGA
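
The GC content field reported for this gene could in principle be recomputed from the sequence listing above. A minimal sketch follows, assuming the listing is plain DNA in blocks of ten separated by whitespace; the helper name `gc_percent` is hypothetical and not part of the source database.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy inputs; paste the full gene sequence to compare against the record.
print(gc_percent("ATGC"))       # 50.0
print(gc_percent("GGCC GGCC"))  # 100.0
```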
 
Protein sequence
MSKIMAGVFS QPKRKNTPAP APHVEVSELP LDGRLISVLR GRGVRELFPP QVEAVKAGLF 
DGRSVLLCTA TASGKSLLAE VASVKAALEG GMALYAVPLK ALAYEKLVHF SHYGGLAKVG
VSTGDFDSED RRLHEYDVLV VTYEKLDSLL RHRPGWLSSV KTVVVDEIHY LGDPRRGPVL
ESIIAKLKHL GLRAQFIGLS ATVGNAAEVA EWLGARLVAS SWRPVPLREG VYHGGRIYYP
DGTYRRVEAA GDAEVALAVD AVAGGGQALV FASSRSSTVR IAKAVAKAIA AHPARLIDVA
EASRLAGEVA RASSSKIIGR ELAELVARGV AFHNAGLELE VRRLVEEGFR RGVVKVVVST
TTLAAGVNLP ARRVVVADFE RFDPVVGREE IPVLEYKQMA GRAGRPGLDA VGEAVLVARS
KNEVGYLMER YVRGQVEEVR SHILAGPNLR AHVLGAVGGG YARSIDELVD FFSNTLGYHQ
ARTAVKSALL RSRVGAAVEE LEEWGFLERD GERVYATELG RQVARLYLDP ETAARYIELI
KSLRGGNTSA YLYVVLTARD FPRVRRGRAD RRVAEEVLSV LDVEEDEEFE EVVKTASVLT
AWIEEEDEDA IFNRFEVAPG DLRVYIDLFE WLGNAAAKLA GMLGLEEHRR GLEVVTARVI
YGVREELLEL VTALRGVGRV RARALYNFGY RTVKDLAKAS VREIASLPGF GERLAESIIE
QARQLAGLS
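
As a consistency check, the first and last rows of the gene sequence can be translated and compared with the ends of the protein sequence. The sketch below builds the codon table from the standard NCBI base ordering; translation table 11 uses the same codon assignments as the standard code and differs only in the permitted start codons. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated with bases T, C, A, G,
# third position varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGTCGAAAA TCATGGCGGG GGTTTTCTCA CAGCCGAAGA GGAAAAATAC CCCCGCCCCG"
last_row = "CAGGCCCGGC AGCTGGCTGG GCTATCGTGA"
print(translate(first_row))  # MSKIMAGVFSQPKRKNTPAP
print(translate(last_row))   # QARQLAGLS
```

Both results match the first twenty and last nine residues of the protein sequence listed above. The last row is in frame because the 36 preceding rows of 60 bases total 2160 bases, a multiple of three.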