Gene Tneu_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0451 
Symbol 
ID6166005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp408131 
End bp409111 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID641667608 
Productarsenite-transporting ATPase 
Protein accessionYP_001793844 
Protein GI171184925 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00229332 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000282817 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGCAGC TCCTAGACCA GAGGGTTAAG TACATCTTCT TTGGGGGGAA GGGGGGCGTG 
GGCAAGACCG TGGTGGCGGC GGCGACGGCG CTCTACCTGG CAGAGTCTGC GGGGGAGAGG
ACCCTCCTCG CCTCCTTCAA CCCCGTGCAC TCCCTATCCT CGGTCTTCGG CCAGGACCTC
TCGGGGGGCG TAGTTAAGGA GGTTCGGGGG GTGAGGAACC TGTGGGCTGT GGAGGTGCAG
TACGACGACA TCGTGGAGAA GTACAAGGCG AGGATCTCGA ACCTGCTTAG GGAGATGCTT
AAGATGGCGG AGCTTTCTGT GGACATCAAA CCGCTTATAG ACATAGCCAC CACCAACCCG
GCCTTCCACG AGGCGGCCTC CTTCGACAAG ATGATGGACG TGGTTCTCAA GGAGGGGTCG
AAGTTCGACC GGGTTATCTT CGACATGGCG GCTGTGGCCA ACGCGGTGCG GCTGATCGGC
CTCTCCAAGC TCTACGGCGC CTGGCTTCAG AGGACTATAA AGATGAGGAT GGAGACCCTC
TCCCTGAAGG AGCAGCTGTC TTTCCGCAAG GAGAAGGTTA GGGAGGAGAT AGAGAGGGAC
CCCGTGTTGG CCGAGCTGAA GGACTTATAC AGCCGCTATA TGGAGGTAAG GAAGGTGCTG
ACCGACCCGG CCCAGACGCG GTTTGTCTTC GTCACGATAC CCACGGTGCT CTCCATCTCG
GTGGTGCAGA GGTTTATAGA GATGGTGAAG GCGTACGAGA TACCCTTCGG CGGGGTGGTT
GTGAACATGG TGATTCCGGG GGAGGAGGCG GCTAGGGACG CCACGGGCTT CCTCAGGAGC
AAGTACGAGG AGCAGCAGAG GAACCTCGAG GTGATTAGGC AGTCCTTCTC GCCGCATATC
CTCGCCTCGG TTAGGCTTTT CCCCGAGGAC ATAGTAGGCC TTGAAAGGCT GAGGCAGTTC
GTGGCCGAGC TCGTTAGATG A
 
Protein sequence
MRQLLDQRVK YIFFGGKGGV GKTVVAAATA LYLAESAGER TLLASFNPVH SLSSVFGQDL 
SGGVVKEVRG VRNLWAVEVQ YDDIVEKYKA RISNLLREML KMAELSVDIK PLIDIATTNP
AFHEAASFDK MMDVVLKEGS KFDRVIFDMA AVANAVRLIG LSKLYGAWLQ RTIKMRMETL
SLKEQLSFRK EKVREEIERD PVLAELKDLY SRYMEVRKVL TDPAQTRFVF VTIPTVLSIS
VVQRFIEMVK AYEIPFGGVV VNMVIPGEEA ARDATGFLRS KYEEQQRNLE VIRQSFSPHI
LASVRLFPED IVGLERLRQF VAELVR