Gene Tneu_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0450 
Symbol 
ID6165961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp407130 
End bp408134 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content58% 
IMG OID641667607 
Productarsenite-transporting ATPase 
Protein accessionYP_001793843 
Protein GI171184924 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0025028 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000316077 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGGCC TCGGCGGTCT CCTTGAGAGG AACCCCCGCC TCAAGGTCTT TATCTACGCG 
GGGAAGGGGG GGCTTGGGAA GACTACGCTT AGCGCGGCGA CGTCGGTTAA GCTGTCTAGC
CTTGGGAAGA AGACGCTTGT GTTTAGCACG GATCCTCAGG CGTCGCTGAG CGACGTGTTT
GAGCAGAACG TGTTTGGGAG GGGGGAGGTT AAGCTGGCGG AGAACCTCTA CGTGATGGAG
ATCGACGCCG ACAAGAAGAT TAACGAGTAC GTGGCCTCCA TCAAGAAGAA GATCGTCGAT
ATGTACCGCC TTGATAAGCT TCCTCCAGAC ATCGAGGAGT ATATCGACAG CGCGGCGGCT
GAGCCGGCGA TGTACGAAAG CGCTGTTTAC GACGCCATGG TGGACGTGGT GTCTGAGGGG
AGGTACGACT ACTACGTCTT CGATATGCCT CCCTTTGGCC ACGGGATTAG GATGATCGCC
ATCGCTGACG TCATCAGCAA GTGGGTGGAG AAGATCACCG AGCTTAGGAG GCAGGCCTAC
GAGTACGGCC GTGTGGCGGC TTCGCTTAAG AAGCAGAAGT TGACCTACGA GGACGAGATC
TTGAGGGAGC TGGAGTACAT CAGGGGGCGT ATCCTCAAGT TCCGCGACAT AGTTATGAAC
TCCGAGACGA CGGCTTTTAT GACGGTCATG ACGCCGGAGA GGATGACCAT CCTCGACACT
GAGAAGGCGC TGGAGATGTT CGAGTCGCTG GGTCTGAGGC TGACGGGGAT AGTGGTTAAC
CAGGTGTATC CGCCTGAGCT GGCTAAGAAC CCCGACGCCC CGGCCTACAT TAGGCGTAAG
GTGGAGGAGC AGCGGAAGTA CATGGCCGAG ATCGCCGACA AGTTCGGGAA GTACATCATC
GCGGTGGTGC CCATGTTGAA CAGGGAGCCG AAGGGCCTCG ACACGCTTAA GGCCGTGGCG
GAGGAGCTCT GGAGGCCGAG CAGGAGGCTG GAGGAGTACA TATGA
 
Protein sequence
MIGLGGLLER NPRLKVFIYA GKGGLGKTTL SAATSVKLSS LGKKTLVFST DPQASLSDVF 
EQNVFGRGEV KLAENLYVME IDADKKINEY VASIKKKIVD MYRLDKLPPD IEEYIDSAAA
EPAMYESAVY DAMVDVVSEG RYDYYVFDMP PFGHGIRMIA IADVISKWVE KITELRRQAY
EYGRVAASLK KQKLTYEDEI LRELEYIRGR ILKFRDIVMN SETTAFMTVM TPERMTILDT
EKALEMFESL GLRLTGIVVN QVYPPELAKN PDAPAYIRRK VEEQRKYMAE IADKFGKYII
AVVPMLNREP KGLDTLKAVA EELWRPSRRL EEYI