Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0450 |
Symbol | |
ID | 6165961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 407130 |
End bp | 408134 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641667607 |
Product | arsenite-transporting ATPase |
Protein accession | YP_001793843 |
Protein GI | 171184924 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0003] Oxyanion-translocating ATPase |
TIGRFAM ID | [TIGR00345] arsenite-activated ATPase (arsA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0025028 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000316077 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGGCC TCGGCGGTCT CCTTGAGAGG AACCCCCGCC TCAAGGTCTT TATCTACGCG GGGAAGGGGG GGCTTGGGAA GACTACGCTT AGCGCGGCGA CGTCGGTTAA GCTGTCTAGC CTTGGGAAGA AGACGCTTGT GTTTAGCACG GATCCTCAGG CGTCGCTGAG CGACGTGTTT GAGCAGAACG TGTTTGGGAG GGGGGAGGTT AAGCTGGCGG AGAACCTCTA CGTGATGGAG ATCGACGCCG ACAAGAAGAT TAACGAGTAC GTGGCCTCCA TCAAGAAGAA GATCGTCGAT ATGTACCGCC TTGATAAGCT TCCTCCAGAC ATCGAGGAGT ATATCGACAG CGCGGCGGCT GAGCCGGCGA TGTACGAAAG CGCTGTTTAC GACGCCATGG TGGACGTGGT GTCTGAGGGG AGGTACGACT ACTACGTCTT CGATATGCCT CCCTTTGGCC ACGGGATTAG GATGATCGCC ATCGCTGACG TCATCAGCAA GTGGGTGGAG AAGATCACCG AGCTTAGGAG GCAGGCCTAC GAGTACGGCC GTGTGGCGGC TTCGCTTAAG AAGCAGAAGT TGACCTACGA GGACGAGATC TTGAGGGAGC TGGAGTACAT CAGGGGGCGT ATCCTCAAGT TCCGCGACAT AGTTATGAAC TCCGAGACGA CGGCTTTTAT GACGGTCATG ACGCCGGAGA GGATGACCAT CCTCGACACT GAGAAGGCGC TGGAGATGTT CGAGTCGCTG GGTCTGAGGC TGACGGGGAT AGTGGTTAAC CAGGTGTATC CGCCTGAGCT GGCTAAGAAC CCCGACGCCC CGGCCTACAT TAGGCGTAAG GTGGAGGAGC AGCGGAAGTA CATGGCCGAG ATCGCCGACA AGTTCGGGAA GTACATCATC GCGGTGGTGC CCATGTTGAA CAGGGAGCCG AAGGGCCTCG ACACGCTTAA GGCCGTGGCG GAGGAGCTCT GGAGGCCGAG CAGGAGGCTG GAGGAGTACA TATGA
|
Protein sequence | MIGLGGLLER NPRLKVFIYA GKGGLGKTTL SAATSVKLSS LGKKTLVFST DPQASLSDVF EQNVFGRGEV KLAENLYVME IDADKKINEY VASIKKKIVD MYRLDKLPPD IEEYIDSAAA EPAMYESAVY DAMVDVVSEG RYDYYVFDMP PFGHGIRMIA IADVISKWVE KITELRRQAY EYGRVAASLK KQKLTYEDEI LRELEYIRGR ILKFRDIVMN SETTAFMTVM TPERMTILDT EKALEMFESL GLRLTGIVVN QVYPPELAKN PDAPAYIRRK VEEQRKYMAE IADKFGKYII AVVPMLNREP KGLDTLKAVA EELWRPSRRL EEYI
|
| |