Gene Information |
Locus tag | Hhal_0309 |
Symbol | |
ID | 4711241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 347588 |
End bp | 348880 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854769 |
Product | arsenical pump membrane protein |
Protein accession | YP_001001905 |
Protein GI | 121997118 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | [TIGR00935] arsenical pump membrane protein |
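
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another under simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates (which the listed values imply) and a CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length(347588, 348880)
print(length)                  # 1293
print(protein_length(length))  # 430
```

Both results match the Gene Length (1293 bp) and Protein Length (430 aa) rows of this record.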
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTTAG CCGCGATTGC GATCTTCGTC GTCACCATCG CCCTGGTCAT CTGGCAGCCC AAGGGCCTGG GGATCGGCTG GAGCGCCACC GCCGGCGCCC TGGTCGCGCT GGCCACGGGC GTCGTCGAGC TCAGCGATGT GCCCATCGTG GTCGATATCG TCTGGAACGC GACCCTGACC TTCGTGTTCA TCATCATCAT CTCGCTGCTG CTCGACGAGG CCGGCTTCTT CGAGTGGGCC GCACTCCACG TAGCGCGCTG GGGCAGGGGC AGCGGTCTGC GCCTCTACGC CTTCATCATC CTGCTCGGTG CCGCGGTGGC GGCGATCTTC GCCAACGACG GCGCGGCGCT GATGCTCACC CCCATCGTGC TGGAGATGCT GCTGGCTCTG GGCTTCACCG CAGGGGCGGC CTTCGCCTTC GTGATCGCCG CCGGGTTCAT CGCCGATACC GCCTCGCTGC CGCTGGTCAT CTCCAACCTG GTGAACATCG TCTCGGCGGA TTTCTTCGGC ATCGGCTTCA AAGAGTATGC GACGGTGATG GTGCCGGTGA CCGTGGTCTC GGTGGTGGCC TCGCTGGCGG TGCTGACCCT GTTCTTCCGC AAGCAGATCC CGCGGGTTTA CGAGACCCAC GGGCTGCGCG ATCCGCAGGA GGCGATCAAG GATCCGGCGG TCTTCAAGGC CGGTTGGGGG GTGCTCGGCC TGCTGCTCTT CGCCTTCCTG GTGATCGAGC CGATGGGGGT GCCCATCTCG GCGATCGTCG GCGTCGGCGC CGCGATCCTG CTCGCCGTGG CGGCGCGGGG CCACGCCATC GCCACGCGGA CGGTCCTCAA GAACGCCCCC TGGCAGATCG TCATCTTCTC GGTGGGCATG TACCTGGTCG TCTACGGCCT GCGCAACGAG GGGCTGACCG GCGAGGTCGC CGGCCTGCTG GATGTCATCG CCGAGCAGGG TGTGTGGGTT GCGACGGTGG GGACCGGCTT TATCGCCGCC TTCCTCGCCT CGGTGATGAA CAACATGCCC GGTGTGATGG TGGTGGCGCT CTCCATCGAT GAATCAGCGG CTACCGGGCT GGTTAAGGAG GCGATGATCT ACGCCAACGT GGTCGGCTCC GACCTGGGGC CGAAGATCAC ACCCATCGGC AGTCTCGCCA CGCTGCTTTG GCTGCACGTG CTGGCGCGCA AGGGGGTGCA CATCACCTGG GCCAAGTACT TCGCCATCGG CATCGTGATC ACGCCGCCGG TGCTGCTGGT CACGCTCCTG GCGCTGGCGG GGTGGCTGAC GGTGTTGCAC TGA
|
Protein sequence | MLLAAIAIFV VTIALVIWQP KGLGIGWSAT AGALVALATG VVELSDVPIV VDIVWNATLT FVFIIIISLL LDEAGFFEWA ALHVARWGRG SGLRLYAFII LLGAAVAAIF ANDGAALMLT PIVLEMLLAL GFTAGAAFAF VIAAGFIADT ASLPLVISNL VNIVSADFFG IGFKEYATVM VPVTVVSVVA SLAVLTLFFR KQIPRVYETH GLRDPQEAIK DPAVFKAGWG VLGLLLFAFL VIEPMGVPIS AIVGVGAAIL LAVAARGHAI ATRTVLKNAP WQIVIFSVGM YLVVYGLRNE GLTGEVAGLL DVIAEQGVWV ATVGTGFIAA FLASVMNNMP GVMVVALSID ESAATGLVKE AMIYANVVGS DLGPKITPIG SLATLLWLHV LARKGVHITW AKYFAIGIVI TPPVLLVTLL ALAGWLTVLH
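
The sequences above are displayed in space-separated blocks of 10 characters, and the gene sequence is given in coding orientation (it begins with ATG and ends with TGA) even though the gene lies on the minus strand. The following is a minimal Python sketch, using only the standard library, for stripping the block spacing, computing a GC fraction to compare with the listed GC content, and reverse-complementing a sequence to map it back to the replicon's forward strand. The helper names `clean`, `gc_fraction`, and `reverse_complement` are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G, and T.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (block spacing is ignored)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return clean(dna).translate(str.maketrans("ACGT", "TGCA"))[::-1]


# First two blocks of the gene sequence shown above.
first_blocks = "ATGCTGTTAG CCGCGATTGC"
print(clean(first_blocks))           # ATGCTGTTAGCCGCGATTGC
print(gc_fraction(first_blocks))     # 0.55
print(reverse_complement("ATGCTG"))  # CAGCAT
```

Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared with the GC content row of this record.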