Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4527 |
Symbol | |
ID | 5736378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5792672 |
End bp | 5793931 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281689 |
Product | arsenical pump membrane protein |
Protein accession | YP_001547286 |
Protein GI | 159901039 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.142525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTGC TACTTGGTGG AACAATTTTT GGAGCCACGC TCTTAGGCGT GATGGTTCGT CCGCGCAATA TTTCGGAAGC TTGGGTCGCG TTACTTGGCG CAGTTGCCAT GTTATTGGTC GGAATTTTGC CACTGAAGGC AATTTTGCCG ACGCTTGCCC GCGAATGGAA TGTGTATGGT TTTTTTGCCG GTTTGATGCT GATTGCCTTT TTTGCCGAGC AAGCTGGGGT GTTTCAGGCC TTGGCGTTGC AGGCAGCACG TTGGGCCAAT GGCTCGGCGC AGCGGCTCTA TTTGGCGGTC TTTTTGGTGG GCACGCTGAT TACAGCGGTG CTCTCCAACG ATGCGACAGC CTTGATTTTA ACTCCAGTGG TTTGGACGTT GGCCAGCCGT TTGCGCTTGC CAGCCTTGCC ATTTATGTTT GCCTGCACCT TTATTGCCGA TACTGCTTCG GCATTGCTGC CCGTTTCCAA TCCGATTAAT ATTTTGGTGC TAACCCGCTT TAATCGTGAG CTGTTGGAAT ATTGGGCCTA TTTGTTGGTT CCGTCACTGG TGTGTATTGG CTGGAATATT GGGCTATTTG CTTGGCTGTT TCGGCGCGAT TTGCAGGGCA GCTACGATTT GGCACTGCTT GATGATTTAA CTATCGCCAA CCCACGCCTT TATCGGACAA CCCTTGTTGG CTTGGGCAGT ATTGCTGTGG CCTATGTAGC TGGCTCGTTG TGGCACGTGC CATTGGCCTT TGTGGCGTTG GCTGGAGCCG CTCTCTTAGC GGCAATTAGC TGGTGGAATG GCACGTTTAA GCCCAAACAA GCCTTGCACG AATTATCTCC AGCCTTGTTT GGCTTTATTA GCGGCATGTT TTTGGTGGTA CGGGCGATTG AGCAATTAGG CTGGACTGAA CGCTTTGGCG CGAGTTTGTT ACAGGGCAGC GGCGCGAGTT TGGGCAATAT TGCGCGGGTA ATTTTCGGTA GTGCACTTGG CTCGAACATG ATCAACAATG TGCCGATGAC CTTGGTGATG ACCTCGACGC TTGAACATTT GCCTAGCACA CCTGAGCCTG CGTTGATTTA TGCCACCATT TTTGGGGCTG ACCTTGGGCC AAATTTAACA ATTGTTGGCT CGTTGGCTAC AATGCTGTGG TTGGTGATTT TACGGCGCAA AGGTCTGGAA ATTAGTGCCA AACAATACTT TAAATTGGGC TTGCTGTTTG TGCTACCATC CTTATTAATT GGTACATTTT GGATGTGGCT GATGGCATGA
|
Protein sequence | MQLLLGGTIF GATLLGVMVR PRNISEAWVA LLGAVAMLLV GILPLKAILP TLAREWNVYG FFAGLMLIAF FAEQAGVFQA LALQAARWAN GSAQRLYLAV FLVGTLITAV LSNDATALIL TPVVWTLASR LRLPALPFMF ACTFIADTAS ALLPVSNPIN ILVLTRFNRE LLEYWAYLLV PSLVCIGWNI GLFAWLFRRD LQGSYDLALL DDLTIANPRL YRTTLVGLGS IAVAYVAGSL WHVPLAFVAL AGAALLAAIS WWNGTFKPKQ ALHELSPALF GFISGMFLVV RAIEQLGWTE RFGASLLQGS GASLGNIARV IFGSALGSNM INNVPMTLVM TSTLEHLPST PEPALIYATI FGADLGPNLT IVGSLATMLW LVILRRKGLE ISAKQYFKLG LLFVLPSLLI GTFWMWLMA
|
| |