Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3335 |
Symbol | |
ID | 5735205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4203546 |
End bp | 4205408 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280482 |
Product | sodium/hydrogen exchanger |
Protein accession | YP_001546099 |
Protein GI | 159899852 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0025] NhaP-type Na+/H+ and K+/H+ antiporters [COG0569] K+ transport systems, NAD-binding component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.263671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGCAC ATAGTTTGTT GTTGATGGCG GGGATGGTGA TTTGTTTTGG GATTGCCACC TTCGTGCTGG CCGAACGAAT TGGCATCCCG TCGATTGTAT TGTTGCTGTT GGTGGGAGTG CTGGTTGGCC CCGAAGTTTT CGGCTTGGTT GATCCATCGG CCTTAGGGGT TGGGCTGCGG GTGTTGATTC CGCTGTTTGT GGCGATTATT GTGTTTGAAG GTGGTTTGGT GCTCGATATC AACGATTTGC GCCGTTTATC CTACCCATTG CAAATGTTGA TGAGTGTTGG AGCCTTGGTG ACATGGGGCA GTGCAACCCT GGTAGCACAT TTTGTGGCGG GATTATCGTG GCCACTGGCA ACCCTGTTTG GAGCATTAAT GAGCGTGACT GGGCCAACCG TGATCACGCC TTTGATGCGG TGTAGCAATG CCAAAGAACG CTTAAAAGTG TTGTTACAAG CGGAAGGCGT ATTGGTTGAT GCGGTTGGGG CAATTTTGGC GGTGGTTGTG CTCGATGTTT TGCTTACCAG CGGCACAACG ATTAGTGGAG CCTTCGATTG GGCCGAACGC TTATTGATTG GCTCGTTAAT TGGCTTGATT GGCGGCATTG GCTTAGCCTA TGCGTTGCGC TTTATTGGCT CGAAACTTAA TGCTGAAACC ACCCGTTTGA GCGCGTTGGG TGGCGCAATC GCGATCTTTA TTATTGCCGA AGCAATTTCG CATGAGGCTG GCATTGCGGC AGCAGCAGTT TCGGGGATTG TCGTTGGCAA TATCGATTTT CCCCACGAAG AAGAAGTTGT GTTGTTCAAG GGCGACTTAA CCATGCTGGC AATTACGATT ATTTTTATTT TGCTGGCAGC TCGTTTGAAA TTTGCCGATT TGGCCGCGCT TGGCTGGTGG GGTGTGTTAG CGGTTGTATT GATGATTGTG GTGGTGCGGC CATTGTGTGT CTTTGCTTCG ACGCTTGGCT CAACCTTAAC TTGGCGCGAA CGAGCATTTA TTGCTGCGGT TTGCCCACGC GGAGTTGTCG CCGCTTCAGT TGCCACCTTT GCCGCGATCT CATTGGAAGA AGCCGGATTT GCTGGGGGCA ATTTGTTGAT TGGCTTGGTC TTTATGACCG TGATTGGCAC AGTTGTGTTG CAAAGCTTCA CCACCCCATT GTTTGCCCGT TTGTTAGGAG TTGAACCGAT GACCACTTTA GTTATTGGCG CGAATGATTT GGGCCAGCTG TTTGCCCGCC AACTGCATAG CCAACATCAC GATGTAGTCG TAATCGATAC CGATCAGCAG TTGATCGATC GAGTGCGCCA GCATGGCATT AGCACAATCA CTGGTGATGC GACTGATCTG CAAGTGTTGC GTAAAGCTGG CGTTGAACGG GACAAAGCGG TTGTAGCCTT GACCCCGAGT GATAAACTCA ATCTGTTGGT GAGCCAAGTT GTGCGTTCGC ATTTCAAGGT TACAACAATT GTGGCTCAGG CTGAAAATGA GAGTACCAGC AGTGTTTTGC GCGATCTTGG CATTACGGTT TTGAATCCCT TACAAGCATC GGTTGATGCT TTAACCCAAT TGGTTCAAGG CCCATCAGCC ATGAGCATGC TCTTAAGTCA TCAAGCCGAC CAAAGCATTT ACGAAGTTGA AGTATCCAAT CCGCGGGTGG TGGGTAAGCC GTTGAAACAG CTTAATTTGC CAAATAATGT CTTGATTGTG GCAATTCGGC GGGCGGGCAA CTTATTTGTG CCCGATGGTC AAACCCAATT GCAAGCAGCC GATCAACTAA CCTTGATTGG CACAGCCAGC ACAATTCAAC AGGCTGATCA ACAACTGCGT GAGGGAGCGA GCGAGGCAAT GTATCAAGCA TGA
|
Protein sequence | MDAHSLLLMA GMVICFGIAT FVLAERIGIP SIVLLLLVGV LVGPEVFGLV DPSALGVGLR VLIPLFVAII VFEGGLVLDI NDLRRLSYPL QMLMSVGALV TWGSATLVAH FVAGLSWPLA TLFGALMSVT GPTVITPLMR CSNAKERLKV LLQAEGVLVD AVGAILAVVV LDVLLTSGTT ISGAFDWAER LLIGSLIGLI GGIGLAYALR FIGSKLNAET TRLSALGGAI AIFIIAEAIS HEAGIAAAAV SGIVVGNIDF PHEEEVVLFK GDLTMLAITI IFILLAARLK FADLAALGWW GVLAVVLMIV VVRPLCVFAS TLGSTLTWRE RAFIAAVCPR GVVAASVATF AAISLEEAGF AGGNLLIGLV FMTVIGTVVL QSFTTPLFAR LLGVEPMTTL VIGANDLGQL FARQLHSQHH DVVVIDTDQQ LIDRVRQHGI STITGDATDL QVLRKAGVER DKAVVALTPS DKLNLLVSQV VRSHFKVTTI VAQAENESTS SVLRDLGITV LNPLQASVDA LTQLVQGPSA MSMLLSHQAD QSIYEVEVSN PRVVGKPLKQ LNLPNNVLIV AIRRAGNLFV PDGQTQLQAA DQLTLIGTAS TIQQADQQLR EGASEAMYQA
|
| |