Gene Information |
Locus tag | Haur_1026 |
Symbol | |
ID | 5732930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1170873 |
End bp | 1172138 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278161 |
Product | chloride channel core |
Protein accession | YP_001543802 |
Protein GI | 159897555 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0038] Chloride channel protein EriC |
TIGRFAM ID | |
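The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by the source. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that the length fields of a CDS record agree with each other.

    Assumes 1-based inclusive coordinates and a protein length that
    does not count the terminal stop codon (assumptions, not source facts).
    Returns a tuple (span_ok, protein_ok).
    """
    # Inclusive coordinates: the span covers both the start and end base.
    span = end_bp - start_bp + 1
    span_ok = span == gene_length_bp

    # A complete CDS is a whole number of codons; the last one is the stop.
    codons = gene_length_bp // 3
    protein_ok = gene_length_bp % 3 == 0 and codons - 1 == protein_length_aa
    return span_ok, protein_ok


# Values taken from the Haur_1026 record above.
result = check_gene_record(1170873, 1172138, 1266, 421)
print(result)  # (True, True)
```

For this record, 1172138 - 1170873 + 1 = 1266 bp, and 1266 / 3 = 422 codons, of which 421 encode amino acids.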
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.724519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTGGGAA CTATTATTAC GCTGATGCGC CAAGTTGGGC GCTGGACTGG ATTAGCTTTG GTGATAGGCT TTGCTGCTGG TGTGGCAAGT GCCTTCTTTT TAGTCAGTTT GGCTTGGGTG ACCAGCTTGC AAACCACCAA TTGGTGGCTA TTACTCAGTT TGCCGTTGCT TGGTGGCTTG GTGGCTTGGC TCTATCAGCG TTTTGGCACA AGCGTGGCGG CTGGCAATAA TCTGATCATT GAACAACTGC ATAACCCTGA TTCGGCGGGA ATTCCCTTGC GCATGGCTCC GTTGGTGTTG CTCGGAACCT TGTTGACCCA TCTTGGTGGT GGCTCGGCTG GGCGCGAAGG GACGGCAGTG CAAATGGGCG CGAGTTTGGC AGCGCGAATC GGGCGTTGGT GGCGCTTGCC CCAAGCTGAA TGGCGTTTGG TAGTAATGAT GGGCATCAGC GCGGGGTTCA GCAGCGTCTT TGGTACGCCG ATAGCCGGCA CAATTTTCGC CATGGAAGTG CTAGCATTTG GGGTTTTGCG CTATGAAGCG TTGTTGCCAT GTTTGGTGGC AGCCTTAGTT GGCGATGGAG TGGTGCGTTG GCTGAATGTT GCCCATAGCC ATTATCATGT AGGCAGTGTA CCCGATTTGA GCATGATCTG GGCGATTTTA GTTGGAGCAG GAATTTGCTT TGGCTTAGCC AGCAGTGCGT TTGCGATCTG GACTGAATTG GTGCAAACCT GGTCACGGCG TTGGCTGCCC AACCCAATTT TGCGGGCAGT GGCTGGTGGC GGGCTGATTG TAAGCATCAG TTTTTTGTTG AATACGCGTG ATTACAATGG GCTAAGCTTG CCGTTGCTGG CGCAGGCTTT TGAGCCACGA GGCGTTGTAT TTTGGGCTTT TGCACTCAAA TTATTGTTAA CTGGCTTGAC CTTGGGCGTG GGCTTTAAAG GTGGCGAAGT AACGCCATTG TTTGTGATTG GCGCGACGCT TGGCTCAGCT TTAGCGCAAT TATTTGGTGT GCCAACTGAT CTGTTGGCGG CATTGGGCTT TATTGCAGTG TTTGCTGGGG CTGCTAATAC CCCGATTGCC TGTGTCTTGA TGGGAGTTGA ATTATTTGGC TCGGCCTTGC TTGGGCCTTT GATGCTCACA ACCTGCATTG CCTACGCCAT TTCGGGCCAT CGCGGAATCT ACGCGGCGCA ACGGGTTGGC TGGGCCAAAC GCCAGCATTT GGCACATCAA ACTAGCAAAC GGCTTGATCA ACTGCATATT GATTAA
Protein sequence | MLGTIITLMR QVGRWTGLAL VIGFAAGVAS AFFLVSLAWV TSLQTTNWWL LLSLPLLGGL VAWLYQRFGT SVAAGNNLII EQLHNPDSAG IPLRMAPLVL LGTLLTHLGG GSAGREGTAV QMGASLAARI GRWWRLPQAE WRLVVMMGIS AGFSSVFGTP IAGTIFAMEV LAFGVLRYEA LLPCLVAALV GDGVVRWLNV AHSHYHVGSV PDLSMIWAIL VGAGICFGLA SSAFAIWTEL VQTWSRRWLP NPILRAVAGG GLIVSISFLL NTRDYNGLSL PLLAQAFEPR GVVFWAFALK LLLTGLTLGV GFKGGEVTPL FVIGATLGSA LAQLFGVPTD LLAALGFIAV FAGAANTPIA CVLMGVELFG SALLGPLMLT TCIAYAISGH RGIYAAQRVG WAKRQHLAHQ TSKRLDQLHI D
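The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation codons, and it does not handle table 11's alternative start codons (not needed here, since the gene starts with ATG). The names `CODON_AA`, `BASES`, and `translate` are hypothetical. Only the first 30 and last 15 bases of the gene sequence above are used.

```python
# Amino acids for all 64 codons, ordered with bases T, C, A, G
# at the first, second, and third positions ('*' marks a stop codon).
BASES = "TCAG"
CODON_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate(dna):
    """Translate a + strand CDS into protein, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not form a full codon are skipped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        aa = CODON_AA[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGTTGGGAA CTATTATTAC GCTGATGCGC"))  # MLGTIITLMR
# Last five codons of the gene sequence, including the TAA stop.
print(translate("CTGCATATTGATTAA"))  # LHID
```

The two outputs match the first ten and last four residues of the protein sequence listed above.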