Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5087 |
Symbol | |
ID | 5737045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 112002 |
End bp | 113030 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641282252 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001547843 |
Protein GI | 159901597 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGCTG TGTCACGCCC AACACACCGC GTCCGTCGCA TCGGATCGTT CACGACAATC CTGCTGATCC TGCTAGCAGC CTGTAGCACC CAGACAGAAC CGACTCCGGT TCCCATGGAT GCGGTTACCC TCCAACTCAA CTGGGTCAAT GACTTTTCCT CAGCGGGCTT TTTTGCAGCG GAAAAGAACG GACGCTTTGC CGACCAACGC CTGCAGGTCA CCTTGCGCGA GGGTGGCTTT GATGCCAATG GCTATATTGA TGGCACCGAA CAAGTCAGTA GCGGTGCGGC TGATTTTGGG GTGGCCAGCG CCGATAGTAT CCTTCACGCC CGTGCCCAAG GAAAACCAAT TGTTGGGATT GCGGTGTTGG CGCAAGATAG TCCGCTGGCG ATTCTCTCGC TTCCTGCGAC CAATATTCGC ACGCCCCGCG ATTTAATTGG CAAACGGGTG TTGGTTTCCG AGGGCGGAGC AACCCAACTC TATACCACGT TGCTTGCTTC TCAATCCATC GATATTGCCC AAGCTCCACC CATTCCCCGC ACCGATTCAG GGATTAATCA GCTGATTGCT GGTGAGATTG ATGCCCTCGT CGCATGGAAT GTCAACGAAG CGATTGAATT AAGTGAACTT GGCTACCCAC CATCGGTTAT GCGGTTCAGC GATTATGGCA TCAATAGCTA TGAATTGGTC GTGATCACCT CGGAGCGCCT CGTCACCGAG AATCCCGATC GCGTCACTCG GTTTCTCAAG GCCGTCCTGC AAGGGTGGAA GGATGTCATC CTTAGTCCCG CCCAAGCGAT TGGCTATGTG AAGGACTATG CCCCGGACGT TGAGCGGGAT GGACAGTTGC AGCGGTTAAG TGTGTTTGTT GAGTTATTAC AACCAGCACA AACCAAACTC GGCGATATGC TGCCTGAACG CTGGGCATTT ACCCAGACGA TGTTGCAAAC CCAAGGGGTG CTCACGACCC CCATTGACCT TAATCGTGCC TACACGACCA CATTCCTTGA ACAGTTGCCA GATCGCTAA
|
Protein sequence | MHAVSRPTHR VRRIGSFTTI LLILLAACST QTEPTPVPMD AVTLQLNWVN DFSSAGFFAA EKNGRFADQR LQVTLREGGF DANGYIDGTE QVSSGAADFG VASADSILHA RAQGKPIVGI AVLAQDSPLA ILSLPATNIR TPRDLIGKRV LVSEGGATQL YTTLLASQSI DIAQAPPIPR TDSGINQLIA GEIDALVAWN VNEAIELSEL GYPPSVMRFS DYGINSYELV VITSERLVTE NPDRVTRFLK AVLQGWKDVI LSPAQAIGYV KDYAPDVERD GQLQRLSVFV ELLQPAQTKL GDMLPERWAF TQTMLQTQGV LTTPIDLNRA YTTTFLEQLP DR
|
| |