Gene Information |
Locus tag | Haur_1579 |
Symbol | |
ID | 5733466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1835745 |
End bp | 1836773 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278718 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001544350 |
Protein GI | 159898103 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
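The coordinate and length fields above can be cross-checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1835745, 1836773)
print(length, protein_length(length))  # 1029 342
```

Under these assumptions the result agrees with the Gene Length (1029 bp) and Protein Length (342 aa) fields.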
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00570901 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTATC GTCGTATCTT AGCCGTGGTT GGGTTATTCT TTTTGGCGGC TTGTGGTGGG CAAACTGCTA CGCCAACCGC CGTTACGGGC AATGATCAAG CGAAACCGCT AACCAAAGTA ACGATTGCCA TGCCCTATGT GCCGAATATT CAATTTGCCC CGTTTTATTT GGCCAAAACC CAAGGCTACT ACGAAGCTGA AGGCTTAGAT GTAACCTTCG ATTATCAATA TGAAACTGAT TCGGTGCAGC GTGTAGCTAA TGGTTCTGTT CAATTTGGCA TGGCTGGCGG CGATTCGGTG CTGCTAGCAC GAGCGCAAGG CTTGCCTATT ATGACTGTTG CAACGATCAG TCAACGCTCG CCGATTGTTT TTTATAGCAA AGCTGAGCTG AATATCAAAA CTCCAGCCGA TCTCAAAGGC AAAAGTGTTG GGATTCCAGG CCGCTTTGGG GCTTCGTATA TTGGTTTGTT GGCGTTGATG TATTCAAATT CCTTGCAAGA GAGCGATTTG AACATTCAAG AAATTGGCTT TGCCCAAGTT CAAGCGCTGA GCGAAGATAA AGTGCAGGTT GCCAGTGGCT ATGGCAATAA CGAGCCAATT CAGTTGGCCG AGGCTGGGGT TAAATTAAAT GTTATTCGGG TGTCGGATTC GTTTGCCTTG ACCTCTGATG GCCTTATTGT CAGTGAAAGC TTGATTAAAG AGCAACCCAC GGTGGTTATG GGCTTTGTCA AAGCCACATT AAAAGGCATG AGCGCTACGA TTGCTGATCC GACGCAGGCC TTTAATAGTA GTTTGCGTGA AATTCCCGAG CTGCAAGCGG CTGATGATGC GACCAAAGCC TTGCAACAAA AAGTTTTAGC TGAAACAATT GGCTATTGGC AAAGCGATTC GACTGCTAAA TATGGCCTTG GGTTTACTGA TCAGGCCACT TGGCAAGCGA CTCACGATTT CTTGCGCCAA CAAAATATTC TCAAACAAGA TGTTGCAGTG GGCGAGTCGT TTGTGAATGG GTTTATTGCT ACACCCTAA
|
Protein sequence | MRYRRILAVV GLFFLAACGG QTATPTAVTG NDQAKPLTKV TIAMPYVPNI QFAPFYLAKT QGYYEAEGLD VTFDYQYETD SVQRVANGSV QFGMAGGDSV LLARAQGLPI MTVATISQRS PIVFYSKAEL NIKTPADLKG KSVGIPGRFG ASYIGLLALM YSNSLQESDL NIQEIGFAQV QALSEDKVQV ASGYGNNEPI QLAEAGVKLN VIRVSDSFAL TSDGLIVSES LIKEQPTVVM GFVKATLKGM SATIADPTQA FNSSLREIPE LQAADDATKA LQQKVLAETI GYWQSDSTAK YGLGFTDQAT WQATHDFLRQ QNILKQDVAV GESFVNGFIA TP
|
| |
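The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the method the database uses. It assumes the sequence is copied with its 10-base spacing intact and contains only A, C, G, and T. Translation table 11 assigns the same amino acid to each codon as the standard table (the two differ only in the permitted start codons), so a standard codon map is used here. The names `CODON_TABLE`, `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon-to-amino-acid map in NCBI base order (T, C, A, G).
# Table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCGTTATC GTCGTATCTT AGCCGTGGTT"
print(translate(prefix))  # MRYRRILAVV
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.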