Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0404 |
Symbol | |
ID | 5731972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 474807 |
End bp | 475691 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277527 |
Product | NmrA family protein |
Protein accession | YP_001543183 |
Protein GI | 159896936 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0702] Predicted nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAATA CCCAAGGTTT GGTAGCGATC ACAGGTGCGG CTGGGCAATT AGGGCGGTTG GTTTTGCAGC AAGTGCTCGA AAAAGTTGCA GCAAATCAAG TTGTGGCAAT CACCCGTGAT CCCGCCAAAT TGGCCGATGT CGCTGCTCAA GGGGTTAAAG TCGTCGCTGG CGATTTCAGC GATCCCGCTG GTCTGACTGC TGCTTTGGCT GGAGTCGAGC GCGTCTTGAT GATCAGTATC GATGTGGTTG GCGGCGAACG TGTGCGCTTG CAAACAGATG CGGTCAAAGC AATCGCTGCG GCTGGCGTGA AGCATTTGGT TTATACCTCG GCAATCAATC CTGCCGAAGC CCCAATGGAA TTTATTCGCG AACACGCTGC GACCGAGCAA GCGATTGTGG CGAGCGGTTT GAGCTATAGT TTCTTGCGCA ACAACTTCTA TTTTGAAACG ATCACCGATA AAATCAAAGG CGCTTTGGCT GGTGGCGTAA TCGCTGCCGC TGCTGGCGAT GCTGCTGCTG GCTTAGTTGC CCGCGCCGAT TGTGCTGCCG CCGCCGCTGC CGCCTTGGTC AGCGATAACA CCGCCTCAGA AGTGTATAAC ATCACTGGCC CGGTGAGTTT GACCCATAAA GCAATCGCTG AAACTGTGGC CGAATTTGCA GGCCGCGAAG TCGCCTACTA TCCAATTGAT GCTGCCAGCG CTCAACAACA ACTTGAGCAA TTTGGGCTGC CCGCCCAAAT CGCTGGTTTT GTCGTCGCGA TCGACCACGA TTTGATCGGC TCAGGTGCGT TGGATCTTGT GACCAACGAC GTTGAGCGCT TGACTGGCAA GCCAGCCCAA TCGTTGGCCG AGTATCTCAA TGCCAATCGC GAGTTGTTTA GCTAA
|
Protein sequence | MANTQGLVAI TGAAGQLGRL VLQQVLEKVA ANQVVAITRD PAKLADVAAQ GVKVVAGDFS DPAGLTAALA GVERVLMISI DVVGGERVRL QTDAVKAIAA AGVKHLVYTS AINPAEAPME FIREHAATEQ AIVASGLSYS FLRNNFYFET ITDKIKGALA GGVIAAAAGD AAAGLVARAD CAAAAAAALV SDNTASEVYN ITGPVSLTHK AIAETVAEFA GREVAYYPID AASAQQQLEQ FGLPAQIAGF VVAIDHDLIG SGALDLVTND VERLTGKPAQ SLAEYLNANR ELFS
|
| |