Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2847 |
Symbol | |
ID | 5736884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3611454 |
End bp | 3612629 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279990 |
Product | phosphopentomutase |
Protein accession | YP_001545613 |
Protein GI | 159899366 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1015] Phosphopentomutase |
TIGRFAM ID | [TIGR01696] phosphopentomutase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATATTA AACGGGTAAC GGTGATTGTT TTGGATGGTG TGGGAATTGG CGAAGCACCT GATGCCGACG AATATGGCGA TGTAGGCAGC CACTCGTTGG CCAACACAGC CGCAGCTATC AATGGGCTTG ATCTGCCCAA TATGGCAGCA CTAGGGCTTG GTTGCATTAG CGAAATGCAG GGTGTCGCCT GCCCTGAATC TTTTAGTGGC AGCTATGGCA AAATGCAACC GCTATCAAAA GGCAAAGATA CAGTTTCAGG TCACTGGGAG ATGATGGGCA TCGTCTTGCC AACACCATTT CCGGTGTATC CCGATGGCTT TCCGGCAGCA GTAATTGAGC CATTTAAACA AAAAATTGGG CGTGGCGTGC TGGGCAATAA ATCAGCCTCA GGCACTGATA TTCTTGAAGA ATTAGGCATG GAACACATTC GCACGGGTGA TCCAATTGTC TATACCTCAG CTGATAGTGT TTTTCAAATT GCGGCCCATG AAGACGTGAT TACTCCCAAA GAGTTGTATG CAATGTGCGA AATCGCGCGT GAGATCTTGG TTGGAGAGCA TGCGGTTGGG CGGGTGATTG CGCGGCCATT TATTGGCGAT AGTCCCGAAA CCTTTAAGCG CACCATTCGC CGCCACGATT ATGCCCTGAC CCCAGAAACC CCCACCATTT TGGATAAAGT AGTAGCGGCA GGCAAACAAG TCTATTCAGT TGGCAAAATC GATGATATTT TTGGCAATCG CGGCATCAGC GTTTCCAACC ATACCGTTGA TAATGCGGCC AGTTTAGAGG CAGTGCTTGA ATTTCTCGAT GTGGATTTTG AAGGGCTGTT GTTTGCCAAC TTCATCGAGT TTGATATGAT CTATGGCCAT CGCAATGATC CGGTTGGCTA TGCCAACGCC TTGAAGGCAG TTGATCAGCG TTTGCCTGAG CTACAAGCCA AATTACGGGC GGGCGATCTG GTGGTAATTA CCGCCGATCA TGGCGTTGAC CCAACCACCC CTGGCTCGAA CCACAGCCGT GAATATGTTC CGCTATTGGT TTTTGGCCCC GAAGTGCGTA GCGGAGTCAA TTTGGGAACT CGTCAGACCC TGAGCGACTT GGCGGCAACG ATTGCTGAGA TTTTCGGGCT AGAGCAACCA CTGCATGGCA CAAGTTTCCT CAGTGAACTA CAATAA
|
Protein sequence | MDIKRVTVIV LDGVGIGEAP DADEYGDVGS HSLANTAAAI NGLDLPNMAA LGLGCISEMQ GVACPESFSG SYGKMQPLSK GKDTVSGHWE MMGIVLPTPF PVYPDGFPAA VIEPFKQKIG RGVLGNKSAS GTDILEELGM EHIRTGDPIV YTSADSVFQI AAHEDVITPK ELYAMCEIAR EILVGEHAVG RVIARPFIGD SPETFKRTIR RHDYALTPET PTILDKVVAA GKQVYSVGKI DDIFGNRGIS VSNHTVDNAA SLEAVLEFLD VDFEGLLFAN FIEFDMIYGH RNDPVGYANA LKAVDQRLPE LQAKLRAGDL VVITADHGVD PTTPGSNHSR EYVPLLVFGP EVRSGVNLGT RQTLSDLAAT IAEIFGLEQP LHGTSFLSEL Q
|
| |