Gene Haur_2847 details


Gene Information

Locus tag: Haur_2847
Symbol:
ID: 5736884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3611454
End bp: 3612629
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 50%
IMG OID: 641279990
Product: phosphopentomutase
Protein accession: YP_001545613
Protein GI: 159899366
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1015] Phosphopentomutase
TIGRFAM ID: [TIGR01696] phosphopentomutase
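
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that are not stated in the record: that the coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(3611454, 3612629)
print(length)                              # 1176
print(expected_protein_length_aa(length))  # 391
```

Both values agree with the Gene Length and Protein Length fields, which suggests the assumptions hold for this record.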


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATATTA AACGGGTAAC GGTGATTGTT TTGGATGGTG TGGGAATTGG CGAAGCACCT 
GATGCCGACG AATATGGCGA TGTAGGCAGC CACTCGTTGG CCAACACAGC CGCAGCTATC
AATGGGCTTG ATCTGCCCAA TATGGCAGCA CTAGGGCTTG GTTGCATTAG CGAAATGCAG
GGTGTCGCCT GCCCTGAATC TTTTAGTGGC AGCTATGGCA AAATGCAACC GCTATCAAAA
GGCAAAGATA CAGTTTCAGG TCACTGGGAG ATGATGGGCA TCGTCTTGCC AACACCATTT
CCGGTGTATC CCGATGGCTT TCCGGCAGCA GTAATTGAGC CATTTAAACA AAAAATTGGG
CGTGGCGTGC TGGGCAATAA ATCAGCCTCA GGCACTGATA TTCTTGAAGA ATTAGGCATG
GAACACATTC GCACGGGTGA TCCAATTGTC TATACCTCAG CTGATAGTGT TTTTCAAATT
GCGGCCCATG AAGACGTGAT TACTCCCAAA GAGTTGTATG CAATGTGCGA AATCGCGCGT
GAGATCTTGG TTGGAGAGCA TGCGGTTGGG CGGGTGATTG CGCGGCCATT TATTGGCGAT
AGTCCCGAAA CCTTTAAGCG CACCATTCGC CGCCACGATT ATGCCCTGAC CCCAGAAACC
CCCACCATTT TGGATAAAGT AGTAGCGGCA GGCAAACAAG TCTATTCAGT TGGCAAAATC
GATGATATTT TTGGCAATCG CGGCATCAGC GTTTCCAACC ATACCGTTGA TAATGCGGCC
AGTTTAGAGG CAGTGCTTGA ATTTCTCGAT GTGGATTTTG AAGGGCTGTT GTTTGCCAAC
TTCATCGAGT TTGATATGAT CTATGGCCAT CGCAATGATC CGGTTGGCTA TGCCAACGCC
TTGAAGGCAG TTGATCAGCG TTTGCCTGAG CTACAAGCCA AATTACGGGC GGGCGATCTG
GTGGTAATTA CCGCCGATCA TGGCGTTGAC CCAACCACCC CTGGCTCGAA CCACAGCCGT
GAATATGTTC CGCTATTGGT TTTTGGCCCC GAAGTGCGTA GCGGAGTCAA TTTGGGAACT
CGTCAGACCC TGAGCGACTT GGCGGCAACG ATTGCTGAGA TTTTCGGGCT AGAGCAACCA
CTGCATGGCA CAAGTTTCCT CAGTGAACTA CAATAA
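
The GC content field in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C among the A, C, G, and T bases; the helper name `gc_fraction` is hypothetical, and how the source database rounds the value is not stated.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# Usage: paste the full gene sequence (blocks of 10 with spaces are fine).
# print(round(100 * gc_fraction(gene_sequence)))
```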
 
Protein sequence
MDIKRVTVIV LDGVGIGEAP DADEYGDVGS HSLANTAAAI NGLDLPNMAA LGLGCISEMQ 
GVACPESFSG SYGKMQPLSK GKDTVSGHWE MMGIVLPTPF PVYPDGFPAA VIEPFKQKIG
RGVLGNKSAS GTDILEELGM EHIRTGDPIV YTSADSVFQI AAHEDVITPK ELYAMCEIAR
EILVGEHAVG RVIARPFIGD SPETFKRTIR RHDYALTPET PTILDKVVAA GKQVYSVGKI
DDIFGNRGIS VSNHTVDNAA SLEAVLEFLD VDFEGLLFAN FIEFDMIYGH RNDPVGYANA
LKAVDQRLPE LQAKLRAGDL VVITADHGVD PTTPGSNHSR EYVPLLVFGP EVRSGVNLGT
RQTLSDLAAT IAEIFGLEQP LHGTSFLSEL Q
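
The protein sequence can be derived from the gene sequence using translation table 11. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G) and translates up to the first stop codon. It assumes the sequence is given on the coding strand and starts in frame; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence and its last two codons.
print(translate("ATGGATATTA AACGGGTA"))  # MDIKRV
print(translate("CAATAA"))               # Q
```

The two outputs match the first six residues and the final residue of the protein sequence above.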