Gene Haur_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0887 
Symbol 
ID5732788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1012762 
End bp1014012 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID641278019 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_001543663 
Protein GI159897416 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAATC TCGTAGATGG CGGCTTTGAT CATGGCTTGC GCGAAATGTG GCAAGCGCAG 
TTTCGCGATG AAGTGTTGGG CAATATTTTG CCATTTTGGG CAAACCACAC GCTTGATCAT
GAGCACGGTG GCTTTTATGG CGGCTTAACC AACGATCTGA CAATCCATAA TGAGTTTCCT
CGCTCAGCCG TATTATGTGG TCGGATTCTC TGGACGTTTG CCTGTGCCTA TCGCATGCTC
GGCGATCCCC GCGATTTGGC GGTGGCTGAG TATGCCTATG CCTACCTCAA ACAGGCTTTT
TGGGATCAAA CTTATGGCGG ATTGTATTGG TCGATCGATG CCAACGGCCA GCCTTTGGCC
GACCATAAAC AAACCTATGC TCAATCGTTC GCCATCTACG GGTTGGCCGA GTATGTTCGA
GCTACAGGCG ATCAGAGCGC CTTGGAATTG GCGCAAACCC TGTTTCACTT GATCGAAAAT
CATGCCTTTG ATGCGGTTTA TGGTGGCTAT ATCGAGGGCT GTGATCGGGT TTGGCAACCC
TTGGGCGATA GCCGCCTGAG CAAGCTTGAG CCAGAAGCCC GCAAAACCAT GAACACCATG
TTGCATATGA TGGAGGCCTA TGCCAACTTG CTGCGAGTTT GGGATGTGGC CGATGTGCGC
CAGCAATTGG CCAGCCTGAT CGAGGCCTGC TGCGAACATA TTATCGACCC TGTGCAAGGC
CGCTTTCATC TATTTTTCGA TGACCAATGG AACCACCACG AACATGGTAT TTCGTATGGC
CACGATATCG AAGGCAGTTG GTTATTGATG GAAGCAGCCC ATGTGTTGGG CGATGAACAC
TTAATTGCCA AGGCCGAAAC TTTAGCGATT GGCATGGCCG ATGCGGTCTA TCGCAATGGC
CGCCACGCCG ATGGCAGCAT TATTCACGAA CGCGCTCCCG ATGGCTCGAT TAATTTGGAA
CGCCATTGGT GGCCCCAAGC TGAAGCCGTT GTTGGCTTCT ACAACGCCTA TCAGGCCACT
GGCAAACCTG AGTTTGCTCA AGCCGCCTAC GATAGTTGGA ATTTTATTCA ACGCTATTTT
ATCGACCACG ATCATGGTGA TTGGTTCAAA ATTCTTGATG CTCAAAACCA ACCGTTGGGA
GCGATTCCAA AAGTTGGCCC ATGGGAATGC CCGTATCACC ATGCCCGCGT TTGTTTTGAA
ATGATCGAAC GTTTAGCTGA ACACAACGTG AGTGTTCAGA TGCGAGGGTA A
 
Protein sequence
MANLVDGGFD HGLREMWQAQ FRDEVLGNIL PFWANHTLDH EHGGFYGGLT NDLTIHNEFP 
RSAVLCGRIL WTFACAYRML GDPRDLAVAE YAYAYLKQAF WDQTYGGLYW SIDANGQPLA
DHKQTYAQSF AIYGLAEYVR ATGDQSALEL AQTLFHLIEN HAFDAVYGGY IEGCDRVWQP
LGDSRLSKLE PEARKTMNTM LHMMEAYANL LRVWDVADVR QQLASLIEAC CEHIIDPVQG
RFHLFFDDQW NHHEHGISYG HDIEGSWLLM EAAHVLGDEH LIAKAETLAI GMADAVYRNG
RHADGSIIHE RAPDGSINLE RHWWPQAEAV VGFYNAYQAT GKPEFAQAAY DSWNFIQRYF
IDHDHGDWFK ILDAQNQPLG AIPKVGPWEC PYHHARVCFE MIERLAEHNV SVQMRG