Gene Haur_0401 details


Gene Information

Locus tag: Haur_0401
Symbol:
ID: 5731969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 471172
End bp: 472380
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 50%
IMG OID: 641277524
Product: nucleoside-diphosphate-sugar epimerase
Protein accession: YP_001543180
Protein GI: 159896933
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
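
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 471172
END_BP = 472380

# Assumption: the CDS ends with exactly one stop codon.
STOP_CODON_COUNT = 1


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - STOP_CODON_COUNT


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both results agree with the Gene Length (1209 bp) and Protein Length (402 aa) reported in the record.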


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAAC GAATATGGAT AGTGTTGTGT GGCTTGATGC TTGGCCAGCG CTGCTGGAAA 
TGGTGGCAAG TATGGCGCTT TTTTGGCAAG CCTACTCCTA CTGCTCAACA CGAGCCTGCG
ACCACCTTGG TGAGTTTGCT CCAGCCAATT TTGAGCGGCG ACCCGCATTT GGCCATATGT
TTACGCGCCA ATTTGAATGC GCCAAGCAGC TATAAGCGCG AATGGCTATG GTTAATTGAT
GATGATGATC GGATCGCTCA ACAGCTTTGC TATGGATTGC AAGCAGAATA TGCCGAGCAA
ACGATTCGGA TTATCAGTTT GCCAGCCCCA GCTGAGCGGG TTAATCCCAA AACCTTCAAG
CTAATTGCGG GATTACAACA AGCCCAAGGC CAAATTATTT GCGTGCTCGA CGATGATACC
AGCCTGCCAG CCTATGGCTT GGAACAATGT TTGCCATGGC TCGATCAAGC GGGAGTAGGC
TTGGCCTTTG GCTTGCCCTA CTATCGCTCG TTCGATAATA CTTGGTCGAG TTTGGTGGCA
TTGTTTGTTA ATAGCAATAG TTTGCTGACC TATGTACCCT ATAGCCAAGT TAGCGAGCCA
TTTACGATCA ATGGAATGTT CTATGCCATG CGCCGCGACG TTTTGGAGCA ATTGCATGGT
TTTGTTGGCT TAGAGCATAT TTTGGCTGAC GATTTTGCGG TGGCGCAACG GGTGCAACAG
GCAGGCCTGC GCTTGCAGCA AACCAGCATG CGCCATGCAA TTCGTACTAC CGTGACCAAT
GCCCAACGCT ATCGCAGCCT GATCCAACGC TGGTTTATCT TCCCACGTGA ATCGCTGCTA
CGCCATTTGA ATCGGCGCGA ACGCAGCTTG CTGTTTTGTT TGGCGATTGT GCCCACGCTG
TTTCCATTGG TTTTGGCGAT CGTGAGCGTC TTGCGACCAA GCCAACGTCA ACGCTGGTTT
GCTGCAACGT ATACTTTGCT TGGCCTGATC AGTTTTATTC AGATTGATCA AGCCTACCTA
GAGCAAGCCA CGCCACGGCG CTATTGGCTG TTTGTACCAT TTTTAGAATT GCTCATTCCA
GTGCAATTGA TTCAAGCCTT GCTTGCGCCT CAGCGCATTG TTTGGCGCGG CCATGTGATG
GATGTGGAAA AAGGCGGCGC ATTTCGCTTT GTGCAACGCA GGGATGATGG TTCGGCGAAT
GGGTTCTAG
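
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure such as the one in the record could be recomputed, the helpers below (hypothetical names, plain Python with no external libraries) strip the layout and count G and C bases. How the database rounds its own value is not stated in the record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Last line of the gene sequence above: 4 G + 1 C out of 9 bases.
print(round(gc_percent("GGGTTCTAG"), 1))  # 55.6
```

Passing the complete gene sequence as one string gives a value for the whole CDS that can be compared with the GC content field.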
 
Protein sequence
MIKRIWIVLC GLMLGQRCWK WWQVWRFFGK PTPTAQHEPA TTLVSLLQPI LSGDPHLAIC 
LRANLNAPSS YKREWLWLID DDDRIAQQLC YGLQAEYAEQ TIRIISLPAP AERVNPKTFK
LIAGLQQAQG QIICVLDDDT SLPAYGLEQC LPWLDQAGVG LAFGLPYYRS FDNTWSSLVA
LFVNSNSLLT YVPYSQVSEP FTINGMFYAM RRDVLEQLHG FVGLEHILAD DFAVAQRVQQ
AGLRLQQTSM RHAIRTTVTN AQRYRSLIQR WFIFPRESLL RHLNRRERSL LFCLAIVPTL
FPLVLAIVSV LRPSQRQRWF AATYTLLGLI SFIQIDQAYL EQATPRRYWL FVPFLELLIP
VQLIQALLAP QRIVWRGHVM DVEKGGAFRF VQRRDDGSAN GF
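
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It builds the codon table from the compact NCBI ordering (bases T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons. The special handling of alternative start codons is left out here, on the assumption that it is not needed because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text):
    """Translate a CDS, stopping at the first stop codon.

    Whitespace from the block layout is ignored. Alternative start
    codons are NOT converted to M (simplifying assumption), and any
    codon with a character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGATTAAAC GAATATGGAT AGTGTTGTGT GGCTTGATGC TTGGCCAGCG CTGCTGGAAA"
print(translate(first_line))   # MIKRIWIVLCGLMLGQRCWK
# Last line of the gene sequence: two codons followed by the TAG stop.
print(translate("GGGTTCTAG"))  # GF
```

The two printed fragments match the first 20 residues and the last 2 residues of the protein sequence listed above.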