Gene Haur_4278 details

Gene Information

Locus tag: Haur_4278
Symbol:
ID: 5736137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5462649
End bp: 5463587
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 52%
IMG OID: 641281438
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001547038
Protein GI: 159900791
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0211938
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGAT ATTTAGTAAC TGGTGGCGCA GGCTTTATTG GCTCGCATTT GGTTGATGCG 
CTCTTGCAAC GCGGCGATGA GGTGCGGGTT TTCGATAATT TTTCGACTGG CTATGAGCAT
AATCTGGCGC ATTGTATTAA TGATATTGAG TTGGTGCGCG GCGATTTACG CGATGCCGAG
GCTGTGAGCC AAGCGGTGGC TGGCTGCGAA GTAATTTTTC ACGAAGGTGC GTTGCCCTCA
GTGCCACGCT CGGTCAGCGA CCCGCTAACC ACCGATGCGG TCAATACTGG CGGCTCGTTG
CATGTGTTAC AAGCGGCGCG GCAGCATGGC GCACGGCGCG TCGTTTTTGC AGCGTCATCA
TCGGTCTATG GCGATACCCC AATTTTGCCC AAAGTCGAAA CTATGGCCAT GAGCCCCAAA
TCGCCGTATG CAGTCAGCAA AATGGCCGCC GAAAGCTATT TGAAGGTTTT TCATCATGTG
TATGGCTTAG AAACGGTTGG CTTGCGCTAT TTCAACGTTT TTGGGCCGCG CCAAGACCCA
ACTTCGCAAT ATTCGGGCGT AATTGCTCGT TTTATGACCT TGGCCTTACA AGGCGAACCT
TACACCATGA ATGGCACTGG CAATCAATCA CGCGACTTTA CCTACGTTGC GAATGTGGTG
CAGGCCAATT TGCTGGCGGC AAGCGTGCCT GCTGCGGCTG GCCATGTGTT TAACATTGCT
TGTGGCTTGC GCATTAGCCT TAATGATGTG GTTGCGATGT TGAACAAACT AGTTGGTAAA
GAACTACCGA TTATTTACAG TCCAGCCCGT ACTGGCGATG TTGAGCATTC GTTGGCCGAT
ATTAGTGCTG CTCGCCAAAT CTTGGGTTTT GAGCCAAGCG TCGATATTGA AACTGGCATC
GCCCGCACAC TGGATTGGTA TCGCACTCAG GGAGCCTAA
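
The gene length and GC content listed under Gene Information can be recomputed from the gene sequence and the start/end coordinates as a consistency check. The following is a minimal Python sketch: the helper names (`clean_sequence`, `gc_fraction`, `span_length`) are illustrative and not part of any database tooling, and the coordinates are assumed to be inclusive and 1-based, which agrees with 5463587 - 5462649 + 1 = 939.

```python
def clean_sequence(text: str) -> str:
    """Remove whitespace (the record prints bases in blocks of 10) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


# Start bp and End bp as listed under Gene Information.
print(span_length(5462649, 5463587))  # 939

# First 10-base block of the gene sequence, as a small demonstration.
first_block = clean_sequence("ATGGCGCGAT")
print(len(first_block), gc_fraction(first_block))  # 10 0.6
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` should give a length of 939 and a GC fraction that rounds to the listed 52%, provided the record is internally consistent.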
 
Protein sequence
MARYLVTGGA GFIGSHLVDA LLQRGDEVRV FDNFSTGYEH NLAHCINDIE LVRGDLRDAE 
AVSQAVAGCE VIFHEGALPS VPRSVSDPLT TDAVNTGGSL HVLQAARQHG ARRVVFAASS
SVYGDTPILP KVETMAMSPK SPYAVSKMAA ESYLKVFHHV YGLETVGLRY FNVFGPRQDP
TSQYSGVIAR FMTLALQGEP YTMNGTGNQS RDFTYVANVV QANLLAASVP AAAGHVFNIA
CGLRISLNDV VAMLNKLVGK ELPIIYSPAR TGDVEHSLAD ISAARQILGF EPSVDIETGI
ARTLDWYRTQ GA
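
The protein sequence can be cross-checked against the gene sequence by translating it: 939 bp is 313 codons, the last of which is a stop codon, leaving 312 amino acids. Below is a minimal sketch in plain Python. The `translate` helper is hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which start codons are permitted). It assumes the sequence is in frame and contains only A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored; a codon with a non-ACGT base raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCGCGAT ATTTAGTAAC TGGTGGCGCA"))  # MARYLVTGGA
print(translate("GGAGCCTAA"))  # GA
```

Both outputs match the start (`MARYLVTGGA`) and end (`GA`) of the listed protein sequence; translating the full gene sequence the same way allows a residue-by-residue comparison.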