Gene Haur_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4699 
Symbol 
ID5736546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6003683 
End bp6004705 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content53% 
IMG OID641281863 
Product3-beta hydroxysteroid dehydrogenase/isomerase 
Protein accessionYP_001547458 
Protein GI159901211 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTG AGCAATCCAC ACGGGCATTG GTGATTGGTG GTTGTGGCTT TGTCGGCAAA 
CATTTGGTGC AGCAATTATT GGCAGCAGGC CACCCTGTCC GCGTTTTCGA TCTCCAGCCC
TACCCCGACC CTCAAGTCGA ATCGGTTGTA GGCGATTTAC GCAAAGCTGA GCAGGTATTG
CAAGCCTGTC ACGATGTTGG TACGGTGTTT TTGTGTGCAG CAGCGGTCGA TTGGGGTTGG
GGCAATGCCC AGCTCCTGCA CGATGTCAAT GTGTTGGGGC CGCAACATGT GGTTGCTGCT
TGCCAAGCCA CAGGTGTTGC CCAACTGATT TATACCAGCA GCGTTGATGT GGTTTTCGAG
GGCAAACCAA TTCGGGCTGG CGATGAGCAA TTGCCCTATC CCAAGCAGCA CTTAGATATT
TATGGCGCGA CTAAAACCGC TGGTGAACGT TTGGTTTTGG CGGCCAATGG TCAAGCAGGC
TTGGCAACCA GTGCCTTGCG ATTGGGGGGC GTGTATGGCC CAGGCGATTC GCATCGCTTG
CCATCGTTGG TGAATTTAGG CAAGCGTGGC CCAATTCCAC GTTTGGGCAA TGGCTCGGCG
CGATTTTCGC ATATTTATGT TGAAAATGCC GCGCACGGCC ATATTTTAGC AGCCCAGCGT
TTAACCGCCG ATGGTGCGAT GGGCGGCCAA GCCTATTTTT TGGTTGACCC TAATCCTGAT
AACTTTTTTC TGTTTCTCAA GCCGATTGTT GAGGCCTTGG GTTTGCGCAT GGCCAAGCGC
CATGTGCCAT TTGGCTTGAT GCATTTCCTG GCATGGCCCA GCGAATTCTG GTATCGCACA
ACGCGCAGCA AAACCCGCCC AAGCCTAACG CGCTACACAG TTACCTCAAC CTGCGTTGAT
TTTTGGTTTA CCGGAGCCAA GGCCGCCAAC GATTTTGGCT ATCAGCCGCT GGTTGATTTG
GCCGAAGCGC GGCAACGCAC AATTGCATGG GCCAAACGTG AGTTTAATCT AGGCATGAAG
TAA
 
Protein sequence
MALEQSTRAL VIGGCGFVGK HLVQQLLAAG HPVRVFDLQP YPDPQVESVV GDLRKAEQVL 
QACHDVGTVF LCAAAVDWGW GNAQLLHDVN VLGPQHVVAA CQATGVAQLI YTSSVDVVFE
GKPIRAGDEQ LPYPKQHLDI YGATKTAGER LVLAANGQAG LATSALRLGG VYGPGDSHRL
PSLVNLGKRG PIPRLGNGSA RFSHIYVENA AHGHILAAQR LTADGAMGGQ AYFLVDPNPD
NFFLFLKPIV EALGLRMAKR HVPFGLMHFL AWPSEFWYRT TRSKTRPSLT RYTVTSTCVD
FWFTGAKAAN DFGYQPLVDL AEARQRTIAW AKREFNLGMK