Gene Haur_4820 details


Gene Information

Locus tag: Haur_4820
Symbol:
ID: 5736665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6146730
End bp: 6147758
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 52%
IMG OID: 641281985
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001547578
Protein GI: 159901331
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1089] GDP-D-mannose dehydratase
TIGRFAM ID: [TIGR01472] GDP-mannose 4,6-dehydratase
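
The start and end coordinates, gene length, and protein length listed above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def feature_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Amino-acid count, assuming the final codon is a stop codon."""
    return gene_length_bp // 3 - 1


gene_len = feature_length(6146730, 6147758)
print(gene_len)                  # 1029
print(protein_length(gene_len))  # 342
```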


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.503744
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAATAG GCATAAACAG CTTCTTGCCA ACAATCTCAA CTCGCAGTAT AGTGCGGGCT 
ATGACACGAG CATTAATTAC TGGCGCTAAT GGCTTTGTTG GCCAACATCT TGTTCGCTAT
TTGCAGCAAG CAACCACTTG GGAATTATGG GCTTTGGGGC GTGAAGCTCA CCCACAACTT
CCAACAGTGC TGGCTGATCT GCTTGATCGT TCGGCGGTAG CAACGGCGGT GGCAAATGCA
GCCCCCGATC TGGTGGTGCA TTTAGCGGCC CAATCGGCGA TTCCTCAATC GTTTCGTGAT
CCCGCCGGCA CGTTTAGTAT CAATGTGCTC GGCCAATTGC ACCTATTTGA AGCGATCAAG
TCGGCTCAAC TTGATCCAAT TGTGTTGGTG GTTGGCTCGA ATGCGATGTA TGGCATGGCC
CATCGTTCAG GCTTGCCCGC CGATGAAAAC ACCATGCTTT GCCCAGCTGA TCCCTATGCT
GTTTCCAAGG CAGCCCAAGA TCTGTTGGCG GGGCAATGGT GGTATAGCCA TGGCCTGAAG
GTGATTCGTG CTCGGCCATT TAACCATACT GGGCCTGGCC AACGGGCTGA TTTTGTTGTG
CCAGCCTTTG CCCACCAAAT TGCTCGCATC GAAGCGGGCT TGCAGCCGCC AGTGATTCAG
GTTGGCAACC TTACGCCACA GCGTGATTTT AGCGATGTAC GTGATGTTGT GCGGGCCTAT
CATCTGCTGC TTGAACGAGC GCAGCCTGGC GAAATTTATA ATATTGGCGT AGGTCAGAGT
GTCTCAATTC AGTCAATTCT TGATCGCCTC ATTGCACTCA GTGGCCAAAC GATCACGGTC
GAAGTTGATC CTCAACGCTT GCGTCCAGTT GATGTGCCAA TCGTGGCGTG CGATGCTAGT
CGCTTGCGCA GCCAAATCGG GTGGGAGCCG CAGTATTGCC TTGACGATAC GCTGAGCGAT
ATTCTGAATG AATGGCGCAG CCACGTTGCA ACCGAGCAAG AGAGAGTTGG TTCCCACATT
AACCAATAA
 
Protein sequence
MRIGINSFLP TISTRSIVRA MTRALITGAN GFVGQHLVRY LQQATTWELW ALGREAHPQL 
PTVLADLLDR SAVATAVANA APDLVVHLAA QSAIPQSFRD PAGTFSINVL GQLHLFEAIK
SAQLDPIVLV VGSNAMYGMA HRSGLPADEN TMLCPADPYA VSKAAQDLLA GQWWYSHGLK
VIRARPFNHT GPGQRADFVV PAFAHQIARI EAGLQPPVIQ VGNLTPQRDF SDVRDVVRAY
HLLLERAQPG EIYNIGVGQS VSIQSILDRL IALSGQTITV EVDPQRLRPV DVPIVACDAS
RLRSQIGWEP QYCLDDTLSD ILNEWRSHVA TEQERVGSHI NQ
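
The sequences above are printed in blocks of ten characters separated by spaces. The following is a rough sketch of how such a block-formatted sequence could be normalized before checking its length or GC fraction. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGAATAG GCATAAACAG CTTCTTGCCA ACAATCTCAA CTCGCAGTAT AGTGCGGGCT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence should allow a comparison against the listed gene length and GC content.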