Gene Hhal_1537 details


Gene Information

Locus tag: Hhal_1537
Symbol:
ID: 4709130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1671337
End bp: 1672356
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 67%
IMG OID: 639856004
Product: UDP-glucose 4-epimerase
Protein accession: YP_001003106
Protein GI: 121998319
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1087] UDP-glucose 4-epimerase
TIGRFAM ID: [TIGR01179] UDP-glucose-4-epimerase
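
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions are consistent with the values in this record, but the record does not state them). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1671337, 1672356)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Both results match the Gene Length and Protein Length fields of the record.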


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGTCC TGGTCACCGG CGGCACCGGG TACATCGGCA GCCACGCGGC GGTGGCGCTC 
ATCGAGGCGG GCCATGAGGC GGTGCTCCTA GACAATCAGC ACAACAGCAC CGGCGCTGTC
GCATCTCGCA TCGGCCGGAT CACCCGCACA TCGCCGGAAC TGATCCGAGG CGATGTACGG
GATGCCTCGA TGCTGGCGGA GTTATTCAGC GCCCGGCGCA TCGACGCGGT GATGCACTTT
GCCGGGCTCA AGGCCGTGGG CGAGAGCGTG CAGCGACCGC TCGACTACTA CGAAACCAAT
GTCGGCGGTA CCATTCGCCT CTGCCAGGCG ATGGAAGCCG CCGGAGTCCG CAAGCTGATT
TTCAGCTCCT CGGCCACCGT ATACGGCGAC CCGGATCGCG TACCAATCCG CGAGGACGCC
CCTACCGGCG GGACGACCAA CCCCTACGGC ACTAGCAAGC ACATGGTCGA GCGGATGCTC
ACCGACCTGA GCACAGCCAA CCCGGGCTGG TGCATCGGGA TCCTACGCTA CTTCAATCCC
GTCGGAGCCC ATAAGAGTGC GCTGATCGGC GAATCGCCGT CCGGCATCCC CAACAATCTC
GTGCCGTATA TCGCTCAGGT GGCCGCCGGG GAGCGCGAGC ACCTGCAGGT CTTCGGCGAC
GACTACCCGA CCCGGGACGG CACGGGGGTG CGGGACTACA TCCACGTCGT CGATCTGGCG
GCCGGGCATG TACGCGCGCT GGAGTACCTT GACGCCCAAC CGGGTCAGCA CGCCTGGAAC
CTCGGCCGCG GCGAGGGGCA CTCGGTCTTG GAGGCCGTGC GTGCCTTCGA GCAAGCGAGC
GGGTGCTCCA TCCCGTACCG GGTTACGGAG CGGCGCCCCG GCGATGTCGC CGAGTGCTGG
GCCGACCCAA GCAAGGCCGA GCGGGAGCTC GGCTGGCGCG CCGAGCGAGG CCTGGCCACG
ATGATGGAGG ATGCCTGGCG CTGGCAGCGT CAGGGCTCGG ATACCGAGAC AGCTTCCTGA
 
Protein sequence
MRVLVTGGTG YIGSHAAVAL IEAGHEAVLL DNQHNSTGAV ASRIGRITRT SPELIRGDVR 
DASMLAELFS ARRIDAVMHF AGLKAVGESV QRPLDYYETN VGGTIRLCQA MEAAGVRKLI
FSSSATVYGD PDRVPIREDA PTGGTTNPYG TSKHMVERML TDLSTANPGW CIGILRYFNP
VGAHKSALIG ESPSGIPNNL VPYIAQVAAG EREHLQVFGD DYPTRDGTGV RDYIHVVDLA
AGHVRALEYL DAQPGQHAWN LGRGEGHSVL EAVRAFEQAS GCSIPYRVTE RRPGDVAECW
ADPSKAEREL GWRAERGLAT MMEDAWRWQR QGSDTETAS
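
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked text could be cleaned and its GC fraction computed, for comparison with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGCGAGTCC TGGTCACCGG CGGCACCGGG TACATCGGCA GCCACGCGGC GGTGGCGCTC"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this row only
```

Passing the full 17-row gene sequence through the same two functions would give a value to compare against the reported GC content of 67%.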