Gene Hhal_1645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1645 
Symbol 
ID4709937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1796484 
End bp1797998 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content69% 
IMG OID639856110 
Product3-beta hydroxysteroid dehydrogenase/isomerase 
Protein accessionYP_001003211 
Protein GI121998424 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGATC AAGACCACCC CAACCCCGAC GCCGCAGCGA GCGAGTCACC CCTGTTCGCG 
GTCTTCGGCG CAAGCGGGTA CATCGGCTCG CACCTGGTCC CCGAACTGCT CGGCGCCGGC
TGTCGGGTCA GGGCCGTCGC GCGCAACCGG GAGGTCCTGG AGGCCCGCGG CTGGGAGGGG
GCCGAGCTGG CCGCAGCGGA CGCTCTGAAA CCGGAGACCC TGGTGCCTGC CCTCCGGGGG
GCCGACGTGG CCTACTACCT GGTCCACTCC ATGGGGGCCG GCAAGACCTT CGGCACGCTG
GACGTCGAGG CAGCCCGCAA CTTCGCGGCC GCCGCCGCCG AGGCCGGAGT GCGCCGCATC
GTCTACCTCG GCGGGCTCGT GCCCGAGTCG GCCCGCTCGG CGCACATCCT CTCTCGCCGC
CAGACCGGCG ACACCCTGCG AGAAGGCTCG GTGCCGGTTA CCGAGCTGCG CGCCGGGATC
ATCGTCGGCC CAGGGTCGGC GGCATTCGAG ATCATGCGCG ACCTGGTGCT CCATCTTCCG
GTGATGGTTA CCCCGCGCTG GGTCTTTGCC GAATCCCCGC CCATCGCGCT CCAGGATCTA
TTGGAGTACC TGCGCCGGGC CCCGCAATGC GGGGAAACCG CCGGGGCGAT CTTCGATGTG
GCCGGACCGG AACACCTGAC CTACGCCCAG ATGATGCGCA TCCTCGCCGA GGAGGCCGGG
CGGCGCCCAC CCACGGTGAT CCCGGTCCCC CTGCTGACTC CGAAGCTCTC TTCATACTGG
CTCCGCCTGG TCACTGCAGT CCCCACGCCC ATCGCACGGG CACTCATCGA AGGCCTGCGT
GAGGACTACC GGGCCGATGA CAGCCAGATT CGCCACCTGG TACCGCAGCA GCTACGCGAT
TTCCGCAGCG CCGTACAGGA CGTATTCCGC GCCGAGCGCG AGCAGACCGT GGCCGCGCGC
TGGACCGAAG GCGCCTTTAT GTTCCGCAAC TACCGGATCG ACTACAGCTA CTACGCCAAG
AAGGCCCAGG GGTCGGCGAT CACCACCGCC GACCCCCAAA CCGTCTGGCC GGTTGTCACC
GCCATCGGCG GCGACAGCCG CTACTACTAT GCCAATGCGC TGTGGAAGAT CCGCGAAACC
TTGGACTGGA TGGTCGGCGG CCCGGGGAGG AATTACGGCC GCCGCCACCC CACAGAGCTT
CGGGTAGGTG ATGTGGTCGA CTCCTGGCGG GTCATCGGGC TAGAGCCCGA ACGCCGGCTC
ACCCTCTGGT TCGGTATGCG GGCCCCCGGC TCCGGCGTGC TGGAGATCGA ACTCACACCG
CAAGCCGAGG GCGGGACCAA GATCACGGTT GCCAACCACT GGCACCCGGC CGGGGTCTGG
GGCCTCCTTT ACTGGTACGC CCTGGCCCCG GCTCACTCCC TGATATTCTC GGGGCTGGCC
CGGTCCATCG CCCGGCGGGC TGAGGCATCC TCGACCGCAT CCGGATCTCG CTCGGCGGCG
CCGGACTCGG ACTGA
 
Protein sequence
MMDQDHPNPD AAASESPLFA VFGASGYIGS HLVPELLGAG CRVRAVARNR EVLEARGWEG 
AELAAADALK PETLVPALRG ADVAYYLVHS MGAGKTFGTL DVEAARNFAA AAAEAGVRRI
VYLGGLVPES ARSAHILSRR QTGDTLREGS VPVTELRAGI IVGPGSAAFE IMRDLVLHLP
VMVTPRWVFA ESPPIALQDL LEYLRRAPQC GETAGAIFDV AGPEHLTYAQ MMRILAEEAG
RRPPTVIPVP LLTPKLSSYW LRLVTAVPTP IARALIEGLR EDYRADDSQI RHLVPQQLRD
FRSAVQDVFR AEREQTVAAR WTEGAFMFRN YRIDYSYYAK KAQGSAITTA DPQTVWPVVT
AIGGDSRYYY ANALWKIRET LDWMVGGPGR NYGRRHPTEL RVGDVVDSWR VIGLEPERRL
TLWFGMRAPG SGVLEIELTP QAEGGTKITV ANHWHPAGVW GLLYWYALAP AHSLIFSGLA
RSIARRAEAS STASGSRSAA PDSD