Gene Hhal_1936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1936 
Symbol 
ID4710616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2133728 
End bp2136649 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content69% 
IMG OID639856409 
Productmolybdopterin oxidoreductase Fe4S4 region 
Protein accessionYP_001003502 
Protein GI121998715 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCA CCCCGCAAGC CGAGGTGGCG GACGATCCCG CTCGTCTTGA ACCTCGGGAC 
CGCCAGGAAG TCCGCTATAC CACCTGCTAC ATGTGTGCCT GTCGCTGCGG TATCCAGGTC
ACCCTGGAGG ACGGTCAGAT CCGTTTCATC CAGGGCAATC CGGATCACCC GGTTAATCGC
GGCGTGCTCT GCGCCAAGGG CAACGCCGGC ATCATGAAGC AGAACTCGCG GGCCAAGCTT
CGTCGCCCGC TGCGTCGCAA GCCCGGTAGC GAACGGGGAG AGGGCGCCTT CGAGGAGATC
TCCTGGGAGA CCGCCCTGGA CGAACTCACC GAGCGCCTCC GCCGCATCCG CGCCGAGGAT
CCGAAGAAGC TCGCCTACTT CACCGGCCGC GACCAGATGC AGGCGCTGAC CGGGCTCTGG
GCTACGCAGT TCGGGACCAT CAACTGGGCC GCCCACGGCG GCTTCTGCTC GGTGAACATG
GCCGCGGCCG GCCTCTACAC CCTGGGTCAC GCCTTCTGGG AGTTCGGCGA TCCGGACTTC
GAGCGCACCC GCTACTTCCT GCTCTGGGGG ACCGCCGAGG ACCACGCCTC CAACCCCTTC
AAGCTGGGCA TCGACACCCT CAAACGCCGG GGTGGGCGCT TCGTTGCCAT CAATCCGGTG
CGCACCGGCT ACCAGGCCGT TGCCGATGAG TGGGTGCCCA TCCGCCCGGG CACCGACGGC
ATGCTGGCCA TGGCGCTGAT CCACTGCCTG CTGCGGGACG GCCAGTTCGA CTGGGACTAC
CTGATCCGCT ACACCAACGC CCCCTACCTG GTGGTGCAGA CCCCGGGGCA GGCTGGCGAC
GGCCTGTTCC TGCGCGACGA GCAGGGCGCG CCGTTGGTCC GGGATCTCGA GCGGGAGGAC
TTCGTCGACG GCACCCGGGC CGAGATTGCG CCGGCGCTCT TCGGTGCCTG GACGGCCCCG
GACGGTCGAC CGGTGAAGAC GGCCATGACC CTCCTGGCCG AGCGGTATCT GGATCCGCAG
TACGCGCCGG ATCAGGCCGC CGAGGTCTGC GGCGTCCCGG CGGAGACCAT CGAGCGCCTG
GCCGCCGAGA TGGCCCACGT CGCCTTCCAG GAGACCATCG AGATCGAGTG CCAGTGGACC
GATTGGGCCG GGCGTGAGCA CGACCGGTTC ATCGGGCGGC CGGTCTCCAT GCACGCCATG
CGGGGTGTCT CCGCCCACTC CAACGGCTTC CAGGCGGCCC GGGCGCTGCA CCTTCTGCAG
CTGCTGCTCG GCTCGGTGGA CTGCCCCGGC GGGCACCGTG CCAAGCCCCC GTACCCGAAG
CCGATCCCGC CGCCGCTGCG CCCGGCCCGG GAGACGGCGC CGGAGACGCC GCTGTCGGCC
TCGCCGCTGG GGTTCCCGGT GGCCCCGGAG GACCTGGTCA TCGACGGCGA GGGGCGGCCG
CTGCGGATCG ACAAGGCGTT CTCCTGGGAG GCCCCGGTCT CGCCCCACGG CAAGATGCAC
ACCGTTATCA GCAACGCCCA CGACGGGGAT CCGTACCCCA TCGATACGCT GATGCTGTTC
ATGGCCAACA TGGCCTGGAA CTCCACCATG AACACCGCCA GCGTTCAGGA GATGCTGTGC
GCCCGGGACG ATAACGGGGA TTACCGCATC CCCTATCTGG TGGTGGTGGA CGCCTTCCAC
TCCGAGACGG TGCAGTACGC CGATCTGGTG CTGCCGGATA CCACCTATCT GGAGCGCCAC
GACTGCATCT CCATGCTCGA CCGACCGATC TCCACCGCCG AGGGACCGGC GGATGCCATC
CGTCAGCCGA TCCTCGAGCC GGAGGGCGAG GCGCGCCCCT GGCAGGAGGT GATGATCGAA
CTCGGTGCCC GCCTGGGGCT GCCGGCGTTC ACCGAGGCGG ATGGCAGCCC GAAGTACAGC
GGTTACGAGG ACTTCATCGT CCGCTTCGAG AAGGCCCCCG GGGTCGGCTT CCTGGCCGGC
TGGCGGGGTG AGGACGGCAG CAAACCGCTG CGCGGCGAGC CCAACCCGCA GCAGTGGGAG
CGTTACATCG AGAACGGCGG TTTCTTCCAG TACGAGCTGC CGCTCTCGCA CCAGTTCTAT
AAGTTCGCCA ACAAGGGGTA CCTGGAGTGG GCCGAGGAGG CCGGGATCAA CGGCTCCGCC
GAGCCGATGG TGATGAATGT CTACTCCGAG CCCCTGCAGC GCTTCCGGCT GGCCGGTCAG
GGCCTCTACG ACGGGCCGCA GCCCGAGGAT CCGGTGGATC GCGAGCGCAT CCTCGCCTAC
TGCGACCCGC TGCCCTTCTA CTACCCGCCG CTGGAGCAGA CGCGCCTGGC GGGGCAGGGG
TATACCTTCC ACGCCATCAC CCAGCGCCCC ATGACCCAGT ATCACGCCTG GGACAGCCAG
AACGCCTGGC TGCGCCAGAT CATGGCCGAC AACGTCCTGT ACATGAACCG GGCGCGGGGC
GAGGCGCTGG GCTTCGAGGA CGGTGACTGG GTCTGGGTGG AGTCCCACCG GGGGCGGATC
TGTGTTCCGC TGCAGCTGGT CGAGGGCGTT CAGGCCGACA CCGTGTGGAC CTGGAACGCG
GTGGGCAAGC GCTCCGGCGC CTGGGGACTG GAGCCGGGCG GGCCGGAGGC CACCCGCGGT
TTCCTGCTCA ATCACCTGAT CGATGACCGG CTGCCGCGCA ACGGGGACGA GAAGCCGTTG
AGCAACTCCG ACCCGATTAC CGGGCAGGCG GCCTGGTACG ACCTCCAGGT GCGCATCCAC
AAGGCGGCTC CTGGCGAGGG CGGGGTCTCC CACCCCCAGT TCGCCGACCT GACCCCGCCG
CCGGGGGTCG ATAACGGACC GCTGAAGTTC CTGCGGTACG CAACCCACCA TGCGGTGCGC
CTGCACCGCT CCATGCGCGA CATTCTAAGC CGGGGGCGTT GA
 
Protein sequence
MSVTPQAEVA DDPARLEPRD RQEVRYTTCY MCACRCGIQV TLEDGQIRFI QGNPDHPVNR 
GVLCAKGNAG IMKQNSRAKL RRPLRRKPGS ERGEGAFEEI SWETALDELT ERLRRIRAED
PKKLAYFTGR DQMQALTGLW ATQFGTINWA AHGGFCSVNM AAAGLYTLGH AFWEFGDPDF
ERTRYFLLWG TAEDHASNPF KLGIDTLKRR GGRFVAINPV RTGYQAVADE WVPIRPGTDG
MLAMALIHCL LRDGQFDWDY LIRYTNAPYL VVQTPGQAGD GLFLRDEQGA PLVRDLERED
FVDGTRAEIA PALFGAWTAP DGRPVKTAMT LLAERYLDPQ YAPDQAAEVC GVPAETIERL
AAEMAHVAFQ ETIEIECQWT DWAGREHDRF IGRPVSMHAM RGVSAHSNGF QAARALHLLQ
LLLGSVDCPG GHRAKPPYPK PIPPPLRPAR ETAPETPLSA SPLGFPVAPE DLVIDGEGRP
LRIDKAFSWE APVSPHGKMH TVISNAHDGD PYPIDTLMLF MANMAWNSTM NTASVQEMLC
ARDDNGDYRI PYLVVVDAFH SETVQYADLV LPDTTYLERH DCISMLDRPI STAEGPADAI
RQPILEPEGE ARPWQEVMIE LGARLGLPAF TEADGSPKYS GYEDFIVRFE KAPGVGFLAG
WRGEDGSKPL RGEPNPQQWE RYIENGGFFQ YELPLSHQFY KFANKGYLEW AEEAGINGSA
EPMVMNVYSE PLQRFRLAGQ GLYDGPQPED PVDRERILAY CDPLPFYYPP LEQTRLAGQG
YTFHAITQRP MTQYHAWDSQ NAWLRQIMAD NVLYMNRARG EALGFEDGDW VWVESHRGRI
CVPLQLVEGV QADTVWTWNA VGKRSGAWGL EPGGPEATRG FLLNHLIDDR LPRNGDEKPL
SNSDPITGQA AWYDLQVRIH KAAPGEGGVS HPQFADLTPP PGVDNGPLKF LRYATHHAVR
LHRSMRDILS RGR