Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1936 |
Symbol | |
ID | 4710616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2133728 |
End bp | 2136649 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856409 |
Product | molybdopterin oxidoreductase Fe4S4 region |
Protein accession | YP_001003502 |
Protein GI | 121998715 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTCA CCCCGCAAGC CGAGGTGGCG GACGATCCCG CTCGTCTTGA ACCTCGGGAC CGCCAGGAAG TCCGCTATAC CACCTGCTAC ATGTGTGCCT GTCGCTGCGG TATCCAGGTC ACCCTGGAGG ACGGTCAGAT CCGTTTCATC CAGGGCAATC CGGATCACCC GGTTAATCGC GGCGTGCTCT GCGCCAAGGG CAACGCCGGC ATCATGAAGC AGAACTCGCG GGCCAAGCTT CGTCGCCCGC TGCGTCGCAA GCCCGGTAGC GAACGGGGAG AGGGCGCCTT CGAGGAGATC TCCTGGGAGA CCGCCCTGGA CGAACTCACC GAGCGCCTCC GCCGCATCCG CGCCGAGGAT CCGAAGAAGC TCGCCTACTT CACCGGCCGC GACCAGATGC AGGCGCTGAC CGGGCTCTGG GCTACGCAGT TCGGGACCAT CAACTGGGCC GCCCACGGCG GCTTCTGCTC GGTGAACATG GCCGCGGCCG GCCTCTACAC CCTGGGTCAC GCCTTCTGGG AGTTCGGCGA TCCGGACTTC GAGCGCACCC GCTACTTCCT GCTCTGGGGG ACCGCCGAGG ACCACGCCTC CAACCCCTTC AAGCTGGGCA TCGACACCCT CAAACGCCGG GGTGGGCGCT TCGTTGCCAT CAATCCGGTG CGCACCGGCT ACCAGGCCGT TGCCGATGAG TGGGTGCCCA TCCGCCCGGG CACCGACGGC ATGCTGGCCA TGGCGCTGAT CCACTGCCTG CTGCGGGACG GCCAGTTCGA CTGGGACTAC CTGATCCGCT ACACCAACGC CCCCTACCTG GTGGTGCAGA CCCCGGGGCA GGCTGGCGAC GGCCTGTTCC TGCGCGACGA GCAGGGCGCG CCGTTGGTCC GGGATCTCGA GCGGGAGGAC TTCGTCGACG GCACCCGGGC CGAGATTGCG CCGGCGCTCT TCGGTGCCTG GACGGCCCCG GACGGTCGAC CGGTGAAGAC GGCCATGACC CTCCTGGCCG AGCGGTATCT GGATCCGCAG TACGCGCCGG ATCAGGCCGC CGAGGTCTGC GGCGTCCCGG CGGAGACCAT CGAGCGCCTG GCCGCCGAGA TGGCCCACGT CGCCTTCCAG GAGACCATCG AGATCGAGTG CCAGTGGACC GATTGGGCCG GGCGTGAGCA CGACCGGTTC ATCGGGCGGC CGGTCTCCAT GCACGCCATG CGGGGTGTCT CCGCCCACTC CAACGGCTTC CAGGCGGCCC GGGCGCTGCA CCTTCTGCAG CTGCTGCTCG GCTCGGTGGA CTGCCCCGGC GGGCACCGTG CCAAGCCCCC GTACCCGAAG CCGATCCCGC CGCCGCTGCG CCCGGCCCGG GAGACGGCGC CGGAGACGCC GCTGTCGGCC TCGCCGCTGG GGTTCCCGGT GGCCCCGGAG GACCTGGTCA TCGACGGCGA GGGGCGGCCG CTGCGGATCG ACAAGGCGTT CTCCTGGGAG GCCCCGGTCT CGCCCCACGG CAAGATGCAC ACCGTTATCA GCAACGCCCA CGACGGGGAT CCGTACCCCA TCGATACGCT GATGCTGTTC ATGGCCAACA TGGCCTGGAA CTCCACCATG AACACCGCCA GCGTTCAGGA GATGCTGTGC GCCCGGGACG ATAACGGGGA TTACCGCATC CCCTATCTGG TGGTGGTGGA CGCCTTCCAC TCCGAGACGG TGCAGTACGC CGATCTGGTG CTGCCGGATA CCACCTATCT GGAGCGCCAC GACTGCATCT CCATGCTCGA CCGACCGATC TCCACCGCCG AGGGACCGGC GGATGCCATC CGTCAGCCGA TCCTCGAGCC GGAGGGCGAG GCGCGCCCCT GGCAGGAGGT GATGATCGAA CTCGGTGCCC GCCTGGGGCT GCCGGCGTTC ACCGAGGCGG ATGGCAGCCC GAAGTACAGC GGTTACGAGG ACTTCATCGT CCGCTTCGAG AAGGCCCCCG GGGTCGGCTT CCTGGCCGGC TGGCGGGGTG AGGACGGCAG CAAACCGCTG CGCGGCGAGC CCAACCCGCA GCAGTGGGAG CGTTACATCG AGAACGGCGG TTTCTTCCAG TACGAGCTGC CGCTCTCGCA CCAGTTCTAT AAGTTCGCCA ACAAGGGGTA CCTGGAGTGG GCCGAGGAGG CCGGGATCAA CGGCTCCGCC GAGCCGATGG TGATGAATGT CTACTCCGAG CCCCTGCAGC GCTTCCGGCT GGCCGGTCAG GGCCTCTACG ACGGGCCGCA GCCCGAGGAT CCGGTGGATC GCGAGCGCAT CCTCGCCTAC TGCGACCCGC TGCCCTTCTA CTACCCGCCG CTGGAGCAGA CGCGCCTGGC GGGGCAGGGG TATACCTTCC ACGCCATCAC CCAGCGCCCC ATGACCCAGT ATCACGCCTG GGACAGCCAG AACGCCTGGC TGCGCCAGAT CATGGCCGAC AACGTCCTGT ACATGAACCG GGCGCGGGGC GAGGCGCTGG GCTTCGAGGA CGGTGACTGG GTCTGGGTGG AGTCCCACCG GGGGCGGATC TGTGTTCCGC TGCAGCTGGT CGAGGGCGTT CAGGCCGACA CCGTGTGGAC CTGGAACGCG GTGGGCAAGC GCTCCGGCGC CTGGGGACTG GAGCCGGGCG GGCCGGAGGC CACCCGCGGT TTCCTGCTCA ATCACCTGAT CGATGACCGG CTGCCGCGCA ACGGGGACGA GAAGCCGTTG AGCAACTCCG ACCCGATTAC CGGGCAGGCG GCCTGGTACG ACCTCCAGGT GCGCATCCAC AAGGCGGCTC CTGGCGAGGG CGGGGTCTCC CACCCCCAGT TCGCCGACCT GACCCCGCCG CCGGGGGTCG ATAACGGACC GCTGAAGTTC CTGCGGTACG CAACCCACCA TGCGGTGCGC CTGCACCGCT CCATGCGCGA CATTCTAAGC CGGGGGCGTT GA
|
Protein sequence | MSVTPQAEVA DDPARLEPRD RQEVRYTTCY MCACRCGIQV TLEDGQIRFI QGNPDHPVNR GVLCAKGNAG IMKQNSRAKL RRPLRRKPGS ERGEGAFEEI SWETALDELT ERLRRIRAED PKKLAYFTGR DQMQALTGLW ATQFGTINWA AHGGFCSVNM AAAGLYTLGH AFWEFGDPDF ERTRYFLLWG TAEDHASNPF KLGIDTLKRR GGRFVAINPV RTGYQAVADE WVPIRPGTDG MLAMALIHCL LRDGQFDWDY LIRYTNAPYL VVQTPGQAGD GLFLRDEQGA PLVRDLERED FVDGTRAEIA PALFGAWTAP DGRPVKTAMT LLAERYLDPQ YAPDQAAEVC GVPAETIERL AAEMAHVAFQ ETIEIECQWT DWAGREHDRF IGRPVSMHAM RGVSAHSNGF QAARALHLLQ LLLGSVDCPG GHRAKPPYPK PIPPPLRPAR ETAPETPLSA SPLGFPVAPE DLVIDGEGRP LRIDKAFSWE APVSPHGKMH TVISNAHDGD PYPIDTLMLF MANMAWNSTM NTASVQEMLC ARDDNGDYRI PYLVVVDAFH SETVQYADLV LPDTTYLERH DCISMLDRPI STAEGPADAI RQPILEPEGE ARPWQEVMIE LGARLGLPAF TEADGSPKYS GYEDFIVRFE KAPGVGFLAG WRGEDGSKPL RGEPNPQQWE RYIENGGFFQ YELPLSHQFY KFANKGYLEW AEEAGINGSA EPMVMNVYSE PLQRFRLAGQ GLYDGPQPED PVDRERILAY CDPLPFYYPP LEQTRLAGQG YTFHAITQRP MTQYHAWDSQ NAWLRQIMAD NVLYMNRARG EALGFEDGDW VWVESHRGRI CVPLQLVEGV QADTVWTWNA VGKRSGAWGL EPGGPEATRG FLLNHLIDDR LPRNGDEKPL SNSDPITGQA AWYDLQVRIH KAAPGEGGVS HPQFADLTPP PGVDNGPLKF LRYATHHAVR LHRSMRDILS RGR
|
| |