Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2940 |
Symbol | |
ID | 8743557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 3016401 |
End bp | 3019568 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646513525 |
Product | FAD linked oxidase domain protein |
Protein accession | YP_003404482 |
Protein GI | 284166203 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACAG AAGAGACGCG TTCCACGACC GACCACAGCG ACCAGACGTC GACTCCGGAC GAGAGCCAAC TCGGGCCGCC GGCGGATCCC CGCGAGGGAG ATCCGGCAGC GGATCCGCGG GCCGATTACG ACTACGTCGG CGGCGACGTC GACCGTCCCG CGCTCGTGGC GGCGCTGCGC GAGCGGATCG ACGGCGAGGT GCGCTTCGAC GAGTACAGCA GACAGCTGTA CGCGACCGAC GCGAGCGCCT ACGAGATGAC GCCCGTCGGC GTCGTCCTCC CGCGCTCGAC CGACGACGTC GCGAGCGTCG TCGGCTACTG CGCCGAGAAC GGTATCCCGG TGCTCCCGCG CGGCGGCGGA ACGAGCCTCG CCGGACAGGC GGTCAACGAG GCCGTCGTCC TCGATTTCAC GGCGCACATG GGCGACGTTC TCGAGATCGA CCCCGACGGG CAGCGGGCGA CCGTACAGGG CGGAGCCGTC CTCGCCGACC TGAACGGCGC CCTCGAGTCC CACGGACTGA AGTTCGCGCC GGATCCCGCG GCGGGCAACC GCAGCACCGT CGGCGGCGCC ATCGGGAACA ACTCGACGGG CGCCCACTCG CTGCAGTACG GGAAGACCGA CGCCTACGTC GAGGCGTGCG AGGTCGTCCT CGCCGACGGC TCCGTCGAGC GCTTCGGCGA GGTGACGGTC GGCGAACTCC GCGAGCGGGC CGATCCGGAC GGCGACCTGC TCGAGCGAAT CCACGAGGCG CTGCGCCGCG TGATCGACGA AGAGGCCGCC GCGATCCGCG AGGTCTTCCC GCGACTGAAG CGCAACGTCT CGGGCTACAA CCTCGATCGG CTGGTCGCGG AAGCCTACGG GACACCCGAG GCCTTCGACG AGAACGCCGC GGACTCGATC TCGATCGACG CCGACTCCGA CCCAGACCCC GACGCGACCG TCAACCTCGC CCGCGTCTTC GCCGGCAGCG AGGGGACCCT CGGCGTTGTC ACGGCGGCGA CAGTCTCGCT CGAGTCCGTT CCCGAGACCA CGTCGGTGGT GCTGTTGACC TACCGCGACC TGCTCGAGGC GATGGCCGAC GTCGACACCA TCGTCAGGAA CCACGACCCG GCGGCCGTCG AGGCGATCGA CGACGTCCTG ATCGAACTCG CGGAAAATAC TGAAGAGTTC GCCGCCGTCG CCGAGCGACT CCCCGCGGGC ACCGAGACCG CGCTGCTCGT GGAGTTCTAC GCCGAGGACG ACGACCACGG CCGCGAGCAG GTCGCGGACC TGCTGGCCGA TCGGCTGCCG GACGAGGAAG TTCCGGATCC GAGCGACTCC GAGACCGGCA GTTCGGCGCC GGTCCGTGCT TTCGACGCCC TCGAGGCCCA CGACCCCGAG GGGATCGCCG AACTCTGGAA GCTCCGCAAG AGCGCGGCGC CCATTCTGCT CTCGCGGACG TCCGACGAGA AGCACATCTC GTTCATCGAG GACACCGCCG TCCCCACCGA GAATCTCGCG GACTACGTGG CGGACTTTCA GGAAGTCCTC GAGGAGTACG ACACGTTCGC GAGCTTCTAC GCCCACGCCG GCCCCGGCTG TATGCACATG CGGCCGCTGG TCAGCACCAA GAGCACGGCC GGCCTCGAGG CGTTCGAGTC GCTGGCCGAC GACGTGACCG ACCTCGTCGT CGAGTACGGC GGGAGCGTGT CGGGCGAGCA CGGCGACGGT CGCGCCCGCA CCCAGTGGAA TCGCAAGCTC TACGGCGACG ACGTCTGGGA GCTCTTCCGC GACCTGAAGA CGGCGTTCGA CCCCGACTGG CTCCTGAATC CGGGCAACGT CTGCGGCGAC CACCGCATGA TCGAACAGCT CCGGTTCGAT CCCGACTACG AGTTCGAGGC CGGCTTTGAG CCCGCACTCG AGTGGGACAA CGATAACGGC TTCCAGGGGA TGGTCGAGCT CTGTCACGGC TGCGGCGGCT GTCGCAGCGA GCAGGAGACC ACCGGCGGCG TGATGTGTCC GACCTACCGC GCCGCCGAAG AGGAGAGCCT GAGTACCCGC GGGCGGGCGA ACATGCTCCG ACAGGCGATG AGCGGCGACC TTGAGACCGA TGCCATCGAC GAGGAGTTCC TGGCGGAGAT CATGGACCTC TGTGTCGGCT GCAAGGGCTG TGCGCGGGAC TGCCCGAGCG AGGTCGACAT GGCGAAGCTC AAGGCCGAAG TGGAGCACGC CGCCCATCAG AAGAACGGTG CCAGCTTGCG CGATCGGCTC TTCGCGAACG TCGATCGACT GAACGCCGTC GGGAGCGCGC TCGCGCCGCT ATCGAACTGG GCGGCGTCGC TTCCCGGGTC GGGTGCTATC GCCGAGAAGA CCCTCGGGAT CGCCCGCGAG CGTGAGCTGC CGACGTTCGC GAGCGAGAGC CTCGAGGACT GGTTTAAATC GCGGGGCGGT GCGACGGTGC CCCGCGAGTC GGCCAGCCGA CGGGTCGTGC TCGTCCCCGA CACGTACACG AATTACAACC ATCCCGACGC GGGGAAGGCC GCGGTTCGGG TGCTCGAGGC TGCGGGCGTT CACGTGACGA TCCCCGACGC GATCACGTCG ACGGGCCGGC CGGCCCACTC GAAGGGCTTC CTCGACCGCT CGCGAGAGCG CGCGCGGACG AACGTCGACG CCCTCGAGCC GTTCGTCGCG AACGGCTGGG AGGTCGTCCT CGTCGAGCCC TCCGACGCCG TGATGCTCCA GTCGGACTAC CTGGACCTGC TCTCCGGGCC CGACGTCGAG CGGGTGGCGG CGAACACCTA CGGCGTCATG GAGTACGTCG ACCGGTTCGA TCTGCTCGCG GAGGTCGGTG TCGACGAGTC GGCCGCCCGC GAGCGGCTGA CCTACCACGG CCACTGCCAC CAGAAGGCGA CGAAGAAGGA CGGCCACGCC GCGACGGTGC TCGAGGCCGC CGGCTACGAG GTCGACGCCC TCGACTCGGG CTGTTGCGGG ATGGCCGGGA GCTTCGGCTA TGAGGCCGAA CACTACTCGC TCAGTCGGAA GATCGGGTCG ATCCTCTTCG AACAAGTCGA CGACAGCGAC GGTGACGAAG TCGTCGCGCC CGGCGCCTCC TGTCGGACCC AACTCGAGGG CCACGAGGGC GACTATCCGG CCCACCCGGT CGAGAAACTC GCGGCCGTGG TCCGCTGA
|
Protein sequence | MATEETRSTT DHSDQTSTPD ESQLGPPADP REGDPAADPR ADYDYVGGDV DRPALVAALR ERIDGEVRFD EYSRQLYATD ASAYEMTPVG VVLPRSTDDV ASVVGYCAEN GIPVLPRGGG TSLAGQAVNE AVVLDFTAHM GDVLEIDPDG QRATVQGGAV LADLNGALES HGLKFAPDPA AGNRSTVGGA IGNNSTGAHS LQYGKTDAYV EACEVVLADG SVERFGEVTV GELRERADPD GDLLERIHEA LRRVIDEEAA AIREVFPRLK RNVSGYNLDR LVAEAYGTPE AFDENAADSI SIDADSDPDP DATVNLARVF AGSEGTLGVV TAATVSLESV PETTSVVLLT YRDLLEAMAD VDTIVRNHDP AAVEAIDDVL IELAENTEEF AAVAERLPAG TETALLVEFY AEDDDHGREQ VADLLADRLP DEEVPDPSDS ETGSSAPVRA FDALEAHDPE GIAELWKLRK SAAPILLSRT SDEKHISFIE DTAVPTENLA DYVADFQEVL EEYDTFASFY AHAGPGCMHM RPLVSTKSTA GLEAFESLAD DVTDLVVEYG GSVSGEHGDG RARTQWNRKL YGDDVWELFR DLKTAFDPDW LLNPGNVCGD HRMIEQLRFD PDYEFEAGFE PALEWDNDNG FQGMVELCHG CGGCRSEQET TGGVMCPTYR AAEEESLSTR GRANMLRQAM SGDLETDAID EEFLAEIMDL CVGCKGCARD CPSEVDMAKL KAEVEHAAHQ KNGASLRDRL FANVDRLNAV GSALAPLSNW AASLPGSGAI AEKTLGIARE RELPTFASES LEDWFKSRGG ATVPRESASR RVVLVPDTYT NYNHPDAGKA AVRVLEAAGV HVTIPDAITS TGRPAHSKGF LDRSRERART NVDALEPFVA NGWEVVLVEP SDAVMLQSDY LDLLSGPDVE RVAANTYGVM EYVDRFDLLA EVGVDESAAR ERLTYHGHCH QKATKKDGHA ATVLEAAGYE VDALDSGCCG MAGSFGYEAE HYSLSRKIGS ILFEQVDDSD GDEVVAPGAS CRTQLEGHEG DYPAHPVEKL AAVVR
|
| |