Gene Information |
Locus tag | Htur_4692 |
Symbol | |
ID | 8745288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013745 |
Strand | - |
Start bp | 281365 |
End bp | 284355 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646515196 |
Product | FAD linked oxidase domain protein |
Protein accession | YP_003406143 |
Protein GI | 284172761 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0507445 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTTG AAAAGCCACA GGGCGATCTC AGCAACTACG AGGAGTTTCG AAATACGCTC GACCACGACT ACAGCGACGT CGGCGAGTAC GCCGAACTCG CGGACGATTT GCGATCGGTT ATGAAGGGGG AGGTTCGGTT CGACGAGTAC GCACAGGTGC TGTACTCGAC CGACGGAAGC ATCTACCAGG CCCGGCCGGC CGGTGCCGTT CTCCCGCGGG ACACGGAGGA CGTGCGAAAC GCTATCCGCG TCGCGGCCGA GCACGACGTC CCGATCATGG CCCGCGGCGC CGGGTCGTCC CTCGGCGGAC AGACCGTGGG GCCGGGCTGC GTCGTGCTCG ATCTGTCGAC CTACATGGAC GACATCCTCG AGGTCGACGC CGAGGAGAAA CGGGCGCGCG TCCAACCGGG TGTCGTCCAG GACCACCTCG ACGACCACCT CGCACAGTAC GGCCTCAAGT TCGCGCCGGA TCCTGCTTCA TCGAACCGCG CGACGATCGG CGGCGGGATC GGGAACAACT CGACCGGCGC CCACTCCGTT CGCTACGGAA TCACCGACGC CTACACCGAG GAGTTGAACG TCGTCCTCGC CGACGGCTCG CTGATCCACA CCCGCGAAGT CGTCCTCGAC TCCGAAGAGT ACGAGGAGAT CGTCTCGAGG GGCGACCGCG AGGCGGAGAT CTACCGAACC GTTCGCGCGC TCGTCGAGGA GAACGCCGAC GAGATCGAGG CCCGCTATCC GGACCTCAAG CGCACCGTCT CCGGCTACAA CCTCGACAGG GTGATCTACG AGAACGAGGA CGGAGAGACG GTCATCAACC TCTCGAAGCT CTTCGTCGGC GCGGAGTCCA CCCTCGGCGT GATCGTCGAA GCGGAGCTCA GTCTTGTGAC CCTCCCCGAG GAAACGGCGC TCGTGCTCTA CTGTTTCGAC GATCTGATCG ACGCGCTCGA GGCCGTCCCC GAGGCCCTCG AGTTCGATCC CAGCGCGGTC GAACTGATGG ACAGCGAGGT GTTCAGGCTC GCCCGGGAGT CCGAGCAGTT CTCCCGCTAC GAGGCGCCGA TCCCCGACGG GACGGAGGCG GCGCTGATGC TGGAGTATGA CTCCGAACTC CACGACGACT TCGAAGCGGC GATCGGGCGG ACGAACGCCC ACTTCGTCGA TGAGGGCGAC GCCTTCGAAG CCCTCGAGGC CTACACCGAG GACGAGCAGG CGGACCTCTG GAAGCTCCGG AAGGCGGCCA TTCCGCTGCT GATGAGTCTC GAGGGCGATC CGAAGCCGTA CCCGTTCATC GAGGACGCGA CGGTTCCGCC CGAGGAACTC GCCGAGTACG TCCAGAAGTT CATGGGGATC CTCGACGACC ACGACACCTC GGCGGCGTAC TTCGCCCACG CGGGCAGCGG CACGCTCCAC ATCCGGCCCA TCCTCAACCT GAAGGAGGAC GAGGGCGTCC AGGCGATGCA GTCGATCTCC GGGGACGTGA CGGACCTCGT CGTCGAACAC CACGGCTCGT TCTCCGGCGA GCACGGCGAC GGGCTCGCCC GCACGGCGTT CAATCCGAAG ATGTACGGCG AGGACCTCTG GGCCGCGTTC CAGGAGGTCA AGTCGGTCTT CGACCCCGAC TGGCGGATGA ACCCCGGCAA GGTCGTCTAC ACCGACGAGA ACCCGACCGA CATGCGCGAG AACCTCCGGT ACGGTGCGGG CTACACGTCG ATCGAGCCCC AGACGAAATT GGACTTCACC GACGAGGGCG GGTTCTCGCA GCTCGTCGAA CTCTGCAACG GCTGTGGGAC CTGTCGGCAG 
ACCGACGACG TGATGTGTCC GACCTACCGG GGGATGAAAG ACGAAATCGC GACGACCCGC GGTCGCGCGA ACATGCTCCG GGCAGCGATC TCCGGCGAGA TCGACGAGGA AGAAATGTAC TCCGAGCGCT TCCAGGAGGA GGTGCTCGAC CTCTGCGTCG GCTGCAAGGG CTGCAAGAGC GACTGTCCGA CCGGGGTCGA TCTGGCGAAA CTCAAGGCCG AGACCAAACA CCAGTACCAC GAGCGGGAGG GGATCGGGCT GCGCGAGCGG CTGTTCGGCA ACATCGATAC GGTCTCGAAG CTCGGAAGCG CGCTCGCGCC GCTGGCCAAC GCCGGGAAGC AACTGCCTGG CAGTCGCCTC GTGATGGAGA AGGTCGCCGG CATCGCGCCC GAGCGGACGC TGCCCTCCTT CGAGCGCGAG TCCCTCCAGG AGTGGTTCGC CCGGCGCGGG TCGCGGGTGA GCCCGGAAGC GGCCCACCGG AAGGTGGTGT TGTTCCCCGA CACCTACACG AACTACAACT ACACCCGGCC CGGGAAAGCC GCCGTGCGCG TGCTCGAGTC GGCCGGCGTC CACGTCGAGA TCCCCGAGGA CGTCGCGCCG AGCGGCCGCG CGGCGTTCTC GGTCGGCATG CTCGACGCGG CCGAGGATCG CGCCCGAACG AACGTCGACC TGTTCGCCGA GTACATCGAG GACGGGTACG ACGTCGTCGC GGTCGAACCG TCCGACGCGG TGGTCTTCCA GGACGAGTAC CGCGATCTGG TGAGCACGCC GCAGGCCGAG ACCGTCGCCG CCCACGCCTA CGGCATCAGC GAGTACATCG ACAAGTACCG GCTCCTCGAG CGACTGCCCC TCGACGAGAC CGAGGAGACG CTCGCGTATC ACGGCCACTG CCACCAGAAG GCGACCGGGA AGGACCACCA CGCCGTCGGC GTCCTCCGGC GGGCCGGCTA CGCGGTCGAC CCGATCGACT CTACCTGCTG CGGGATGGCC GGCTCGTTCG GCTACGAGGC CGAACACTAC GACCTCTCGA AGGCGATCGG CGAGCGGCTC TACGAGAAGT TAGACGAGAG CGACGGAATG CCGGTCGCGC CCGGCGCGTC CTGCCGGAGT CAGATCGGCG ATCAAGAAAC TCGAGACGAG AGACCGCCAC ATCCGATCGA GAAGGTGAAC GACCTCCTCA CGGGATCGTA G
|
Protein sequence | MAVEKPQGDL SNYEEFRNTL DHDYSDVGEY AELADDLRSV MKGEVRFDEY AQVLYSTDGS IYQARPAGAV LPRDTEDVRN AIRVAAEHDV PIMARGAGSS LGGQTVGPGC VVLDLSTYMD DILEVDAEEK RARVQPGVVQ DHLDDHLAQY GLKFAPDPAS SNRATIGGGI GNNSTGAHSV RYGITDAYTE ELNVVLADGS LIHTREVVLD SEEYEEIVSR GDREAEIYRT VRALVEENAD EIEARYPDLK RTVSGYNLDR VIYENEDGET VINLSKLFVG AESTLGVIVE AELSLVTLPE ETALVLYCFD DLIDALEAVP EALEFDPSAV ELMDSEVFRL ARESEQFSRY EAPIPDGTEA ALMLEYDSEL HDDFEAAIGR TNAHFVDEGD AFEALEAYTE DEQADLWKLR KAAIPLLMSL EGDPKPYPFI EDATVPPEEL AEYVQKFMGI LDDHDTSAAY FAHAGSGTLH IRPILNLKED EGVQAMQSIS GDVTDLVVEH HGSFSGEHGD GLARTAFNPK MYGEDLWAAF QEVKSVFDPD WRMNPGKVVY TDENPTDMRE NLRYGAGYTS IEPQTKLDFT DEGGFSQLVE LCNGCGTCRQ TDDVMCPTYR GMKDEIATTR GRANMLRAAI SGEIDEEEMY SERFQEEVLD LCVGCKGCKS DCPTGVDLAK LKAETKHQYH EREGIGLRER LFGNIDTVSK LGSALAPLAN AGKQLPGSRL VMEKVAGIAP ERTLPSFERE SLQEWFARRG SRVSPEAAHR KVVLFPDTYT NYNYTRPGKA AVRVLESAGV HVEIPEDVAP SGRAAFSVGM LDAAEDRART NVDLFAEYIE DGYDVVAVEP SDAVVFQDEY RDLVSTPQAE TVAAHAYGIS EYIDKYRLLE RLPLDETEET LAYHGHCHQK ATGKDHHAVG VLRRAGYAVD PIDSTCCGMA GSFGYEAEHY DLSKAIGERL YEKLDESDGM PVAPGASCRS QIGDQETRDE RPPHPIEKVN DLLTGS
|
| |
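The lengths and GC content listed under Gene Information can be cross-checked against the coordinates and the sequence. The sketch below is illustrative only and is not part of the source record. The helper names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence above ends in TAG).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last codon
    is a stop codon, which does not contribute a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(281365, 284355)
    print(length)                  # compare with "Gene Length"
    print(protein_length(length))  # compare with "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field. How the database rounds that figure is not stated in the record.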