Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3126 |
Symbol | |
ID | 5734998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3944153 |
End bp | 3947041 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280269 |
Product | FAD linked oxidase domain-containing protein |
Protein accession | YP_001545891 |
Protein GI | 159899644 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCACG CAGCAGAACT TCAGGCTGAG GTTTTGGCTG CACTTCAGCG GTCGGTTCGT GGCAAGGTTC ACGCCGACCG CTTGACATGT GCGCTGTATA GCACCGATGC CTCTTCCTAC GCCGTCATGC CCAAAGCGGT GGTTATTCCG CACGATCGCG CCGATTTGCA GGCAATCGTC GAAATTGCTC AACGCTACCA TGTGCCAATT GTGCCGCGTG GCGGTGGCAC AAGCCTCTCG GGCCAAGCGA TTGGCGCGGG CATTATCGTC GATCTTTCGC ATTCATTCGG CCAAATTGGC AAATTCGATC CAAGTAGCCG CCAGATTTGG GTTGAAGCTG GCTGCACGCT TGATCGCTTG AATAATTTCC TCAAACCACA TGGCCTAATG TTTGGCCCCG ACCCTTCGTC GAGTGCTTCG GCCACGATCG GCGGCATGCT CGGCAATAAT TCAACTGGCT CGCATTCAAT TGTCTATGGC ATGACTGTTG ATCATACCCT CGCCATGGAA ACGATTTTAG ATGATAGCTC GCTGGCGATT TGGTCAACCC AAGCAGCCAA TAATCCGCGT CAGCAACAGA TTCAAGCTCA ACTTGGCCAA ATTGTTGAGC GTTATGCTCC CAACATTGAG CGCGATTATC CACAAGTTTG GCGCAATGTG GCAGGCTACA ATCTCCAACG CCTGCTGCAA CGCCAGCAAC AGCAACAACC ATTTAGCGCT GTGCCAATTT TGGTTGGTAG CGAGGGCACG CTCGGCTTGA CGGTTGCCGC CCAACTCAGC TTGGTACCAC GGCCAAAGGT CACACGGTTG GCACTCTGGC ATTTTGATGA TCTTCAGCGA GCCTTGGCTT TAGTACCCGA AATTCTGCGC TTTCAGCCAA GCGCTGTCGA ATTATTTGAT CGCTATTTTA TTAATTTGAC TCGCCAAAAC CCTGAGTATG GTTCGCGCCT GAGTTTTGTG GTGGGCGAGC CACGGGTGGT CTTAATTGTT GAATGGGCGG GAACCGATTC CACTGAATTA GCTGCCAACG ATGCGGCCTT AGAAGCTCAT TTACGCGCGT TGGGCGAAAC TGGCCTGATT GTGCGCGAAA CAACACCCGC TGCCATTGCC AATGTTTGGG CGGTGCGCAA GGCTGGATTA GGCTTGTTGA TGAGCCAGCG CGGCGATGCC AAACCCCTGG CTTTTGTTGA TGACGCAACC GTGCCAGTCG AACGTTTGGC CGAATATGCC CGTGGCGTTG AAGCGATTTG CCGCGAAGCT GGCACTGAAG CCACGTTTTA TGCTCATGCC TCGGTTGGCT GCTTACACAT CAATCCATTA ATTAATTTGA AAACTGAGCA TGGTTTAGCC CAGATGGCTC AAATTTCGCA GGCCGTTGCT AGTTTAGCAA TTAGCCTTGG CGGCACAACC ACGGGCGAAC ATGGCGAGGG TTTGGCTCGA TCTGCCTTCA ATCAAAAATT GTATGGCACG GAACTGCATC AAGCTTTTGG CGAAATCAAG CAATTGTTCG ACCCAAACCA AATCTTCAAC CCAGGCAAGA TTTTGACCGC ACCCCAGCCG TGGCAACCTG AAATCCTGCG CATCAACCCC AGCTACCAAA CGCCGCATGC GCCCAGCGTA ACTTTTTTTG ATTTTACGCC TGATGGCGGT TTTGCTGGCT TAGTCGAGAT GTGCAACGGC CAAGGGGTTT GCCGCAAAGA TGATGCAGGC GTAATGTGTC CATCGTATAT GGCGACTCAC GACGAGGCCA ACTCGACTCG TGGCCGTGCC AATGCCCTAC GTGCAGCCAT GACTGGTCAG CTTGGAACGG CTGGATTGAG CAGCCCTGAA TTGCACGAGG CCATGGATTT ATGCCTTGAA TGCAAGGCTT GTAAACGCGA ATGCCCATCG ATTGTTGATA TGGCGCGGCT CAAATCGGAG TGGCTAGCGC ACTATCAAGC AACTCATGGC GTGCCCTGGC GCAGCCGTTT GTTTGGCCAG ATCGCTAAAA TCAACCAACT CGGCATGTTG GTTCCACGCT TAAATAATTG GGTTTTGGCG CAACCAATCA CCCGTTGGCT ACTCGATCGC AGTTTGGGCA TCGATCAACG ACGGCAACTG CCAGCTTTGG CGCGTAGCTC ATTTCGGACA TGGTTTAAAC GCCAAACCCA AAACCAAGCT GCGCCGCTTG GCCCATTGAT TCTGTGGGAT GATACCTTTA CGCTCTACAA CGAACCTCAG ATCGGCCAAG CAGCGGTCAA AATGCTTAGC GCAGCAGGCT ACAAGGTTTA TTTAATCGAA GAACGGCATT GTTGTGGCAG GCCATTAATT TCTAAGGGAA TGTTGGCCGA GGCTCGGGCA AATGCTGAGT ATAATATCGC ACTACTCGCG CCATTCGCTG AGTTGGGCGT GCCAATTATT GGGCTAGAAC CAAGTTGTAT TGCCAGCTTT CGCGATGAAT ACCCAGCGCT GGTGACGACC AAGGCTGCCA ACATCGTGGC AGCACAAAGC TATTTCATTG AAGAATTTTT GGTTAAGCTG GCGGCTGAGG GCGTTACTTG GCATTGGAAA GAGCAGCTAC CAGCCGATCA GGTTTTGGTG CATGGTCATT GCTATCAAAA AGCCTTGATC AGCACCAACC CTCTGTTGGC GATGTTGCGG CTCGTGCCAA ACTTAGCAGT CCACGAAATT GAGAGCGGTT GTTGCGGAAT GGCTGGCTCG TTTGGCTATG AACGTGAGCA TTATGAAGTT TCAATGGCTT GTGGCGAGCA GCGCCTATTT CCAGCGATTC GCAGCAGCCA CCAGCCAATT CTTGCAGCTG GCATGTCTTG TCGCCATCAA ATTGAGGCTG GCACAGGCGT AATCGCCCAG CATCCGATTG TTTTTTTGGC CGATTGTTTA GCCGATTAA
|
Protein sequence | MNHAAELQAE VLAALQRSVR GKVHADRLTC ALYSTDASSY AVMPKAVVIP HDRADLQAIV EIAQRYHVPI VPRGGGTSLS GQAIGAGIIV DLSHSFGQIG KFDPSSRQIW VEAGCTLDRL NNFLKPHGLM FGPDPSSSAS ATIGGMLGNN STGSHSIVYG MTVDHTLAME TILDDSSLAI WSTQAANNPR QQQIQAQLGQ IVERYAPNIE RDYPQVWRNV AGYNLQRLLQ RQQQQQPFSA VPILVGSEGT LGLTVAAQLS LVPRPKVTRL ALWHFDDLQR ALALVPEILR FQPSAVELFD RYFINLTRQN PEYGSRLSFV VGEPRVVLIV EWAGTDSTEL AANDAALEAH LRALGETGLI VRETTPAAIA NVWAVRKAGL GLLMSQRGDA KPLAFVDDAT VPVERLAEYA RGVEAICREA GTEATFYAHA SVGCLHINPL INLKTEHGLA QMAQISQAVA SLAISLGGTT TGEHGEGLAR SAFNQKLYGT ELHQAFGEIK QLFDPNQIFN PGKILTAPQP WQPEILRINP SYQTPHAPSV TFFDFTPDGG FAGLVEMCNG QGVCRKDDAG VMCPSYMATH DEANSTRGRA NALRAAMTGQ LGTAGLSSPE LHEAMDLCLE CKACKRECPS IVDMARLKSE WLAHYQATHG VPWRSRLFGQ IAKINQLGML VPRLNNWVLA QPITRWLLDR SLGIDQRRQL PALARSSFRT WFKRQTQNQA APLGPLILWD DTFTLYNEPQ IGQAAVKMLS AAGYKVYLIE ERHCCGRPLI SKGMLAEARA NAEYNIALLA PFAELGVPII GLEPSCIASF RDEYPALVTT KAANIVAAQS YFIEEFLVKL AAEGVTWHWK EQLPADQVLV HGHCYQKALI STNPLLAMLR LVPNLAVHEI ESGCCGMAGS FGYEREHYEV SMACGEQRLF PAIRSSHQPI LAAGMSCRHQ IEAGTGVIAQ HPIVFLADCL AD
|
| |