Gene Haur_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3126 
Symbol 
ID5734998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3944153 
End bp3947041 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content52% 
IMG OID641280269 
ProductFAD linked oxidase domain-containing protein 
Protein accessionYP_001545891 
Protein GI159899644 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCACG CAGCAGAACT TCAGGCTGAG GTTTTGGCTG CACTTCAGCG GTCGGTTCGT 
GGCAAGGTTC ACGCCGACCG CTTGACATGT GCGCTGTATA GCACCGATGC CTCTTCCTAC
GCCGTCATGC CCAAAGCGGT GGTTATTCCG CACGATCGCG CCGATTTGCA GGCAATCGTC
GAAATTGCTC AACGCTACCA TGTGCCAATT GTGCCGCGTG GCGGTGGCAC AAGCCTCTCG
GGCCAAGCGA TTGGCGCGGG CATTATCGTC GATCTTTCGC ATTCATTCGG CCAAATTGGC
AAATTCGATC CAAGTAGCCG CCAGATTTGG GTTGAAGCTG GCTGCACGCT TGATCGCTTG
AATAATTTCC TCAAACCACA TGGCCTAATG TTTGGCCCCG ACCCTTCGTC GAGTGCTTCG
GCCACGATCG GCGGCATGCT CGGCAATAAT TCAACTGGCT CGCATTCAAT TGTCTATGGC
ATGACTGTTG ATCATACCCT CGCCATGGAA ACGATTTTAG ATGATAGCTC GCTGGCGATT
TGGTCAACCC AAGCAGCCAA TAATCCGCGT CAGCAACAGA TTCAAGCTCA ACTTGGCCAA
ATTGTTGAGC GTTATGCTCC CAACATTGAG CGCGATTATC CACAAGTTTG GCGCAATGTG
GCAGGCTACA ATCTCCAACG CCTGCTGCAA CGCCAGCAAC AGCAACAACC ATTTAGCGCT
GTGCCAATTT TGGTTGGTAG CGAGGGCACG CTCGGCTTGA CGGTTGCCGC CCAACTCAGC
TTGGTACCAC GGCCAAAGGT CACACGGTTG GCACTCTGGC ATTTTGATGA TCTTCAGCGA
GCCTTGGCTT TAGTACCCGA AATTCTGCGC TTTCAGCCAA GCGCTGTCGA ATTATTTGAT
CGCTATTTTA TTAATTTGAC TCGCCAAAAC CCTGAGTATG GTTCGCGCCT GAGTTTTGTG
GTGGGCGAGC CACGGGTGGT CTTAATTGTT GAATGGGCGG GAACCGATTC CACTGAATTA
GCTGCCAACG ATGCGGCCTT AGAAGCTCAT TTACGCGCGT TGGGCGAAAC TGGCCTGATT
GTGCGCGAAA CAACACCCGC TGCCATTGCC AATGTTTGGG CGGTGCGCAA GGCTGGATTA
GGCTTGTTGA TGAGCCAGCG CGGCGATGCC AAACCCCTGG CTTTTGTTGA TGACGCAACC
GTGCCAGTCG AACGTTTGGC CGAATATGCC CGTGGCGTTG AAGCGATTTG CCGCGAAGCT
GGCACTGAAG CCACGTTTTA TGCTCATGCC TCGGTTGGCT GCTTACACAT CAATCCATTA
ATTAATTTGA AAACTGAGCA TGGTTTAGCC CAGATGGCTC AAATTTCGCA GGCCGTTGCT
AGTTTAGCAA TTAGCCTTGG CGGCACAACC ACGGGCGAAC ATGGCGAGGG TTTGGCTCGA
TCTGCCTTCA ATCAAAAATT GTATGGCACG GAACTGCATC AAGCTTTTGG CGAAATCAAG
CAATTGTTCG ACCCAAACCA AATCTTCAAC CCAGGCAAGA TTTTGACCGC ACCCCAGCCG
TGGCAACCTG AAATCCTGCG CATCAACCCC AGCTACCAAA CGCCGCATGC GCCCAGCGTA
ACTTTTTTTG ATTTTACGCC TGATGGCGGT TTTGCTGGCT TAGTCGAGAT GTGCAACGGC
CAAGGGGTTT GCCGCAAAGA TGATGCAGGC GTAATGTGTC CATCGTATAT GGCGACTCAC
GACGAGGCCA ACTCGACTCG TGGCCGTGCC AATGCCCTAC GTGCAGCCAT GACTGGTCAG
CTTGGAACGG CTGGATTGAG CAGCCCTGAA TTGCACGAGG CCATGGATTT ATGCCTTGAA
TGCAAGGCTT GTAAACGCGA ATGCCCATCG ATTGTTGATA TGGCGCGGCT CAAATCGGAG
TGGCTAGCGC ACTATCAAGC AACTCATGGC GTGCCCTGGC GCAGCCGTTT GTTTGGCCAG
ATCGCTAAAA TCAACCAACT CGGCATGTTG GTTCCACGCT TAAATAATTG GGTTTTGGCG
CAACCAATCA CCCGTTGGCT ACTCGATCGC AGTTTGGGCA TCGATCAACG ACGGCAACTG
CCAGCTTTGG CGCGTAGCTC ATTTCGGACA TGGTTTAAAC GCCAAACCCA AAACCAAGCT
GCGCCGCTTG GCCCATTGAT TCTGTGGGAT GATACCTTTA CGCTCTACAA CGAACCTCAG
ATCGGCCAAG CAGCGGTCAA AATGCTTAGC GCAGCAGGCT ACAAGGTTTA TTTAATCGAA
GAACGGCATT GTTGTGGCAG GCCATTAATT TCTAAGGGAA TGTTGGCCGA GGCTCGGGCA
AATGCTGAGT ATAATATCGC ACTACTCGCG CCATTCGCTG AGTTGGGCGT GCCAATTATT
GGGCTAGAAC CAAGTTGTAT TGCCAGCTTT CGCGATGAAT ACCCAGCGCT GGTGACGACC
AAGGCTGCCA ACATCGTGGC AGCACAAAGC TATTTCATTG AAGAATTTTT GGTTAAGCTG
GCGGCTGAGG GCGTTACTTG GCATTGGAAA GAGCAGCTAC CAGCCGATCA GGTTTTGGTG
CATGGTCATT GCTATCAAAA AGCCTTGATC AGCACCAACC CTCTGTTGGC GATGTTGCGG
CTCGTGCCAA ACTTAGCAGT CCACGAAATT GAGAGCGGTT GTTGCGGAAT GGCTGGCTCG
TTTGGCTATG AACGTGAGCA TTATGAAGTT TCAATGGCTT GTGGCGAGCA GCGCCTATTT
CCAGCGATTC GCAGCAGCCA CCAGCCAATT CTTGCAGCTG GCATGTCTTG TCGCCATCAA
ATTGAGGCTG GCACAGGCGT AATCGCCCAG CATCCGATTG TTTTTTTGGC CGATTGTTTA
GCCGATTAA
 
Protein sequence
MNHAAELQAE VLAALQRSVR GKVHADRLTC ALYSTDASSY AVMPKAVVIP HDRADLQAIV 
EIAQRYHVPI VPRGGGTSLS GQAIGAGIIV DLSHSFGQIG KFDPSSRQIW VEAGCTLDRL
NNFLKPHGLM FGPDPSSSAS ATIGGMLGNN STGSHSIVYG MTVDHTLAME TILDDSSLAI
WSTQAANNPR QQQIQAQLGQ IVERYAPNIE RDYPQVWRNV AGYNLQRLLQ RQQQQQPFSA
VPILVGSEGT LGLTVAAQLS LVPRPKVTRL ALWHFDDLQR ALALVPEILR FQPSAVELFD
RYFINLTRQN PEYGSRLSFV VGEPRVVLIV EWAGTDSTEL AANDAALEAH LRALGETGLI
VRETTPAAIA NVWAVRKAGL GLLMSQRGDA KPLAFVDDAT VPVERLAEYA RGVEAICREA
GTEATFYAHA SVGCLHINPL INLKTEHGLA QMAQISQAVA SLAISLGGTT TGEHGEGLAR
SAFNQKLYGT ELHQAFGEIK QLFDPNQIFN PGKILTAPQP WQPEILRINP SYQTPHAPSV
TFFDFTPDGG FAGLVEMCNG QGVCRKDDAG VMCPSYMATH DEANSTRGRA NALRAAMTGQ
LGTAGLSSPE LHEAMDLCLE CKACKRECPS IVDMARLKSE WLAHYQATHG VPWRSRLFGQ
IAKINQLGML VPRLNNWVLA QPITRWLLDR SLGIDQRRQL PALARSSFRT WFKRQTQNQA
APLGPLILWD DTFTLYNEPQ IGQAAVKMLS AAGYKVYLIE ERHCCGRPLI SKGMLAEARA
NAEYNIALLA PFAELGVPII GLEPSCIASF RDEYPALVTT KAANIVAAQS YFIEEFLVKL
AAEGVTWHWK EQLPADQVLV HGHCYQKALI STNPLLAMLR LVPNLAVHEI ESGCCGMAGS
FGYEREHYEV SMACGEQRLF PAIRSSHQPI LAAGMSCRHQ IEAGTGVIAQ HPIVFLADCL
AD