Gene Haur_2076 details


Gene Information

Locus tag: Haur_2076
Symbol:
ID: 5733964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2584118
End bp: 2585239
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 59%
IMG OID: 641279217
Product: putative monooxygenase
Protein accession: YP_001544844
Protein GI: 159898597
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0492] Thioredoxin reductase
TIGRFAM ID:
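
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2584118
END_BP = 2585239
GENE_LENGTH_BP = 1122
PROTEIN_LENGTH_AA = 373


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))      # 1122
print(coding_length_aa(GENE_LENGTH_BP))   # 373
```

Under those assumptions both computed values agree with the lengths listed in the record.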


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0197584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAGCT ATGATGTCCT TGTGGTTGGG GCTGGCGCGG CAGGCGTTGG CATCGGCTGC 
GCACTCCAAG AACTGACCCT TACTCCCAAC CAGTGGCTGA TCATTGATCG CACAGCGGTC
GGGAGTTCGT TTCGCCATTG GCCATGTGAA ATGCGCCTGA TTACCCCATC GTTCCCTGGC
AATGACTTTG GTGTCATTGA TCTTAATGCC GTCACCCCGC ACACATCACC CGCCCTGAGC
CTAGCGGCCG AGCACCCGAG CGGCCCGGAC TACGCACGCT ATCTGTGCAG CCTTGCCGAG
CATTTCGAGC TACCCATCCG CACTGATGTC TCGGTGACGG CGGTTGAACC GGCTGATGAT
GGGTTCATTG TCCGCACCAC CAGCGAGCCG CTCCACGCAC GGCTGGTCAT CTGGGCGGCG
GGCGAGTTTC AGTACCCGCG CACAACGGGC TTTTGCGGTG CGGAGCAGTG TCTCCTGGCG
AGCACCGTCA GCTCTTGGAA CCATATTATG GGGACTGATC CCATCATCAT CGGCGGTTAT
GAAAGTGGGA TGGATGCCGC CATTCATCTC GCCCGGCGCG GCATGGCGGT ACGCGTGATC
GATGCCGGTA CGCCCTGGGA TACCATTGAC ACGGATCCCA GCCGCACGCT GTCGCCTTAT
ACCCAAGAGC GACTGCGTGC GTTGCCCAAC GGTGCGCTCA CCTTGATTGG CGAGACGCGC
GTCGAGCGCG TCGTGACCGT TGCTGAGGGC TATTACGTAT TCACCAATCA CCATCCTGTG
CCTTTGTTCT CAGCAATGGC ACCCATTCTC GCAACAGGGT TTGCGGGCAG CCTGTCTTCG
CCTGCTATCG CCCCCTTGTT TGCTCGACGA GACGATGGAT ATGTAGTCCT TACCACGGAG
GATGAATCCA CCATCACGCC AGGACTGTTC GTCGTCGGGC CGAATGTACG TCACGATGAT
CTCATTTTCT GCTTTATCTA CAAGTTCCGT CAGCGCTTTG CTGTGGTGGC GCGGGCGATT
GGGCAGCGCC TGGGACTACC AACCGATGGG CTGGACTGGT ACCGCGAGCG CGGCATGTTT
CTCGATGATC TGTCCTGCTG CGATACCACG TGTGCCTGCT AG
 
Protein sequence
MQSYDVLVVG AGAAGVGIGC ALQELTLTPN QWLIIDRTAV GSSFRHWPCE MRLITPSFPG 
NDFGVIDLNA VTPHTSPALS LAAEHPSGPD YARYLCSLAE HFELPIRTDV SVTAVEPADD
GFIVRTTSEP LHARLVIWAA GEFQYPRTTG FCGAEQCLLA STVSSWNHIM GTDPIIIGGY
ESGMDAAIHL ARRGMAVRVI DAGTPWDTID TDPSRTLSPY TQERLRALPN GALTLIGETR
VERVVTVAEG YYVFTNHHPV PLFSAMAPIL ATGFAGSLSS PAIAPLFARR DDGYVVLTTE
DESTITPGLF VVGPNVRHDD LIFCFIYKFR QRFAVVARAI GQRLGLPTDG LDWYRERGMF
LDDLSCCDTT CAC
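
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a small sketch, not part of the original record, of how the blocks might be joined and how GC content could be computed. The helper names are hypothetical. Only the first and last lines of the gene sequence are embedded here as an excerpt; to reproduce the GC content field, the full gene sequence would have to be passed to `gc_fraction`.

```python
# Stop codons under translation table 11 (the same three as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGCAGAGCT ATGATGTCCT TGTGGTTGGG GCTGGCGCGG CAGGCGTTGG CATCGGCTGC"
last_line = "CTCGATGATC TGTCCTGCTG CGATACCACG TGTGCCTGCT AG"

excerpt = join_blocks(first_line + "\n" + last_line)
print(excerpt[:3])                  # ATG
print(excerpt[-3:] in STOP_CODONS)  # True
```

The excerpt begins with ATG and ends with the stop codon TAG, which is consistent with the protein sequence starting with M and ending at the final C residue.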