Gene Information |
Locus tag | Haur_2076 |
Symbol | |
ID | 5733964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2584118 |
End bp | 2585239 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641279217 |
Product | putative monooxygenase |
Protein accession | YP_001544844 |
Protein GI | 159898597 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase |
TIGRFAM ID | |
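
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length), and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2584118
END_BP = 2585239


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1122, matching "Gene Length"
print(protein_length(length_bp))  # 373, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length rows of the record.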
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0197584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGCT ATGATGTCCT TGTGGTTGGG GCTGGCGCGG CAGGCGTTGG CATCGGCTGC GCACTCCAAG AACTGACCCT TACTCCCAAC CAGTGGCTGA TCATTGATCG CACAGCGGTC GGGAGTTCGT TTCGCCATTG GCCATGTGAA ATGCGCCTGA TTACCCCATC GTTCCCTGGC AATGACTTTG GTGTCATTGA TCTTAATGCC GTCACCCCGC ACACATCACC CGCCCTGAGC CTAGCGGCCG AGCACCCGAG CGGCCCGGAC TACGCACGCT ATCTGTGCAG CCTTGCCGAG CATTTCGAGC TACCCATCCG CACTGATGTC TCGGTGACGG CGGTTGAACC GGCTGATGAT GGGTTCATTG TCCGCACCAC CAGCGAGCCG CTCCACGCAC GGCTGGTCAT CTGGGCGGCG GGCGAGTTTC AGTACCCGCG CACAACGGGC TTTTGCGGTG CGGAGCAGTG TCTCCTGGCG AGCACCGTCA GCTCTTGGAA CCATATTATG GGGACTGATC CCATCATCAT CGGCGGTTAT GAAAGTGGGA TGGATGCCGC CATTCATCTC GCCCGGCGCG GCATGGCGGT ACGCGTGATC GATGCCGGTA CGCCCTGGGA TACCATTGAC ACGGATCCCA GCCGCACGCT GTCGCCTTAT ACCCAAGAGC GACTGCGTGC GTTGCCCAAC GGTGCGCTCA CCTTGATTGG CGAGACGCGC GTCGAGCGCG TCGTGACCGT TGCTGAGGGC TATTACGTAT TCACCAATCA CCATCCTGTG CCTTTGTTCT CAGCAATGGC ACCCATTCTC GCAACAGGGT TTGCGGGCAG CCTGTCTTCG CCTGCTATCG CCCCCTTGTT TGCTCGACGA GACGATGGAT ATGTAGTCCT TACCACGGAG GATGAATCCA CCATCACGCC AGGACTGTTC GTCGTCGGGC CGAATGTACG TCACGATGAT CTCATTTTCT GCTTTATCTA CAAGTTCCGT CAGCGCTTTG CTGTGGTGGC GCGGGCGATT GGGCAGCGCC TGGGACTACC AACCGATGGG CTGGACTGGT ACCGCGAGCG CGGCATGTTT CTCGATGATC TGTCCTGCTG CGATACCACG TGTGCCTGCT AG
|
Protein sequence | MQSYDVLVVG AGAAGVGIGC ALQELTLTPN QWLIIDRTAV GSSFRHWPCE MRLITPSFPG NDFGVIDLNA VTPHTSPALS LAAEHPSGPD YARYLCSLAE HFELPIRTDV SVTAVEPADD GFIVRTTSEP LHARLVIWAA GEFQYPRTTG FCGAEQCLLA STVSSWNHIM GTDPIIIGGY ESGMDAAIHL ARRGMAVRVI DAGTPWDTID TDPSRTLSPY TQERLRALPN GALTLIGETR VERVVTVAEG YYVFTNHHPV PLFSAMAPIL ATGFAGSLSS PAIAPLFARR DDGYVVLTTE DESTITPGLF VVGPNVRHDD LIFCFIYKFR QRFAVVARAI GQRLGLPTDG LDWYRERGMF LDDLSCCDTT CAC
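
As a rough illustration of how the Gene sequence, Protein sequence, Translation table, and GC content rows relate, the sketch below strips the 10-base spacing, computes GC percentage, and translates codons until the first stop. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted); alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# shares these assignments and differs only in its allowed start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGCAGAGCT ATGATGTCCT TGTGGTTGGG"))  # MQSYDVLVVG
print(translate("TGTGCCTGCT AG"))                     # CAC
```

Passing the full Gene sequence value to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content rows; the two short calls shown only cover the first and last blocks of the record.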