Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1711 |
Symbol | |
ID | 5733598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1990535 |
End bp | 1992040 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278853 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_001544482 |
Protein GI | 159898235 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02733] C-3',4' desaturase CrtD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000831031 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATG ATCTGATTGT AATTGGTGCA GGCATTGCGG GCTTGTCTGC TGCCGCCCTC TTGGCCAAGG ATGGCTACCG AGTTTTAGTG CTAGAGGCGC ATATCGAGCC TGGTGGCTGT GCTTCTTCCT ATCAACGCAA ACGCCCCAAT GGCGAACACT ATATTTTTGA TGTTGGCGCA ACTGTCTTTG CGGGCTTTCG GCCTGGCGGT GCGCATTATT GGCTTGGGCA AAAATTAGGC TTAACTTGGC CAATTCGCCC GGTCAATCCA GCGATGCAAG TTTGGTTGCC CGATCTGCGG GTGACCCGTT GGGGTGATGA GCGCTGGATC GCCGAACGTC AGCGGTTGTG CCTAAGCCAA GCATGGGAAG CTGAGCAATT TTGGCGTGAG CAAGAGCATT TGGCCGAGGT CGCTTGGCGC TTTGCTGGTC GGATGCCGGC GATGCCGCCC GAATCATTGG CCGATCTTGG TCAGTTGATA ACCCGAATTC GGCCTGAGAT GCTTGGTTTA TTGCCAGCCT TGCCGCGCAC GGTCAAGCAT GCGCTCAATC GACATAAGGT GGATGATCGA CGAATTCGCA CCTTTATTGA TGGTCAATTG TTGATTAGCG CCCAAAGTAG CCATGCCGAA TGTGCTTGGC TCTACGGAGC AGTGGCGCTC GATTTGGCGC GAATTGGTAC CTATTATGTT GAGGGTGGCG CATGGAATTT GGCCAAAACC CTCGAACAAG CCTTACTCAA GGCTGGCGGC GAAATTCGCT ATCGCCAAAA AGTAAGCCAA ATTCAGACCG ATCTGGGGCA AGTGGTTGGG GTAACCACTG AGCAGGGCGA GCGTTTTCGC ACAAAGCAGG TTATTGCTAA CACGACGGTT TGGGATTTAG CTGAGCTAAT CGATCAGCCG CCAAGTTGGA TGCTCCGCCG AACGATCAAG GCTGTACCAC AAGGTTGGGG TGCGGCAACG CTCTATCTGG GCATCGACGA AGCCGCAATT CCCCAAGGCT TAGCTGAGCA TCATCAAATT ATTGCCAACT ATGATCAAGC GCTTGGCGAG GCCAATAGTG TGTTTATTTC ATTGCATCCG GCTGATGATG CTTCGCGTGC GCCAGCTGGC CAACGAGCAA TGACCGTTTC GACCCATACC GATGTTGGGC GCTGGTGGCA TTGGCGGCAA ACTGACCCAG CTCGCTATCG GGCTGAAAAA ATAGCGATGG CTGAGCGCAT GCTCGATACT GTGGCGTTAG CAATGCCATC CATTCGCCAA CATATTCGTT ATCAACAAAT TGGTACGCCC GTTTCGTTTG CCCGCTATAC TCAGCGCAAA CGTGGTATGG TTGGCGGCTT GCCACAATGG CGTTCGGTTT CGGGCTTGCT TAGTTTGGGG CCACAAGCGG CACGCATTAA CGGCTTGTGG TTGGTCGGCG ATAGCACGTT TCCTGGGCAA AGCACCGCTG CCGTGACCCA AAGCGCGATT CAAGTTTATC AGAAAATTCG CCGTGCAGAT CGTTAA
|
Protein sequence | MDYDLIVIGA GIAGLSAAAL LAKDGYRVLV LEAHIEPGGC ASSYQRKRPN GEHYIFDVGA TVFAGFRPGG AHYWLGQKLG LTWPIRPVNP AMQVWLPDLR VTRWGDERWI AERQRLCLSQ AWEAEQFWRE QEHLAEVAWR FAGRMPAMPP ESLADLGQLI TRIRPEMLGL LPALPRTVKH ALNRHKVDDR RIRTFIDGQL LISAQSSHAE CAWLYGAVAL DLARIGTYYV EGGAWNLAKT LEQALLKAGG EIRYRQKVSQ IQTDLGQVVG VTTEQGERFR TKQVIANTTV WDLAELIDQP PSWMLRRTIK AVPQGWGAAT LYLGIDEAAI PQGLAEHHQI IANYDQALGE ANSVFISLHP ADDASRAPAG QRAMTVSTHT DVGRWWHWRQ TDPARYRAEK IAMAERMLDT VALAMPSIRQ HIRYQQIGTP VSFARYTQRK RGMVGGLPQW RSVSGLLSLG PQAARINGLW LVGDSTFPGQ STAAVTQSAI QVYQKIRRAD R
|
| |