Gene Information |
Locus tag | Haur_4048 |
Symbol | |
ID | 5736918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5166053 |
End bp | 5167093 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281199 |
Product | oxidoreductase domain-containing protein |
Protein accession | YP_001546808 |
Protein GI | 159900561 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
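
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive (which matches the reported 1041 bp) and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5166053, 5167093)
print(length)                  # 1041
print(protein_length(length))  # 346
```

Because the gene lies on the minus strand, the coding sequence reads from the End bp coordinate back toward the Start bp coordinate on the replicon. The length calculation is unaffected.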
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTA ATCTGGCTTT GGTGGGTTTT GGGATGGCCA GTCAAGTGTT TCATGCTCCA CTGATTATGG CCGAACCACA ATTAACCTTA CATACGATTG TTCAACGCAG TAGCCAAAGC GCCCGCGAAA CGTATCCCCA TGTGCAGATT GTCAGCGATC TGGCTGAAGC GCTCGCGAAC CCAGCGATAG AGTTGGTAGT GATTGCCACA CCCAACCAAA CCCACGCCGA TTTGGCAGCC CAAGCCTTGC GGGCAGGCAA ACATGTGGTG GTCGATAAGC CGTTTACGCT CAATAGTCAA CAGGCTGATC AATTAATTGC CCTAGCCCAA GAGCAACAAC GTGTGTTAAG CGTCTTTCAC AATCGACGTT GGGACGGCGA TTTTTTGACT GCTCAAGCTG TGATTCAGGC TGGCTGGTTG GGGCGTTTAG TCAGCTACGA ATGCCATTAC GATCGCTATC GACCGCAACT AAAACAAGCG GCTTGGCGTG AACAAGCTGG CCTTGGCAGC GGGATTTTGT ACGATCTCGG CGCACATTTG ATTGATCAAG CCTTGGTGTT ATTTGGCATG CCCCAATCGA TCACAGCTCA GCTTGCCAAG CAACGTGAGG GTGCGCAAAC GGTCGATTAT TTTGATTTGT GTTTGGGTTA TGCTGAGTTG CAAGTCACGC TCAAAGCAGG TATGTTGGTG CGCGAATTGG GGCCGCGCTT CAGTCTCCAT GGCACAAACG GCTCATGGAT CAAATATGGC ATTGATCCCC AAGAAGCGGC CTTGAAAGCA GGCCAAGCGC CAAACAGCCC AAATTGGGGC CAAGAACCTG CAACTGATTG GGGTATGATC AACACGGAGC TGAATGGTTT AGCGTTGCGT GGCACAGTCG AAACCATCGC TGGCAAGTAT CCGGCCTATT ATTGCAATGT AGCACAAGCC ATTCAAGGTA AGGCTGAGAT CATCGTTCAG GCCAACCAAG CTCGAAGCGT GATTCGGCTG ATTGAGTTGG CCGAGCAAAG TGCAGCTGAA CAACGCACAA TGACAATTTA A
|
Protein sequence | MPINLALVGF GMASQVFHAP LIMAEPQLTL HTIVQRSSQS ARETYPHVQI VSDLAEALAN PAIELVVIAT PNQTHADLAA QALRAGKHVV VDKPFTLNSQ QADQLIALAQ EQQRVLSVFH NRRWDGDFLT AQAVIQAGWL GRLVSYECHY DRYRPQLKQA AWREQAGLGS GILYDLGAHL IDQALVLFGM PQSITAQLAK QREGAQTVDY FDLCLGYAEL QVTLKAGMLV RELGPRFSLH GTNGSWIKYG IDPQEAALKA GQAPNSPNWG QEPATDWGMI NTELNGLALR GTVETIAGKY PAYYCNVAQA IQGKAEIIVQ ANQARSVIRL IELAEQSAAE QRTMTI
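
The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how one might strip that layout, estimate GC content, and translate the gene sequence is given below. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs mainly in the start codons it permits, and that is not handled here because this gene begins with ATG.

```python
import re
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return re.sub(r"\s+", "", raw).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 15 bases of the gene sequence as displayed in the record.
start = clean_sequence("ATGCCAATTA ATCTG")
print(translate(start))  # MPINL
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence, and `gc_fraction` can be compared with the reported GC content field.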