Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1039 |
Symbol | |
ID | 5732943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1185827 |
End bp | 1186867 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278174 |
Product | oxidoreductase domain-containing protein |
Protein accession | YP_001543815 |
Protein GI | 159897568 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTG CAATTATTGG AACCGGTTGG GGTGCTCGCG TTCAGGTGCC AGCCTTTCGT TCGGCTGGGC TGAAAATCGT GGGGATCGCC GCCCAAAATT ATGAAAAAAC TCAGCGTGAA GCTGCCACTT TGAATGTTGA AGCCTTTGAA CATTGGCGTG ATTTGCTCAG CAGCGATGCC GATTTGATTT CGATTGTGAC CCCGCCAGGG ACGCATTGCG AAATCAGCGT AGCGGCCTTA GAAGCTGGCA AGCATGTGTT GTGCGAAAAA CCAACAGCAT TAAATGTGCT CGAAGCCCAA ACCATGCTCG AAGCCGCCCA AGCCCATCCT GAACAATTAA GTTTGATCGA TCATGAATTA CGCTTTTTAC CAATTTTTCA AATGGCGCGG GCGTTGATTA ATGATGGTGC GATCGGCCAG ATTCGCCATG TCAATAGCAG CGTGATCTTC TCGTCGCGAG CTGACCCGCA ACGTCCTTGG AACTGGTGGA GTGATAAAGA GCAAGCTGGT GGTGCTTGGG GTGCGATTGG CTCACACCAA ATTGATATGT TGCGCTGGTT GTGTGGCGAT TTTAGCTCAA TTCGCGCAAG CTTGCACACC TTTGTAACTG AACGACCACT CGACGATCAA CTCTTGCCTG TCACCAGTGA TGATTTTGCC ACGGCTCAAG TGCGTTTGGC GAATGGTGGT TTTGCCTCAA TTATGATTAG TGGCGTGGCG GCACTCAACG AAAACGATCG TATGATTATT CATGGCGAAC ATGGCGCGAT CAAAATTGAA GGCGCTCGTT TGTGGCATGC CGAGCGTGAT GGCGAGTGGC AAGAGCGCAC GCCTGCTCAT ACGGTAGCGA TTCCAAGCGA AATTAGTGGT AACTTCCCAG TGGGAACGGT CTATCTTGGC CATGCCTTGA AGGCCTACAG CCGTGGTCAG CTTGATGCGT TGGAGCAAGC CGCCACGTTT AGCGATGGCT TGCTGACCCA AAGTTTGCTT GATGCTGCTC ATCGCTCCGA TGAAAATGAC GGTGGCTGGA TTACGATCTA G
|
Protein sequence | MKIAIIGTGW GARVQVPAFR SAGLKIVGIA AQNYEKTQRE AATLNVEAFE HWRDLLSSDA DLISIVTPPG THCEISVAAL EAGKHVLCEK PTALNVLEAQ TMLEAAQAHP EQLSLIDHEL RFLPIFQMAR ALINDGAIGQ IRHVNSSVIF SSRADPQRPW NWWSDKEQAG GAWGAIGSHQ IDMLRWLCGD FSSIRASLHT FVTERPLDDQ LLPVTSDDFA TAQVRLANGG FASIMISGVA ALNENDRMII HGEHGAIKIE GARLWHAERD GEWQERTPAH TVAIPSEISG NFPVGTVYLG HALKAYSRGQ LDALEQAATF SDGLLTQSLL DAAHRSDEND GGWITI
|
| |