Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0919 |
Symbol | |
ID | 5732688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1050704 |
End bp | 1052536 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641278051 |
Product | hypothetical protein |
Protein accession | YP_001543695 |
Protein GI | 159897448 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03605] SagB-type dehydrogenase domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGG CAATTGATTT ACACAGCCAA CTCCAGAGCG ATCCGCAGGT GCAACTTCCA ACTCGCCCAC GGATCTCCGA TGAAATTGCC CGAGTTTGGC ATAACGAACA GCTGTTGTTG TTGCTGGGCG GGCCGCAAGA TGTGGTGTTG CGTGGCAAAG CGGTGCGCAA TCTGCTGCCG CAACTCTTGC CCTTGCTGAA TGGCCTCAAC ACCCGCGAAA CGATTGGCGA ACGCTTGGCC CAATTCAAGC CCACAACGAT TGACGATAGC TTGCAGTTGT TGCATGTGCA GGGGGTGCTT GAAGAAGGCC CATTGCCGAG CAATACACTT GATCCGCAAG TTGCCCGTGC GTTCAGTCAA CAACTAGCTT TTTACAGCCG CTTTGTTGAC CAAACGCGGG TCTGTCGCAA TCGTTATGAA GTGCTGCAAC GCTTGCAAAC AACGCCTTTG TTGTTGATTG GTGGCGAACG CTTGGTTGCT CCATTGCTGC ATCAATTGGC GCAAGCTGGC TTAGGCCGCG CCACTTGGCT GGCCAATTCA GCTCCAACCA CAACCTATGC CCTGCCACAT TTGGCGCTTG ATCTTCAAGT TCCGCAGGCT GATCAATTAC CTGCAGCAGT TGATCATTGG TTAGCTGAGC ATCACGATGG CTTGATCTGT CTGCTGACCA GCACGGCGCA GGCCGAATTA ACCCAAGCGC TCAATCAACG GGCAATTCAA GCGCAAACTC GCTTTACCCG ACTTTGGCTG CACCCACAGC ATATTGAACT TGGCCCAACC ACCTTTGCTG GTGAAGCTGG CTGTTATGCC TGTGCCGAGC AAGTTGATAG CCACGAACCT CAATTAAGCG AGCCAACCGC GCCGCTGAGC CTCAACGAGC AGTTAGCCTT GAGCCAAGCA AGTTTAGTGG TTGGCAACCT AATTTCGGGT TTGAGTCCGG TGATTACGGG CGGCGTGCGT TATGTGCTTG ATCCAACGAC GCTTGAATTT GTGGCGCAAT CGGTGCATCG TTTGGTGTTG TGCCCAGTTT GCGGCCAGCC CAACCTCGAT GCGCCCCGCG ATCTGTTGGT GGGCGCGGGC CATTTCGAGA ATTTGCCGCT GTGGTATCAC GCCAACACCG ATGAGCGCAG TTACGCGATT TTCCCCAAGG CGCATCAGCA GCATTATGCG CCAAAAAATG CGATTGCAAT CGTCAGTGGA GCCAAGGGTT ATACCAATAG CCAACGCTAT GCCCTGGGCG AATTGCAAGG CCAATGGCAG CCTGATTCAA CTGTTGATCT GAGCGTTCAA CTGCCAGTTC AAACCTCGAT TGCGCCCTTG GCTTGGCTGT TTGAGCAAGC ATTTGTGCGC AAGCCAGCCA CCCAGGTGAT CGCCGCTGGC CAACGTTTTG TACCTTCGGG CGGCAATCTA GCCTCGCAAA CGGTCTATTT GCTCAATCAC AAGCTGAGCA ATTTGGCGGC TGGCATCTAT CACTTGAACC AACATGACGC TTCGTTGGAA GCAATGCGCC CACAGTTTAG CTGGGATGAC GTTGCCAAAG CCTTCCCCAG TGATCCGCTG AATCAACAAA CCTTGGGTTT GATTGTGCTG ACGGCGGCGA TTGGGCGGGT TGAAGGCAAA TATGGCGCAA AATCGTATCG GATCGCGCTC TACGATTGTG GCGTTGCCGC GCAGGCAATT GAATTTTTGG CAGCTGCGGC AGGCTGGCAA ATCGAGCAAA TCAGCGCCTT CTACGATCAA GAGCTGCGCG ATCTGCTCCA GGTCTATAGC CCAAGCGAAA CCCCGTTGCT GGTGCTGCGG CTGGTTGCGC CCAATCCTGA GGTGCTGCAA TGA
|
Protein sequence | MTQAIDLHSQ LQSDPQVQLP TRPRISDEIA RVWHNEQLLL LLGGPQDVVL RGKAVRNLLP QLLPLLNGLN TRETIGERLA QFKPTTIDDS LQLLHVQGVL EEGPLPSNTL DPQVARAFSQ QLAFYSRFVD QTRVCRNRYE VLQRLQTTPL LLIGGERLVA PLLHQLAQAG LGRATWLANS APTTTYALPH LALDLQVPQA DQLPAAVDHW LAEHHDGLIC LLTSTAQAEL TQALNQRAIQ AQTRFTRLWL HPQHIELGPT TFAGEAGCYA CAEQVDSHEP QLSEPTAPLS LNEQLALSQA SLVVGNLISG LSPVITGGVR YVLDPTTLEF VAQSVHRLVL CPVCGQPNLD APRDLLVGAG HFENLPLWYH ANTDERSYAI FPKAHQQHYA PKNAIAIVSG AKGYTNSQRY ALGELQGQWQ PDSTVDLSVQ LPVQTSIAPL AWLFEQAFVR KPATQVIAAG QRFVPSGGNL ASQTVYLLNH KLSNLAAGIY HLNQHDASLE AMRPQFSWDD VAKAFPSDPL NQQTLGLIVL TAAIGRVEGK YGAKSYRIAL YDCGVAAQAI EFLAAAAGWQ IEQISAFYDQ ELRDLLQVYS PSETPLLVLR LVAPNPEVLQ
|
| |