Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4399 |
Symbol | |
ID | 5736249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5622285 |
End bp | 5623490 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641281561 |
Product | hypothetical protein |
Protein accession | YP_001547159 |
Protein GI | 159900912 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATC TGAACAAAAC TAGCAATGCA ATTTTGCAGC AATTACTTGA TCAACACGAG CAGCCAGAGC GTCAGCGAGT CAATCGGGTG CAGATTAAAG CGGCCAAATT TTCGCGTTAT TTTGATGATA AACAGATTGA CGAACGCCAA CAAACCAATA ACTACTTGGT TGAGCTAGCA AAAAATCAGA TAATCAAGTT GTATTGGCGT AAATGGGAAG AAGGCAATTG GCTTGAAGCG GTTGATTTGC TTGATGCGGC GGCGCTATAT CGTTTGCTGA AACGCCAACC CTTGGCCGAG CAACAGCAGG CCTTGCGTAC ATTATTGGCT GAATATGTGC CAGTTCAAGG CTGGCTGGCT GATTGGCTGG CGTGGCTCGA ACAGCAATTA ATCCAACAGC GCTCGATCCA ACCGCTTGAT TTAACCGACC CTGCTTGGAA TCGCGATGTA CTACGGGCGA TTTATGGCCT GACTCAGCTA GAAACGCCAA TTTTAGAGCG TTTATTTAGC GTAGGTTGGC TGGGTCAGAG CAAACGATTT AGCGAGCTAG AAGGCGCAGT TTTGCGGGTT TTGCGCCAAT TTGCCCCGCA AGCCAAGCAA TTTGGCGACA ATGATCGGGC TTTGCTGCAA GCCTTTAATC TCGAAAAAGT GCCAGAATAT GTGCTACTTG CTGGCGATTT ACAACTGGAA TTGCATGGGA ATCGGCTGGA ATTAGGCGCG TTTCGGCCAA GTTTGGGCTT GCCTAGCTCG ATGTTGCGTC AAGCTCAGGT GCTGGATTCA GCCTGTACTG AAATTATTAC GATTGAAAAC TTGACCAGCT TCCACAGTAT GCTTGCCCGC CAACCACAGG CGTTGTTGAT CTATACCGGT GGCTTTGCTA GCCCCAGCCT CTGCCAATTT TTGAGCAAAC TAGCCATGGC TTTGCCCAAT TTAACGTGGT ATCACTGGGG CGATTACGAT GTAGGTGGTT TGCGAATTTT GGCGCATCTA CGCCAGCATG TTGCCAAGAT TCGCCTGTGG CAGCCTGATC CAGCGATTTT TCAGCGGGCT GGCAAAGCTA CCCAAAGCTT GAATTCTAAA GAACGCCAAA GCCTCACTGA ACTCCAACAA CACCCATTGC TTTATGATTG CCAAGCCTTG ATCGGGGCAA TGCTGGAACA GAATATCAAA CTTGAGCAAG AGCAACTTGA TCTTTTGGGG CACTAA
|
Protein sequence | MLNLNKTSNA ILQQLLDQHE QPERQRVNRV QIKAAKFSRY FDDKQIDERQ QTNNYLVELA KNQIIKLYWR KWEEGNWLEA VDLLDAAALY RLLKRQPLAE QQQALRTLLA EYVPVQGWLA DWLAWLEQQL IQQRSIQPLD LTDPAWNRDV LRAIYGLTQL ETPILERLFS VGWLGQSKRF SELEGAVLRV LRQFAPQAKQ FGDNDRALLQ AFNLEKVPEY VLLAGDLQLE LHGNRLELGA FRPSLGLPSS MLRQAQVLDS ACTEIITIEN LTSFHSMLAR QPQALLIYTG GFASPSLCQF LSKLAMALPN LTWYHWGDYD VGGLRILAHL RQHVAKIRLW QPDPAIFQRA GKATQSLNSK ERQSLTELQQ HPLLYDCQAL IGAMLEQNIK LEQEQLDLLG H
|
| |