Gene Information |
Locus tag | Haur_4994 |
Symbol | |
ID | 5736830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6332762 |
End bp | 6335176 |
Gene Length | 2415 bp |
Protein Length | 804 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282161 |
Product | hypothetical protein |
Protein accession | YP_001547752 |
Protein GI | 159901505 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
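The coordinate and length fields above are linked by simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Haur_4994
length = gene_length_bp(6332762, 6335176)
print(length, protein_length_aa(length))
```

With the values listed for Haur_4994, this reproduces the stated Gene Length (2415 bp) and Protein Length (804 aa).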
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.158573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGACGAG TTTTTTGTTG GTTAGGATGT TTATTAATTG GATTAAGCTT CGTGCCGAGC CATACCAGCA CGGTTGCGGC CACGTATCAA GTGCAAGCCG AGTTTCAACA ATTTTGGCAG GCGAACGGCG GCGCAGCAAT CTTTGGTCAA CCAATAAGCG AAGCGCTGTG GTTCGATAGT CAGTTGGTAC AATATTTCGA AAACCAACGC TTGCACTTGG TCAACAATCA AGTTGTGCCC GCCAAGCTTG GGCTAGAGCT ATTTCAGGCC AGCCGCCGAA CCTGGCAAAA CCAACGCCAA CATCTGCCAA CTAGCCAGTG TTTGCGGGTT GAAACAACCG AACGCTCAAT CTGCGAGCCG TTTCGTCGCT ACTGGCAAGC CAATGGCGAG GCCAGTCGTT TGGGCAACCC CATCACTGAA ACGGTCAGCG AGACTAATCC ATTAACTGGT CAGCAACAAA TTATGCAATA TTTTGAGCAG CAATTGCTGA TCAGCACCAA CCAACAAATT ATGCTCAGCC CACTTGGACG GTGGCAGGCC GATTGGCTGC TGAATGGCAC ACGCCAAGCC AGCCCAATTC GTGCCAATTT AACTGGCCCA AGCCAACCAC TCAAGCCACT CGACCAATTT GAAATTCAAA TTGATGCTGG CAACTATAAC GGCGCAGCCA ACCTACGCAT CTTTGATAGT GCTGGCCAAC TTGAAACCAG CCAAACCCTA AACCTCACAG GCCAAAGCCA AACCCTCGCA TTCCAAGCCC AAGGCGCGTT AGGCCAGCAT TACGCCGTCT TATTGATCGA TGGCAAGGTG GCGGCGATCA ACAGCAGCAT CTATCAACTT GAGGCGACCA CCAGCCTTCA AACTGGAGTT GCGGCCTACG ATACGCTGCC CAACAAAGTG CGTAGCTTTT TGCGCAACGA CCTATCGATC TATCAATATC AAGGCTACAC GATTCGCGGC TATCGTTCGC CCGATAGCTA CCTGATTTGG CTGCGTGATC ATGTTCATCA AGGCTTGGGC TATCGCTATT TTGAGCAAGA TATGACCAGC ACGCTAGATT ATTTTCGGCG TGAGCAAAAG CCCAATGGAG CCTTCGACGA CTATTTTGCA ATGCTTGGCG GCGCACCAGT CCAAGGCCGT ACCGCTGTCG AGGCTGATTT AGAATATTTA TTTGTGCAGG GCGTGCATCA AGCCTGGCAA GCCACTGGCG ATACTAGTTG GATGCTTGAG CAAAAACCTG CCATGCTACG CGGCATCAAC TATAGCCTGA GCGATCCACA GCGCTGGAAT CCCACCCTGC GTCTAATTCG CCGCCCCTAT ACGATTGACA CATGGGATTT TGAATATGGT GGGCCGACGA TTGCCCCTGA TGGCAAAACC TCGCCACGCC ATTGGATCGA CGAAAAAACC CGCTTTGGAA TTATGCACGG CGATAATACG GGCATGGCTC ATGGCTTGTA TTTGCTGAGC AAACTTGAAC TAACTCAAGC AAATTTTGAG CAAGCTACTC AATGGCTGGT TCGTTCGCAA CAACTAACCA AGCAGCTGAA TCAGGTTGCG TGGAACGGCA AATTCTACAC CCACAATGTT TTAGAGCAGC CTTTCGATAT TCCTGAACTC GATGAGGCTC GCCAACTGTC GCTCTCCAAC TCGTATGCGC TCAATCGCTA TGGCATGGAA GCCAGCAAGG CCTTGGCAAT CATCGACGAA TATTATCAGC GGCGGGTGGC TGATTCTAGC AGTCTTTCCT CAGAATGGTT CAGCATCGAC CCGCCATTTC CTGCCGAGAG TTTTGGCACA 
TTGCCAGGCT GGGGCAACGT GCCTGGCGAA TATGTTAATG GTGGGCGGAT GCCGCTAGTT GGCGGCGAAT TAGCTCGTGG CGCATTTCGT TGGGGCCAGC CAGCCTATGG CTTTGATATT CTGCGCCGCT ATGCCCAAAT GATTGAAGCC CAAGGCGGCA GCTATTTGTG GTATTACCCG GTCGGCAACC CTGGTATCTC TGGCCCCGAC ACCCTCGCCA CCGACGGCTG GGGCAGCACG GCAATGCTGG CAGCGCTAAT CGAAGGCGCA GCCGGGGTGA CCGATCAAAG TGCGTTGTAT CAGCATGCAG TGCTCAGCCC ACGTTGGATT GTTGAGCCAG ATGTGCAACA AGCCCAGGTC ACCACCCGCT ATGCTGCTTC GCAAGGCTAT ATGAGCTATC GCTGGCAACG CCAAGCTCGT GGTTTTCAGC TTGATTTTAC CGGCAGCGCT GAACAAGTCA CATTGCAATT ATTGCTACCC AACGATGCTC CACAACACGT TAATTTAACG ATCAATGGGT TACCATCATT CGGCCATGAA CGCACAATCG GCCAAAGTCG CTACCTTGAA CTGCGGCTAA ACAAGGCCAC TGGTTCGATT ATGGTCAATT GGTAG
|
Protein sequence | MRRVFCWLGC LLIGLSFVPS HTSTVAATYQ VQAEFQQFWQ ANGGAAIFGQ PISEALWFDS QLVQYFENQR LHLVNNQVVP AKLGLELFQA SRRTWQNQRQ HLPTSQCLRV ETTERSICEP FRRYWQANGE ASRLGNPITE TVSETNPLTG QQQIMQYFEQ QLLISTNQQI MLSPLGRWQA DWLLNGTRQA SPIRANLTGP SQPLKPLDQF EIQIDAGNYN GAANLRIFDS AGQLETSQTL NLTGQSQTLA FQAQGALGQH YAVLLIDGKV AAINSSIYQL EATTSLQTGV AAYDTLPNKV RSFLRNDLSI YQYQGYTIRG YRSPDSYLIW LRDHVHQGLG YRYFEQDMTS TLDYFRREQK PNGAFDDYFA MLGGAPVQGR TAVEADLEYL FVQGVHQAWQ ATGDTSWMLE QKPAMLRGIN YSLSDPQRWN PTLRLIRRPY TIDTWDFEYG GPTIAPDGKT SPRHWIDEKT RFGIMHGDNT GMAHGLYLLS KLELTQANFE QATQWLVRSQ QLTKQLNQVA WNGKFYTHNV LEQPFDIPEL DEARQLSLSN SYALNRYGME ASKALAIIDE YYQRRVADSS SLSSEWFSID PPFPAESFGT LPGWGNVPGE YVNGGRMPLV GGELARGAFR WGQPAYGFDI LRRYAQMIEA QGGSYLWYYP VGNPGISGPD TLATDGWGST AMLAALIEGA AGVTDQSALY QHAVLSPRWI VEPDVQQAQV TTRYAASQGY MSYRWQRQAR GFQLDFTGSA EQVTLQLLLP NDAPQHVNLT INGLPSFGHE RTIGQSRYLE LRLNKATGSI MVNW
|
| |
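The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand) and that elongation codons in translation table 11 follow the standard genetic code. Alternative start codons are not handled, and all names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (used here for table 11 elongation).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record
print(translate("ATGAGACGAG TTTTTTGTTG GTTAGGATGT"))  # MRRVFCWLGC
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` gives the complete comparison.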