Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1023 |
Symbol | |
ID | 5732927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1166937 |
End bp | 1168370 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278158 |
Product | hypothetical protein |
Protein accession | YP_001543799 |
Protein GI | 159897552 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCA CCACAAGTCA AGAGCTGATC GGCTGGCATC GGTATCTTGA GCAATTGGGT CCACGGCTGA CTGGCAGTGA GGCTCATCAG GCCTTTATCG AATTTTTAGC GACAGAATTA ACCAGTTTGG GCTGTGAGGT TCAGCGTGAT CGCTATTATT TTACCCGTTG GCAAGCTCAA AACTGGAGTT TAGCGCTCCT CGATAGCGCT GGCAACGAAA CGAGCATTCC CTGTAGCTTT TATTACCCAT ATTCAGGCTC TACACCGCCT GAGGGCATCA TTGGCGAGTT GGTGGATTGT GGCAAAAGCC CTGGCAATTT TCAGCAAGCT GCTGGCAAAA TTGCTTTGGT TGAAGTGGCA GTTCCGGCGT TACCAACCAT GCTATTTCTG CACCCAACCA AATTCGCCCA AGCCCAGAAA TTGCCAAAAC TGCTGCGAAA CCCAACCCTT GGCTCGTTTT TGACTGGCCC AAATTTAGCC GCCGCCAAAC AAGCTGGAGT AAAAGGGGTA ATTTGCATTT GGTCGAAAAT CTCAGCGGCC AATGCCGATG CTCAATATTT GCCCTTCACC ACCAGCTATC AAGCTTGCCC AGCGCTTTGG GTCAACGCTG CGGTTGGTCA GCAACTCAAA CAGGCAGTTG GCCAAAAGAT TCGCTTCACC CTCGAAGCAA CGCTCACTGA GCAATGCCCA ACTGATAGCT TGTATGTGGT TTTGCCAGGT CAGCAATCCA ACGAAAGCCT TTTGATTAAT ACTCATACCG ATGGGCCGAA CGCACCTGAG GAAAATGGCG CACTGGGCTT GCTGGCATTA GTGCGTTGGT TCAAACAGCA GCAGCATCAA CGGAATTTGA TTTTTATTTT TGCCACAGGC CATTTTCAAT TGCCGCAACT TGGCAAGCAT GGCCAAGCTA CCAGCACATG GCTCGCTGAG CACCCCGAAT TGTGGAATGG CCAGCAAATG CGGGCAATTG CAGGTGTGAC CTTAGAGCAT TTGGGCTGTA CTGAATGGCT CGATAATCGG GCATTAAGCG ATTATCAACC AAGCCAGCAA CCTGAACTTG AGCTAACCTA CACCACCAGC CCAATGTTGG AGCAACTGTA TTACACCGCG TTGTTGCAGC GCACCAAACA GCGGGTACTC ACGATTATGC CAATTAACGA GATTTATTTT GGCGAGGGCG AGCCATTCTA CAAAGCCAAC ATTCCGACAA TTTCGCTGAT TCCAGCGCCT AATTATCTAT GTGCCACACC GAGCAACGCT GTAATCGATA AACTTGATTT TGATTTGATG CAGCAACAAA TCGAAACCTT CGCCCGCGTG ATCAAGATGA TCGATCAGAT CAGCACTAGC CATTTGGGCG TAGCTGAACC CCAGCCATTT AGCCTTGTTG GCAGTGTATT CCGCCGCATG GTTGGAGCCA ATCAGCGGCA CTAA
|
Protein sequence | MKITTSQELI GWHRYLEQLG PRLTGSEAHQ AFIEFLATEL TSLGCEVQRD RYYFTRWQAQ NWSLALLDSA GNETSIPCSF YYPYSGSTPP EGIIGELVDC GKSPGNFQQA AGKIALVEVA VPALPTMLFL HPTKFAQAQK LPKLLRNPTL GSFLTGPNLA AAKQAGVKGV ICIWSKISAA NADAQYLPFT TSYQACPALW VNAAVGQQLK QAVGQKIRFT LEATLTEQCP TDSLYVVLPG QQSNESLLIN THTDGPNAPE ENGALGLLAL VRWFKQQQHQ RNLIFIFATG HFQLPQLGKH GQATSTWLAE HPELWNGQQM RAIAGVTLEH LGCTEWLDNR ALSDYQPSQQ PELELTYTTS PMLEQLYYTA LLQRTKQRVL TIMPINEIYF GEGEPFYKAN IPTISLIPAP NYLCATPSNA VIDKLDFDLM QQQIETFARV IKMIDQISTS HLGVAEPQPF SLVGSVFRRM VGANQRH
|
| |