Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0077 |
Symbol | |
ID | 5731950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 100187 |
End bp | 101365 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277199 |
Product | hypothetical protein |
Protein accession | YP_001542857 |
Protein GI | 159896610 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATGG AAGGGTTTCG TTGGTTGTTG AGCGACGCTG GCCAACAACT CTTGGCTGAA CTCAGCAATG ATCGCGATTT GAATGAAGCT AACTTTTTGC GCTACACCAC CAAATTGCGC AAACACTACC CAGCCGAGGC GGTAACGGCG GCTTTGGAAA CCAACTTATT GCGCCGCGCC GCCCAAGCTA AATTTCCCCA AGCCAGCCAA CTATATTTTA CCCGTGAGGC CTTGGAGCAA GCTACGCCTT GGCTGGTTGC TAGCTATCGC CAACGCCATT TTGCGACTGG CTCGCGGCTG GTCGATTTGG GTTGCTCGGT TGGCGGCGAT GCCTTGGCCT TAGCGCAAAG TTGCTCGGTT TTGGCGATCG ATCGTGATCC ATTGCGGTTG GCAATGCTTG AGGCCAATGC TCAGGCGCTT GGGCTAAGCC AGCAAATCAG CATCCAAGAG GCCGATTTTA CCACGCTAGA ATTTGCGGGC TACGCTGGTT TATTTATCGA TCCGGCGCGG CGCAGCAATG GCAAGCGCAT TTGGGATGTT GAGCACTATC AGCCGCCGCT TTCTACCTTG GAGCGTTGGC GTGGGCAAGT GCCAATCCAT GGAGCTAAAG TTGCCCCAGG TATTCCCGAT GATGCCGTGC CTGCTGGCTA TGATCTTGAG TTTATTTCGC TTGATGGCGA TTTGCGCGAG GCCTGTTTGT GGTGGCAAGC TGGTCAGGTT GGCGGGCAAC GCAAGGCAGT GGTGCTCACT AGTGCTGGTG CTGAACACAG CTTAATTGCC GATTCCACCC AAGCCGCCGC TGCACTCAGC GAGCCACTGG CCTATTTGTA CGAGCCTGAT CCAGCGGTGA TTCGAGCGCA CGCCGTGGCA GATATTGCCA ATCAGTTGGA TTTAGCTCAA TTTGATGCCA GCATTGCCTA CCTTACCAGT GATCGCTTGG TACAATCGCC ATTTCTGCGA GCTTGGCAAA TTGAGCAATG GCTACCGTTT AATTTGAAAC TTCTGCGCCA AATATTGCAA GCGCGTGAGA TTGGCCGCGT AACCGTCAAA AAGCGTGGCT CGCCGATTAC CCCCGAAGAA TTAAGCAAAC AACTGCGCTT GAAGGGTCGC TACGAGCAAA CGTTGGTGCT GACCAAACTA CAAGGCCAGC CAGTCGTGCT GTTGGTAAAA TTGCTTTAA
|
Protein sequence | MEMEGFRWLL SDAGQQLLAE LSNDRDLNEA NFLRYTTKLR KHYPAEAVTA ALETNLLRRA AQAKFPQASQ LYFTREALEQ ATPWLVASYR QRHFATGSRL VDLGCSVGGD ALALAQSCSV LAIDRDPLRL AMLEANAQAL GLSQQISIQE ADFTTLEFAG YAGLFIDPAR RSNGKRIWDV EHYQPPLSTL ERWRGQVPIH GAKVAPGIPD DAVPAGYDLE FISLDGDLRE ACLWWQAGQV GGQRKAVVLT SAGAEHSLIA DSTQAAAALS EPLAYLYEPD PAVIRAHAVA DIANQLDLAQ FDASIAYLTS DRLVQSPFLR AWQIEQWLPF NLKLLRQILQ AREIGRVTVK KRGSPITPEE LSKQLRLKGR YEQTLVLTKL QGQPVVLLVK LL
|
| |