Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3255 |
Symbol | |
ID | 5735123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4117233 |
End bp | 4118333 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280401 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001546020 |
Protein GI | 159899773 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0515461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTACG TTAACCCCTC GCCAACGATT TCAAACGTTG TCTGCAATCA ATTATCAATT GTGCTCTTGG TGGGCGTAAT GTTGGGAACA TTAACTGCCT GTAGCACTTC AAGTTCTAAT CCGTTGCCTA CAATTGGCAT TGATGTCGGG CCAGGGGTTA ACTTTCGCGG CGATATTGTG GCGCTCCAGT TTCTGCCCAA CCAACAGCTT TTGGTGGGGG TTGGCTACGA TGGGGTTTAT CGTTGGAAGC TTGCTACGAG CACAATTGAA CAAACCCTTG CCGCCAAACA AATTTACTTG GCCTTTACAG CTTCAGGGCC ATTGGTTGTA AGCACTGATC GTAAAACGCT GACGACTTGG AATAGCACTG ATGGCCAAAA AATTCTGGCG TGGAATGCCA AACCATTGCA ACTGCCTAGC CAAACCACTG CCTTTGAAGT CAGTGCCCTC GCAATCACAC CTGATCAGCA GCAAATTATT GCAGCCTACA ACAAAGGTAG CATGCTCCAA GCTTGGAACG TGGCGACTGG TGCGGCAACA ACGACCTTTG GTGCTCCGGC CAAAACTGGC TCAATTGTTG AAATTGCGCT CAGCCCTGAT GGTCAATTAC TGGCCAGCAA CGATTTTAGT GGCGTAGTGC AGATTTGGGA TGTAGTGAGT GGTCAGCAAT TACATTCATT CAAAGAAGCC AGCCTCAACT ATCAACCAGG CAAATTGGCT TGGAGCCACG ATGGCAAATG GCTGGCAGCC AGTAGCGGCG ATAAAAACGG CGGCGGAGTC GCGATTTGGG ATACCAGTTC ATGGTCAATC TATGCCACGC ATCGCAGCAG TGAGCACCAA TTTGCTGGTT TGGCCTTTCA TCCAACCGCT CCAACGTTGG CGATTGGCAA TAGTAGTGGC TTGATCGAGT TGTACGATCT GACCAGCAAA CAAGTCAGCA ACAGCCTCAA AGGCCATGCC GAGCGGGTTA CAACCTTGGC ATGGAATGCT GATGGCAGCC AATTGGCCTC AGGCGGCAAA GAACCATTTG TCCTGATTTG GGATAGTACA AGCCTGACCG AGCAGCAACG TTTGGTTTTG CCAGCTACAC CATTACAGTA A
|
Protein sequence | MRYVNPSPTI SNVVCNQLSI VLLVGVMLGT LTACSTSSSN PLPTIGIDVG PGVNFRGDIV ALQFLPNQQL LVGVGYDGVY RWKLATSTIE QTLAAKQIYL AFTASGPLVV STDRKTLTTW NSTDGQKILA WNAKPLQLPS QTTAFEVSAL AITPDQQQII AAYNKGSMLQ AWNVATGAAT TTFGAPAKTG SIVEIALSPD GQLLASNDFS GVVQIWDVVS GQQLHSFKEA SLNYQPGKLA WSHDGKWLAA SSGDKNGGGV AIWDTSSWSI YATHRSSEHQ FAGLAFHPTA PTLAIGNSSG LIELYDLTSK QVSNSLKGHA ERVTTLAWNA DGSQLASGGK EPFVLIWDST SLTEQQRLVL PATPLQ
|
| |