Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2528 |
Symbol | |
ID | 5734406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3233289 |
End bp | 3234299 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641279668 |
Product | hypothetical protein |
Protein accession | YP_001545294 |
Protein GI | 159899047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0857848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTAA TTGAACGTTA TCTTGCAGCA GTTAGCAATT ATTTACCTGC AAAACAACGC AGCGATATTT TGGATGAGCT ACGTTCTTCG ATCTATGATA GCCTTGAGCG CAACGATCAA GCGCTCGACG ATGAAGCGGC TGTTGTCGCA ACCCTACAAG CGCTGGGCGA GCCAGCCAAA GTAGCGGCGG CGTATGGTAA CAACCAACAA TATTTGATCA GTCCAGCACT GTTTCCACAA TTTCGATCGG TGGTGTTGCT GGTATTTAGC ATTATTATCG TTAGTCAGCT TGGTTTAGCC CTGCTGGCAA CGATCGGCAA TTATCACCTG AATATTGTCC AAGTTGCCTG GAACGCCATC AGCAATCTGC CAGCAACCTT TGGCTTGATT GTCGCGATTT TCTGGGGCGT ACAAAAGCTT GAAATTGAGG CCGAAACTGA GCCGAAAAAG CCGTTCGACC CACGCAAATT GCCAGCAATT ACTCTCGCTA ACGAGAAAAT TAGCCGTAGT TCGCAACTGA TTGGGATTGC AGTTCAGGTT ATTTTGTTGG GCTGGCTGAT GCAATTTCAG GCCGAAGGCG GCTTCCGCTG GGTTGATGGC AGTGGCTTGT TTGAAAACCC AGTGATTAGT CAATATTTTG CCTTGGTGGT CGTTGCCAGT ATTTTCAATG TGGTCGTTGA TCTGATTGTG ATGTGGCGTG GAGTTTGGCA AACCAGCACC CGTATCGCCT CACTAGCCGC CAGTGGCTTT AGTTTAATTG TGTTGTTCCT GTTGATTCGC GGCCATGCTG CCTGGCTGAC CAACGCAGGC TATCCCAATT CATTACGTCA ACTAATCCGC CTTGGTGAGC TAATTCGCGA AAATAACCCG GCAATTGGCA TGAGTTCGTT CTATTATGGC TTATCAATCA CCGCCTTTTT CGTAATTATC GACGCAGGCT ACACGGCTTA CAAGCTGTAC CAAGAGCGTA GCCAAGCCAA ATATAACAAC AATATTGCTC CAGCAAGCTA A
|
Protein sequence | MELIERYLAA VSNYLPAKQR SDILDELRSS IYDSLERNDQ ALDDEAAVVA TLQALGEPAK VAAAYGNNQQ YLISPALFPQ FRSVVLLVFS IIIVSQLGLA LLATIGNYHL NIVQVAWNAI SNLPATFGLI VAIFWGVQKL EIEAETEPKK PFDPRKLPAI TLANEKISRS SQLIGIAVQV ILLGWLMQFQ AEGGFRWVDG SGLFENPVIS QYFALVVVAS IFNVVVDLIV MWRGVWQTST RIASLAASGF SLIVLFLLIR GHAAWLTNAG YPNSLRQLIR LGELIRENNP AIGMSSFYYG LSITAFFVII DAGYTAYKLY QERSQAKYNN NIAPAS
|
| |