Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0785 |
Symbol | |
ID | 5732669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 885055 |
End bp | 886548 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277915 |
Product | hypothetical protein |
Protein accession | YP_001543561 |
Protein GI | 159897314 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTTG GTAGCTTATG GCTGGTTGCC CCAGAGGATT TTTGGTGGCA GCTCAAAATT GGCGATCTGA TTCGCACAAG TGGCAGAATC CCCACTGTTG GCGTTTTTTC CGCCACCCAA GCCAATACAG CTTTTTTCTA CCAAAACTGG CTCAGCCAAT TCCTCTTTTC GTGGATCTAT CAGCTTGGTG GTTTAGTTGC AATTTTACAA ATCCGCAGCC TTTTATTAAT CGGCAGTTAT GCTTTATTGC TCTGGCATAC CTGGCGACGG GTAAAAGCCA ATGGGCGGGC CGCCATGCTC GGTTTGTTGT TGAGTGTGCT GGTAAGTTTC AATCATTGGC AAGTTCAGCC TGCGATGTTT GTATGGCCGT TGTTTATCGC TAGTTTTGTG ATTGTTAGCG AAGTTGCTGC CGAGCGCTGG ACAACGAAAT ATCTCTGGTT GCTCCCGATC ATCCAATTAC TTTGGGTCAA CCTGCATGAA AGTTTCATCT TTGGGCCAAT CCTGGTTGCA ACAGCCGCTG TAGGCGCAAT CATCGATCGA CGGCGCGACC ATGATGAAGC ACCAATTTAT GCAACTGCCC GAGCGCTGCA AATCGCCACG GCAACGACGA CCATCGCGAG TTTTATCAAC CCGCATGGCT GGAATGGCTG GATCGCCGCA TGGCAACAAT TAACCAGCAT AGTTCCCGAA GCTTTACAAA CTCAATCAGG CTCACCACTG CTGAATTTTG CTACTCCGAT GGCTCAAGTG AGCTTGGCGG TTGGTTTAAT TGCAGCCATG ATGTTATCGG TAGTTTGGCA GCGCATGCGC AGCGCCGATC TGATCATAAC GGCAATCATG GCAGCCTTTA GTTTACTCAG CATGCGTTAT CAATTTTGGT TTGGTAGTGT GGCAGGCCCA ATTATTGCCG AGGCAATTGT GCGCCGTGGC CGCTTACGCC TGATTAAACG TAACCCCAGC GCACCGATAT GGATTGCCGG ATTAACAATC ACGATTGGTT TGATTGGGCT GCTTATGCAA CCAATTATTC GTATTTGGCT GCCTTTACCA GCGGCTTTGC AGGGTGCAAC TGGCAATTTG CCGCAAGCAA CATTAGCGAG TGCTGCTACC CCAATTCAAG CAGTCGAGTT TTTACAGGCC AATCCACCAA GCCAAGCCTA CTTCCATGAT CTTGGCTATG GCAGCTATTT AATCTGGCAA GCTGGTGAGC AATTGCCTGT ATTTATCGAT CCGCGAGTTA GCTTATACCC AACCGAACAT TGGCAAGCCT ATAGTTGTAT TATGGCAGGC CGCGATTGGG AACGGCTGCT AACTCAAGAT TCAATTGATA CAATATTAGT GGATCGCGGA AACGGCCAAC AATTGATCAG CGCAGTCCAA GCCAATTCAG CTTGGCGCGA GGTTTATGCC GATCAACAAA GCCTGATTTT CAAGCGTGAT CCGCAAGCAG CCCAACCAAC TGGCTCAGCC ACGAGCTGTC CAGCAACTAA GTAG
|
Protein sequence | MAVGSLWLVA PEDFWWQLKI GDLIRTSGRI PTVGVFSATQ ANTAFFYQNW LSQFLFSWIY QLGGLVAILQ IRSLLLIGSY ALLLWHTWRR VKANGRAAML GLLLSVLVSF NHWQVQPAMF VWPLFIASFV IVSEVAAERW TTKYLWLLPI IQLLWVNLHE SFIFGPILVA TAAVGAIIDR RRDHDEAPIY ATARALQIAT ATTTIASFIN PHGWNGWIAA WQQLTSIVPE ALQTQSGSPL LNFATPMAQV SLAVGLIAAM MLSVVWQRMR SADLIITAIM AAFSLLSMRY QFWFGSVAGP IIAEAIVRRG RLRLIKRNPS APIWIAGLTI TIGLIGLLMQ PIIRIWLPLP AALQGATGNL PQATLASAAT PIQAVEFLQA NPPSQAYFHD LGYGSYLIWQ AGEQLPVFID PRVSLYPTEH WQAYSCIMAG RDWERLLTQD SIDTILVDRG NGQQLISAVQ ANSAWREVYA DQQSLIFKRD PQAAQPTGSA TSCPATK
|
| |