Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0339 |
Symbol | |
ID | 5732249 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 405877 |
End bp | 406881 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277463 |
Product | hypothetical protein |
Protein accession | YP_001543119 |
Protein GI | 159896872 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00289774 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATCA CAATTGAACA TGTTCGCGTT TTAGAAGGCC CAAATATCTA TTACCCTCAG GCTGGGGTCG CTGCAAGCTT GCAAGTTAGC CACGATCTGC GCGATGAACT TGGTCGTCAG CTCAAAACTT GGGCGCAGGC AGTTGGCTTA ATCATTGGCT ATCTGCGCAT GAAAATCGAG CCGATTGACG ACCAATGGCT ATTAAGCCTG AGTTTTACCT GCAATCATCC CCAACTTGGT GCGGCTATCT TGCAGCATGC GGTCGAAGAT ATACTGGCCG GCGAACGCCA AGACGAAGAT TGGAACCACG ACGATGCCTT GTTTGATCTG CGTCGTCAGC GCATGCGGAT TGATCCGGTG TTGCCGTTGT TGCAATTACG GGCCGAAGCT CAAGGGCGGG TTCTGCCTGT GATCGCGGTT GGTGATGGCA TGCTCCAAAT CGGCACCGGC AGTGGTGGTT GGCAGTTTGA TCCGGCGCAG CTAAGCCTTG GCTTTGCGAT CAACCCGCCA TGGGAGCAAA TTCGTAGCGT GCCTTTGATT GCGATTGGTG GGGTTGGGGC TGAAATTGCC GCAGCCCAGA TTGCCCAAGG ATTAACTGCG GCTGGTTGGC AACAGGTTAT GCATATTGTT AAGGGCGATT TTGCTAGCGT GCGCCAAGCA TTTCTCCAGC CCAAGGCTGA AATCTTTGTG ATTGCTTTGG ATCATAGCGA TGCTGTCGAG CGTGGTTTAG CCTTTAATCG CTGCACGATG GGGGTGGTAC TTGGCATTAG CGATTTGCCT GAAACTCAGG CTTTGGCGGC GGGGTTGCCA GCCCTAACTG CCGATGAACT AGGCAATACC ATTTTGCTGG CCGATGATCA ACGGACTGCT GGCTTGGCGC GACGCACCGC AGCCCCAGTT GTGCAACTAC AACGTACCCA CGAACCAGCC AGTATTCAAC AACCCTTATT AGCGTTGGTG GTCAATCAAT TGCAACAATT GCTCGACGCT GGAGCATTTG ATTAA
|
Protein sequence | MPITIEHVRV LEGPNIYYPQ AGVAASLQVS HDLRDELGRQ LKTWAQAVGL IIGYLRMKIE PIDDQWLLSL SFTCNHPQLG AAILQHAVED ILAGERQDED WNHDDALFDL RRQRMRIDPV LPLLQLRAEA QGRVLPVIAV GDGMLQIGTG SGGWQFDPAQ LSLGFAINPP WEQIRSVPLI AIGGVGAEIA AAQIAQGLTA AGWQQVMHIV KGDFASVRQA FLQPKAEIFV IALDHSDAVE RGLAFNRCTM GVVLGISDLP ETQALAAGLP ALTADELGNT ILLADDQRTA GLARRTAAPV VQLQRTHEPA SIQQPLLALV VNQLQQLLDA GAFD
|
| |