Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1222 |
Symbol | |
ID | 5733115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1413309 |
End bp | 1414373 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278362 |
Product | hypothetical protein |
Protein accession | YP_001543998 |
Protein GI | 159897751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCAC CAACCAGTTT GCAGCGTGGC ACTGGCTTTC ATCATCTGAT CGCCGATGAA TATCGGTTGA TTGCTCAGGT GTTGACCTAT TGCCACGCCG CTGGTTATGC CATCCCGCCT CTGTTAGTGA TTGATTATGT AATTGCGCTC AAAACCAGCC CATTTGTCTT GTTGTTTGGG CCGACTGGGC AGGGTAAAAC TGAATTGGCA CGGCTGTTTG CTCAGGCTTT GGTGCATCCC TTTGAAGATC AATATACCTA TGTGAACTTA GGGAGCAGCC TGCAAGCACC CGAAATCCAA GATCGTTTTG GTTGGATGAA ATTTGTCGAG ACCTTAGAGA ACGCCGCTGC TCCGGCCAAT GCTGGTCGTT TGTTCTTCTT ATGTCTCGAT AATTTGCGAC CCCATGATGT GTATACCTAC TTCGCCAATG TTAGCCGTGA TGCCGATGGA ATTACGCGCT TGGTGATGCG TGGTTATCCA CCTGATAAAT GGCCAGCCTT GCCCAGTAAC GTGATTATCA CCGGAACACT TGATGCTGAA CATCCGGTTG ATAGCGATCA AAGTGCCCTT TTAGCCCAAA TTAACTGTGT CTATATGCAG CCGCAATGGT TGCAAAGCAA TATTCGCACA GTTAAACGTC AACAACGCAT GGCTCCGGTT GGCATGCAAC GCTTGTTGCT TGAGCAACAG TATCGCAGTG ATGAGGCTGC CACTGAGCGT TTACACGCCT TGCTCGGCCC GCATCTCGAC GATATTTTGC AACCGCCTAG CGAGTTGATG GCAATTTTGT GGCAAAGTAG TTTGACCTAC ACCCAGACTT GGCGTAGTAC CATGTTGCGG GCTGTGGCAA ATAGCTTTAC CATAGAGGGG CACGGACTTT TTATTCCATA TGATGTACTT AACAACGCCC AGTTCGCCTA TTCGTTTGTC ATGGCACGCC AATTGTTGTT AAGGCTTTGG GGGCAACCGC AGCATAGCGC CGCAATCGAG GCCTTGTTGC ATCGTCAACT TGCCCAACTA CCAAGCACAA CCTTGGTAGG GTTAACCGAC TCTTTGCTCT ATTAG
|
Protein sequence | MSPPTSLQRG TGFHHLIADE YRLIAQVLTY CHAAGYAIPP LLVIDYVIAL KTSPFVLLFG PTGQGKTELA RLFAQALVHP FEDQYTYVNL GSSLQAPEIQ DRFGWMKFVE TLENAAAPAN AGRLFFLCLD NLRPHDVYTY FANVSRDADG ITRLVMRGYP PDKWPALPSN VIITGTLDAE HPVDSDQSAL LAQINCVYMQ PQWLQSNIRT VKRQQRMAPV GMQRLLLEQQ YRSDEAATER LHALLGPHLD DILQPPSELM AILWQSSLTY TQTWRSTMLR AVANSFTIEG HGLFIPYDVL NNAQFAYSFV MARQLLLRLW GQPQHSAAIE ALLHRQLAQL PSTTLVGLTD SLLY
|
| |