Gene Information |
Locus tag | Haur_1046 |
Symbol | |
ID | 5732950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1193967 |
End bp | 1194893 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641278181 |
Product | abortive infection protein |
Protein accession | YP_001543822 |
Protein GI | 159897575 |
COG category | [R] General function prediction only |
COG ID | [COG1266] Predicted metal-dependent membrane protease |
TIGRFAM ID | |
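Note: the coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 1193967
end_bp = 1194893

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codons = gene_length // 3             # total codons, including the stop
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 927 308
```

Both values agree with the Gene Length (927 bp) and Protein Length (308 aa) fields.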
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00148937 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATG GCAATGATGA TTCACAACGT GAACGCAATG ATGCAGTTTT GAAAGCACTC GGTGGTGTAC CGCAACCGCC AGTTGCTCCG TCTGTGCAAC CGCAACAACC CATGGCTCAA CCTGGCTGGC ATCCAGTCGA TCAGGGGTAT CAAGTGCCGC CAGTGCTCAT AGCCGAGGAA GGCAAAAGTA ATACCCCCAT GGCCTACTGG GTATTGCCAG ATTTGATCTT TGGCTTTTTG CTAGCCGTGT TGTTCCAAGT GATCATCATG GTTGGTTATA TGCTTGTAAA GGGCTTAAGC AGCTTATCGG ATATTGAAGG CTTGCTGACC GATCCAACCT TTATGCTGCT GAACGCTCCA ACCTTGGGCT TAGGTTTTAC TTTAGCAAGC ATCTTGCGGG TCAATATTCT GCGTAAATTG CCCTTAGCAT GGTTTGGCCT GAATGGCAAA AAGCTGGGGG TAGCTTTGGG TTTTGGCTTT ATTGCAGGCA TCAGCTTCTT AGTAACTAAT CTTATCTCGG GCGTGATTGC CCAAGCATTG GGTAGCAACC CTGATCAACA AGAACAGTTG ATTGGCCCTT TTAAAAATGC TAGCAACCTG CAAATTGGCC TGTTTGGGCT ATTTGTGGTG ATTATTGGGC CATTTTTAGA GGAAGTCTTT TTTCGTGGCT ATGCTTTTCG CGCCATCCGC CAAAAGCTTG GGGTTACGTG GGGCGTGGTG CTGAGTGGCA TTTTGTTTGC CCTGCCGCAT GCCTTTGGAG TTACAACGGG CTATTTAGGC TTGTTGATTC CGATTTTCCT TGGTGGGGCG ATTTTGGCCT TGGTTTACCA CTATACCAAT AATTTGTGGA GTGCTGTTTT AGCCCACTCA ATGAATAACT TTGTTGGATT TCTTGGCCTG CTAGCAGCCT TGAAGCTCGA CGTGTAG
|
Protein sequence | MSYGNDDSQR ERNDAVLKAL GGVPQPPVAP SVQPQQPMAQ PGWHPVDQGY QVPPVLIAEE GKSNTPMAYW VLPDLIFGFL LAVLFQVIIM VGYMLVKGLS SLSDIEGLLT DPTFMLLNAP TLGLGFTLAS ILRVNILRKL PLAWFGLNGK KLGVALGFGF IAGISFLVTN LISGVIAQAL GSNPDQQEQL IGPFKNASNL QIGLFGLFVV IIGPFLEEVF FRGYAFRAIR QKLGVTWGVV LSGILFALPH AFGVTTGYLG LLIPIFLGGA ILALVYHYTN NLWSAVLAHS MNNFVGFLGL LAALKLDV
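Note: the gene sequence is listed on the coding strand, so although the gene lies on the minus strand it can be translated directly, without reverse-complementing. The sketch below translates the first 60 bases as a spot check against the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which codons may serve as starts). `translate`, `CODON_TABLE`, and the other names are hypothetical helpers written for this example and do not come from any library.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G; last base varies fastest).
# Assumption: table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
prefix = "ATGAGTTATG GCAATGATGA TTCACAACGT GAACGCAATG ATGCAGTTTT GAAAGCACTC"
print(translate(prefix))  # prints: MSYGNDDSQRERNDAVLKAL
```

The output matches the first two 10-residue groups of the protein sequence (MSYGNDDSQR ERNDAVLKAL).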