Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0201 |
Symbol | |
ID | 5732096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 235861 |
End bp | 237018 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277325 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001542981 |
Protein GI | 159896734 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000194492 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACGC TGACCCGTTG CTATAAATAT CGTCTTCAAC CCTCACCTAC CCACGTCGAA ACCTTGGTCC AGTGGGCGGG TTGTCGGCGC TTCGTCTGGA ACTGGGCGCT AGGTCAGAAA ACAGACCATT ATCGTGCAAC AGGTCAACGG CTAAGTTACT CGCAACTGGC GGCAGCGTTG GTTGATCTGA AACGTCAGCC CAAAACGGCT TTTTTGCGCG AGTGTCATTC GCAGCCCCTG CAACAAGCGC TGATAGATTT AGAAACGGCC TTTACCAACT TTTTTGCCAA ACGCGCCAAA TACCCTCGTT TCAAAGCCCG CAAAGTCACT CTGCACAGTC TCCGCTTCCC GCAAGGTGTG GCAGTAGTCA ATGAACGCAC CATTAGCGTA CCAAAAATCG GGCATATGCA GGCAATCATT CATCGACCGC TGCTGGGAAT CGTGAAGGGT GCAACGATTA AACAAGATAC CACAGGCGCA TGGTGGGTGG TGTTTGTCTG TCATACTGAG CGCCCTGATG TGCTGCTCAC GACTGATCGG CCTGTAGGCA TTGATGTGGG ACTTGAATCC TTCACCACGC TGTCAACAGG CGAGAAAACT GCACCACCCA AATTCTACCG CCGAAGCCAG AAGAAACTTG CCCGTGCTCA ACGCAAACTC TCTCGCGCAC AAAAGGGCAG CAACAACCGC TTGAAAGCAC GCAAGCGGGT TGCTCGTATT CACAAGAAAA TCAGCAACCA ACGCGCCGAT TGGCTCCATA AACAGGCGTT GGGGATGGTT CAACGATTCG ATGTGGTGTG CATCGAAGAC CTGAATATTA AAGGCCTCGC GAGAACCAAG CTGGCCAAAT CATTCAGTGA TGCCGCCCTG AGTACCTTCA TGCAACGATT GCAGGAAAAA GCCGAATGGC ACGGGCGGCG GGTGATTAAG GTCGGGCGGT TCTATGCCTC ATCAAAAACC TGTCACCACT GCCATATCAA AACCGCGTTG ACGTTGGCGG ATCGTGTGTG GACATGTCAC GCCTGTGGCA CGACCCATGA TCGTGATGGC AACGCCGCGA TCAACATCGT GCACGAAGGG CTACGACTGC TTGCCGTTGG GACGGCGGAA AGCCAAAACG CTGCTCGAGA TGGTGTAAAC CCAGCGAAAC GCTGGTAG
|
Protein sequence | MRTLTRCYKY RLQPSPTHVE TLVQWAGCRR FVWNWALGQK TDHYRATGQR LSYSQLAAAL VDLKRQPKTA FLRECHSQPL QQALIDLETA FTNFFAKRAK YPRFKARKVT LHSLRFPQGV AVVNERTISV PKIGHMQAII HRPLLGIVKG ATIKQDTTGA WWVVFVCHTE RPDVLLTTDR PVGIDVGLES FTTLSTGEKT APPKFYRRSQ KKLARAQRKL SRAQKGSNNR LKARKRVARI HKKISNQRAD WLHKQALGMV QRFDVVCIED LNIKGLARTK LAKSFSDAAL STFMQRLQEK AEWHGRRVIK VGRFYASSKT CHHCHIKTAL TLADRVWTCH ACGTTHDRDG NAAINIVHEG LRLLAVGTAE SQNAARDGVN PAKRW
|
| |