Gene Information |
Locus tag | Haur_4737 |
Symbol | |
ID | 5736581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6047072 |
End bp | 6048187 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281902 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001547496 |
Protein GI | 159901249 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
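The coordinate and length fields above can be checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 6047072
end_bp = 6048187
gene_length_bp = 1116
protein_length_aa = 371

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is a whole number of codons; the last one is assumed
# to be the stop codon, which does not appear in the protein.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1116 372 371
```

Under these assumptions the span matches the listed gene length, and 372 codons minus one stop codon gives the listed 371 aa.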
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACGC TGACCCGTTG CTATAAATAT CGCCTGTATC CTGCCACTGA CCAACAAAAC ACCTTGGTAC AGTGGGCGGG TTGCCGACGA TTTGTCTGGA ATTGGGCATT GCACTGCAAG CAAACCCACT ACCAAGCAAC GGGTCAACGG CTGAGCTATC AACGGCTTGC GGCGATGTTG GTTGATCTGA AACGTCAGCC CAAAACGGCA TTTTTGCGTG ATTGCCATTC GCAACCCTTG CAACAAGCGC TGATGGATTT AGAAACGGCC TTTACTCACT TTTTTGCCAA ACGGGCGAAG TATCCCCGTT TCAAAGCACG CAAAGTCACA CCGCATAGCC TCCGCTTCCC GCAAGGCGTG GTCGTCGTTG ATGAACACAC CATCAGCGTG CCAAAAATCG GGCTGATGCA GGCGATCATT CATCGCCCAC TGCTGGGAAC AGCAAAGGGC GCAACGATCA AACAAGACGC AACGGGTGCA TGGTGGGTCG TTTTTGTTTG CCACATCAAC CGCCCTGATG TTTTGCTAAC TACTGATAAT CCTGTGGGCA TTGATGTGGG ACTTGAATCC TTCACTACCC TGTCAACGGG AGAGAAAACT ACACCGCCCA AATTCTACCG TCGAAGCCAA AAGAAACTTG CCCGTGCTCA GCGGAAACTC TCACGCGCCC AAAAGGGCAG CAACAACCGC TTGAAAGCTC GTAAGCACGT TGCCAAAATT CACCAGAAAA TTAGCAACCA ACGCGCCGAT TGGCTGCATA AGCATGCGTT GGGGATCGTT CGCCAATTTG ATGTGGTGTG CATTGAAGAC CTGAATCTCA AAGGCCTTGC GAAAACCAAG CTGGCCAAAT CATTCAGTGA TGCCGCCCTG AGTACCTTCA TGCAGATGTT ACACGATAAA GCGGAATGGC ACGGACGGCG AGTGATTAAG GTTGGGCGGT TCTACGCCTC ATCAAAAACC TGCCATCACT GCCAAACGAA AACCGCCTTG ATGCTATCAG ATCGCGTGTG GACATGCCCC ACCTGTGGCA CGATCCATGA TCGCGATAGG AATGCGGCGA TCAACATCGT GCACGAAGGA ATACGCCTGC TTGCCGTTGG GACGACGGAA AGCTAA
Protein sequence | MRTLTRCYKY RLYPATDQQN TLVQWAGCRR FVWNWALHCK QTHYQATGQR LSYQRLAAML VDLKRQPKTA FLRDCHSQPL QQALMDLETA FTHFFAKRAK YPRFKARKVT PHSLRFPQGV VVVDEHTISV PKIGLMQAII HRPLLGTAKG ATIKQDATGA WWVVFVCHIN RPDVLLTTDN PVGIDVGLES FTTLSTGEKT TPPKFYRRSQ KKLARAQRKL SRAQKGSNNR LKARKHVAKI HQKISNQRAD WLHKHALGIV RQFDVVCIED LNLKGLAKTK LAKSFSDAAL STFMQMLHDK AEWHGRRVIK VGRFYASSKT CHHCQTKTAL MLSDRVWTCP TCGTIHDRDR NAAINIVHEG IRLLAVGTTE S
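The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any comparison. The following is a minimal sketch, not the database's own method, of how the gene sequence could be translated and its GC fraction computed. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 amino acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence as printed in the record.
first_blocks = "ATGCGAACGC TGACCCGTTG CTATAAATAT"
print(translate(first_blocks))  # MRTLTRCYKY
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would check the two rows against each other; `gc_fraction` on the same input can be compared with the listed GC content.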