Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3396 |
Symbol | |
ID | 5735257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4278197 |
End bp | 4279354 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280543 |
Product | IS605 family transposase OrfB |
Protein accession | YP_001546160 |
Protein GI | 159899913 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000848869 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAATGC TGACCCGTTG TTACAAATAC CGTCTGCAAC TCACACCGAC CCACGTTGAA ACCTTGGTAC AGTGGGCGGG TTGTCGGCGC TTCGTCTGGA ATTGGGCGCT GCACTGCAAG CAAACCCACT ACCAAACAAC GGGTCAACGG CTGAGCTATC AACGGCTTGC GGCGGCATTG GTTGATCTGA AACGTCAGCC CAAAACGGCA TTTTTGCGTG ATTGCCACTC ACAACCGTTG CAACAAACCT TGATGGATTT GGAAACGGCC TTCAGCAACT TTTTTGCCAA ACGCGCCAAG TACCCGCGAT TCAAATCACG CAAAATCACG CCGCACAGCC TACGCTTCCC GCAAGGTGTG ATCGTGGTTG ATGAACATAC CATCAGCGTG CCAAAAATCG GGCTGATACG GGCGATCATT CATCGCCCCT TGCAAGGCAT AGCGAAGAGT GCAACGATCA AACAGGATGC CACAGGCGCG TGGTGGGTCA TTTTCGTCTG TCATATCGAC CTCCCTGATG TTCAACCAAC AGCTGATCGA CCTGTGGGCA TTGATGTCGG GCTTGAATCC TTCACCACGC TGTCAACGGG CGAGAAGACA GCACCACCAA AGTTCTACTG TCGAAGCCAA AAGAAACTTG CCCGTGCTCA GCGCAAACTC TCACGCGCCC AAAAGGGCAG CAACAACCGC TTGAAAGCAA AAAAGCACGT TGCCCGTATC CACAAGAAAA TCAACAACCA ACGTGCCGAT TGGCTGCATA AGCATGCGTT GGGGATAGTT CGCCAATTTG ACGTGGTGTG CATCGAAGAC CTGAATATTA AAGGCCTTGC GAGAACCAAG CTGGCCAAAT CATTCAGTGA TGCCGCACTG AGTACCTTCA TGCAACGATT GCAGGAAAAA GCTGAATGGC ACGGACGACG AGTTGTTAAG ATTGGGCGGT TCTACGCCTC ATCGAAAACT TGCCACTTCT GTCATTCCAA GACTGCCTTG ACGCTGGCTG ACCGCGTGTG GACATGCCCC ACCTGTGGCA CGACCCATGA TCGCGATGGC AACGCCGCGA TCAACATGCT GTATGAAGGG CTACGCCTGC TTGCCGTTGG GACGACGGAA AGCCAAAACG CTGCTCGAGA TGGTGTAAAC CCAGCGAAAC GCTGGTAG
|
Protein sequence | MRMLTRCYKY RLQLTPTHVE TLVQWAGCRR FVWNWALHCK QTHYQTTGQR LSYQRLAAAL VDLKRQPKTA FLRDCHSQPL QQTLMDLETA FSNFFAKRAK YPRFKSRKIT PHSLRFPQGV IVVDEHTISV PKIGLIRAII HRPLQGIAKS ATIKQDATGA WWVIFVCHID LPDVQPTADR PVGIDVGLES FTTLSTGEKT APPKFYCRSQ KKLARAQRKL SRAQKGSNNR LKAKKHVARI HKKINNQRAD WLHKHALGIV RQFDVVCIED LNIKGLARTK LAKSFSDAAL STFMQRLQEK AEWHGRRVVK IGRFYASSKT CHFCHSKTAL TLADRVWTCP TCGTTHDRDG NAAINMLYEG LRLLAVGTTE SQNAARDGVN PAKRW
|
| |