Gene Information |
Locus tag | Haur_2045 |
Symbol | |
ID | 5733934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2556632 |
End bp | 2557792 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641279189 |
Product | transposase mutator type |
Protein accession | YP_001544816 |
Protein GI | 159898569 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
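The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_length = end_bp - start_bp + 1
    # One codon is three bases; subtract the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the Haur_2045 record.
gene_len, protein_len = cds_lengths(2556632, 2557792)
print(gene_len, protein_len)  # 1161 386
```

Under these assumptions the result agrees with the Gene Length (1161 bp) and Protein Length (386 aa) fields.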
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCCA TTAATAAGCA TACTCAGTCG CCCGTGATCG GTCAAGCCGA GGCCCAAGCT ACCTTCCACC AGATGGTGCA GCACCAGATT CGGCAGGCGA TTCGCGCGAC CTTCATCGAC ATTTTGGAAG ACGAAGTGGC CGCCTTCATC GGTGCGCAGC CCTACGAGCG TACCGACGAG CGCCGTGATC ACCGCGCCGG ACATCGCTCG CGCACCCTTG GCACGACCGC TGGGGTGATC GATGATTTGC CCGTCCCACG CACCCGTGGC GGCTTTCGCA CCCAGCTGTT TGACCGCTAC CAGCGCCGCA TGCACGACGT GGATACGCTC ATGCAGGATA TGTTTGTCGG CGGGGTCAGC CAAACGGCGG TTGGCACGGT GGTTGAGCAC TTAACGGGCA ACGCACCCAG TCCCTCAACC GTCTCACGGG TGTTTCATAC GCTTGACGAC GAGTTTCAGG CCTGGCAGGC TCGCCCGTTG CCCCCACGGT ACTGCTATGC CTTCGCCGAT GGCACCTATT TCAGCGTTAT CTATAACGGG CAAGGCCAGA AAATGCCGAT TTTGGCACTG ATTGGCATCA CCCCTGATGG CCAGCGCGAA GTGATCGCCT TCACGACGGG CGAGCGTGAA AACCAAGGCG CGTGGGAAAG TCTGCTGGCG GACATCAAGG ATCGCGGGGT CGAGACCGTT GATTTGTGGA TTACCGATGG GCATCAGGCC ATGCTGAACG CGATTGCGGC GAAGTTTCCT GCATCGCAGC GTCAGCGCTG TGTCGTCCAC AAAATGAGCA ACATCGAAAG CCACGTTCCC GAAAAATACC GCGATGCACT GCACGCGGAA CTGCGGACGA TTTTCTATCA GCCCGACCGA GCGCAGGCCG ACCAAGCTGC CGCAGCGTTT ATGGCGAAAT ACGAGCGCCA ATACCCGTCG GCGATCCGCT GCATGCAGCG TGACTGGGAA GCCTGCTTGA CGTTTTATGC CTATCCTGAG GGGCATTGGG TGAACATTCG CACGTCAAAT ATCATTGAAC GGACATTCGA GGAGGTCAAG AAGCGCAGCA AAAAAATGGC GACGGCGTTT CGGAATGAAG GCAGTTGTTT ATTGCTGTTT TATGCCGTTG TTCGAACACT TCAGCTTCGG AAAATTCGCA TGCCTGGCTA A
|
Protein sequence | MTPINKHTQS PVIGQAEAQA TFHQMVQHQI RQAIRATFID ILEDEVAAFI GAQPYERTDE RRDHRAGHRS RTLGTTAGVI DDLPVPRTRG GFRTQLFDRY QRRMHDVDTL MQDMFVGGVS QTAVGTVVEH LTGNAPSPST VSRVFHTLDD EFQAWQARPL PPRYCYAFAD GTYFSVIYNG QGQKMPILAL IGITPDGQRE VIAFTTGERE NQGAWESLLA DIKDRGVETV DLWITDGHQA MLNAIAAKFP ASQRQRCVVH KMSNIESHVP EKYRDALHAE LRTIFYQPDR AQADQAAAAF MAKYERQYPS AIRCMQRDWE ACLTFYAYPE GHWVNIRTSN IIERTFEEVK KRSKKMATAF RNEGSCLLLF YAVVRTLQLR KIRMPG
|
| |
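The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted in as a string with the space-separated 10-base blocks shown above. The helper name `gc_percent` is hypothetical, and how the source database computes and rounds its reported value is not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    Assumes the sequence contains only A, C, G and T.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Usage on the first three blocks of the gene sequence above.
print(gc_percent("ATGACTCCCA TTAATAAGCA TACTCAGTCG"))
```

Passing the full Gene sequence value to the function gives a figure that can be compared with the GC content field.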