Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5266 |
Symbol | |
ID | 5737224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | + |
Start bp | 41257 |
End bp | 42417 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641282430 |
Product | transposase mutator type |
Protein accession | YP_001548021 |
Protein GI | 159901776 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3328] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.801188 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCCA GCAAGAAGCA TACCCATTCG CCCGTGATCG GTCAAGCCGA AGCCCAAGCC AGTTTCCAGC AGCTGGTGCA GCACCAGATT CGGCAGGCAA TTCGCGCCAC CTTCATCGAC ATTTTGGAAG ACGAAGTGGC GGCCTTCATC GGTGCACAGC CCTATGAACG TACCGACGAG CGGCGTGACC ATCGCGCCGG ACATCGCTCG CGCACGCTTG GCACGACCGC TGGCGTGATC GACGATTTGC CCGTTCCGCG TACTCGTGGT GGCTTTCGCA CCCAGCTGTT TGACCGCTAC CAGCGCCGCA TGCACGATGT CGATACCCTG ATGCAGGATA TGTTTGTCGG CGGGGTCAGC CAAACCGCCG TTGGCACGGT GGTCGAGCAC TTAACGGGCA ACGCGCCCAG TCCCTCAACG GTCTCGCGGG TGTTTCATAC CCTTGATGAC GAGTTTCAGG CCTGGCAAGC GCGTCCGTTG CCGCCACGCT ACTGCTATGC CTTTGCCGAT GGCACCTATT TCAGCGTGAT CTACAATGGC CAAGGCCAGA AAATGCCCAT TTTGGCCTTG ATTGGCATCA CCCCCGATGG CCAGCGCGAG GTGATTGCCT TCACGACGGG TGAGCGGGAA AACCAAGGCG CGTGGGAAAA CCTGCTGGCC GACATCAAGG ATCGCGGGGT GGACACCATT GATTTGTGGA TTACCGATGG ACATCAGGCG ATGCTGAACG CGATTGCGGC GAAATTTCCC GCGTCCCAGC GCCAGCGCTG TGTCGTGCAC AAAATGAGCA ACATCGAAAG CCACATTCCC GAGAAATACC GCGATGCACT GCACGCGGAG CTGCGGGCGA TCTTCTATCA ACCCGACCGA GCGCAGGCTG ACCAAGCGGC AGCGGCGTTT ATGGCCAAAT ACGAGCGCCA GTACCCATCA GCGATCCGCT GTATGCAACG CGATTGGGAG GCCTGTTTGA CCTTCTATGC CTATCCAGAA GGGCATTGGG TGAACATTCG GACATCGAAT ATCATTGAAC GGACGTTCGA GGAGGTCAAG AAGCGCAGCA AAAAAATGGC GACGGCCTTT CGGAATGAGG GCAGTTGTTT ACTGCTGTTT TATGCGGTCG TTCGAACACT TCAACTTCGC AAAATTCGCA TGCCTGGCTA A
|
Protein sequence | MTPSKKHTHS PVIGQAEAQA SFQQLVQHQI RQAIRATFID ILEDEVAAFI GAQPYERTDE RRDHRAGHRS RTLGTTAGVI DDLPVPRTRG GFRTQLFDRY QRRMHDVDTL MQDMFVGGVS QTAVGTVVEH LTGNAPSPST VSRVFHTLDD EFQAWQARPL PPRYCYAFAD GTYFSVIYNG QGQKMPILAL IGITPDGQRE VIAFTTGERE NQGAWENLLA DIKDRGVDTI DLWITDGHQA MLNAIAAKFP ASQRQRCVVH KMSNIESHIP EKYRDALHAE LRAIFYQPDR AQADQAAAAF MAKYERQYPS AIRCMQRDWE ACLTFYAYPE GHWVNIRTSN IIERTFEEVK KRSKKMATAF RNEGSCLLLF YAVVRTLQLR KIRMPG
|
| |