Gene Information |
Locus tag | Haur_5033 |
Symbol | |
ID | 5736992 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 46270 |
End bp | 47982 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282200 |
Product | transposase IS204/IS1001/IS1096/IS1165 family protein |
Protein accession | YP_001547791 |
Protein GI | 159901545 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3464] Transposase and inactivated derivatives |
TIGRFAM ID | |
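
The length fields in this record follow from its coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Haur_5033.
length = gene_length(46270, 47982)
print(length, protein_length(length))  # 1713 570
```

Both values agree with the Gene Length and Protein Length fields listed above.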
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.681228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGTTC TATGGATCTT GCAACGTTGC TGCTTCCTGA TGCGACTGTC ATGCAGGTCG ATGACTGGAC GGTGGATACG ACCCATCAGC AGATCACCCT TCGCGCACAC CACGCCACGC ACGCTGCCAT GCCCGATCTG TTCCACGCCC GCTGTTCGCG TCCATAGTCG GTATGTCCGT ACCCTCACCG ACCTCCCGTG GGCGACGTAC ACCATTAAAT TGTCGCTCCG GGTTCGGAAA TTTTTCTGTG ACCTGCCGAC CTGCAATCGA CGAATTTTTA CCGAACGCAT CCCAACGATC CTCCAGCCCT CGGCACGCGT TACACTGCGC TGTACCCACG CGCAACGACA CGTAGGTCTT GCTACTGGCG GTGTCGGTGG TGCACAGCTC ACGACGCTCC TCCGCTTTGC CGCAGGCCGT GATCGTCTGT TACGTATCGT GCGTCAGATG CCCCTCCCCG CGATCGGGAA GCCCATCCAT ATTGGGGTCG ATGACTGGGC ATGGCGAAAA GGCCAGCGGT ATGGAACCAT CATTGTGGAT CTGGATACGC ATCGTCCATT GGCCGTGTTG CCTGATCGCC AAGCGAGTAC GTTGACCACC TGGCTCCATG CCCATCCCAG TGTTGCCATC ATTAGCCGTG ATCGCAGCAG TGCCTTTGCT GAGGGAGCTG CGGCAGGGGC ACCTCACGCA ATACAGGTTG CGGATCGTTT TCACTTAATG GTCAATGTGC GAGAAGCACT GCTTCAAACG TTGGTGGCCC ATCGCACGCT GGTCGAGGCC GTGCTGAGTC AGCCAACGCC CCTGACCGCC GCACCCCATC CGGACGCACA ACGAACCCTC ATGGTTGATG ACCCATTACC GCCCCAACCC ATTCCGGCGC AGGTTCGGCA TCGGAGCGAT GCCATCCACG CTCGTCGCCA ACGGCAGTTC GAACAGATTC AGGAATTGGT CGCACGTGGC TGGACGCATC GAGCCATTGC CCACGAGCTT GGACTCACAC GGCAAACCGT GGCACGCTAT CACGCCATTC ACGATGGGCC ACCACCGCAG CATCACGCGC ATCGTCGCAG TGTGCTTGAT CGCTTCAAGC CGTACCTGAT TGAACGATGG AATACGGGAT GCCTGAATGC CGCACAGCTC TGTTTAGAGA TTCAAGCGCA AGGGTATCAG GGAACTGCAC ACACGGTGCG ACGCTATGTC ACCCAACTTC GGAAAGCCAG TGGCCTACCC GCATGGAGTC GTGATGGTGC GGCGCGGCGC GGACATGTGG GACAGGCATG GCCGTGTGGC AGTCTCACCC AGTTGGTGTG GGAATTGGTG CGCCAGCCCA CGAAACAGGC AGCATGGCTG AGGCCGCTCA CCGAAGACCT CCGCACGCGC CATCCCCAGC TCGATAGCGC GATAACGCTG ATGGAATCGT TTTACGCTAT GATTCGGAAT CGGGAACCAA CAGGATTTGC GCCATGGCTC GCGCAAGCAC ACCAGAGCGG ATGTGCTGCG TTTGTGCGGC TAGCCCGCAG CTTTGAGGCT GATGCGGCGG CGATTGAGGC CGCGCTGACC ATGCGATGGA GTCAAGGACC AGTAGAAGGA ACCATTCATC GATTGAAACT CGTCAAGCGC CAAATGTATG GACGGGCAAA CCTTGATTTG CTGGTCCGGC GTGTTCTGTT GACGGGGCGG TCTCCACAAA AATTGCTGCC GTATTCGGTA TGA
|
Protein sequence | MEVLWILQRC CFLMRLSCRS MTGRWIRPIS RSPFAHTTPR TLPCPICSTP AVRVHSRYVR TLTDLPWATY TIKLSLRVRK FFCDLPTCNR RIFTERIPTI LQPSARVTLR CTHAQRHVGL ATGGVGGAQL TTLLRFAAGR DRLLRIVRQM PLPAIGKPIH IGVDDWAWRK GQRYGTIIVD LDTHRPLAVL PDRQASTLTT WLHAHPSVAI ISRDRSSAFA EGAAAGAPHA IQVADRFHLM VNVREALLQT LVAHRTLVEA VLSQPTPLTA APHPDAQRTL MVDDPLPPQP IPAQVRHRSD AIHARRQRQF EQIQELVARG WTHRAIAHEL GLTRQTVARY HAIHDGPPPQ HHAHRRSVLD RFKPYLIERW NTGCLNAAQL CLEIQAQGYQ GTAHTVRRYV TQLRKASGLP AWSRDGAARR GHVGQAWPCG SLTQLVWELV RQPTKQAAWL RPLTEDLRTR HPQLDSAITL MESFYAMIRN REPTGFAPWL AQAHQSGCAA FVRLARSFEA DAAAIEAALT MRWSQGPVEG TIHRLKLVKR QMYGRANLDL LVRRVLLTGR SPQKLLPYSV
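
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It makes two assumptions: that the gene sequence is already shown in coding orientation (it begins with ATG and ends with TGA even though the record lists the minus strand), and that the standard amino acid assignments apply, since translation table 11 differs from the standard table in its permitted start codons rather than in its codon-to-residue mapping. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these
# amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is
    ignored, and a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base groups of the gene sequence above.
print(translate("ATGGAGGTTC TATGGATCTT"))  # MEVLWI
```

Passing the full gene sequence to `translate` in the same way allows the result to be compared with the listed protein sequence once its spaces are removed.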