Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5222 |
Symbol | |
ID | 5737180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 322626 |
End bp | 323864 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641282386 |
Product | transposase Tn3 family protein |
Protein accession | YP_001547977 |
Protein GI | 159901731 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCAGTC TGCTCGATAT GCTGAAAGAA ACCGATCTTC GCGTGGGAGT CACGACCGAC TTTCACACGA GTACGGCACG GGAACACCTT GATCGTGCCA CCCTCCAACG CCGCCTCTTA CTCTGCTGCT ATGGCTTGGG AACCAATATT GGCCTCAAAC GGGTCTGCGC AGGGTCGCCC GGTGACCAGC ATAAAGACCT CGCCTATGTG CGGCGGCGCT TTTTGCTGCG GGATCAGCTG CGGAATGGCA TTGCCAAAGT CGTCAATGCG CTCTTTGACG CGCGGTTACC GCAGATTTGG GGTGAGGGAA CCACCGCGTG TGCCTCGGAC TCCAAACTGT TTGGGGCATG GGATCAAAAT CTGATGACCG ATTGGCATCC CCGGCATCGC GCCGCTGGGG TCAAAATTTA CTGGCACGTG GATAAAAAAG CCGCCTGCAT TTACTCGCAG CTCAAACACC CCTTTGCCTC CGAGGTTGCC GCGATGATGG AAGGTCTGCT CCATCACAAT ACCACCATGA CGGTCGAACG CAACTATGTC GATACCCACG GCCAAAGCGA GATCGCCTTT GGCTTTTGTC ATGTTTTAGG CTTCACATTA ATGCCACGAT TCAAGGCCAT CCATCGGCAA AAACTCTATC GTCCTGAGCG CGGCAATCGG ACGGCCTACC CCAATCTCCA GCCGGTCTTG CAACGACCGA TTAATTGGGA GCGTATTCGG CGTGAATACG ACCAAATCAT CAAATATGCG ACGGCCCTCC GCCTGCGAAC TGCCGAAACC GATGCGATTC TGCGCCGATT TAGCCGACGC AATTTTCAGC ACCCAACCTT CAAGGCACTC CTTGAATTGG GGCGGGCCAT CAAGACCATC TTTCTGTGCC AATACCTGCA TTCGGAGGAT ATGCGTCGGG AGATACACGA GGGCTTACAG GTGGTGGAAA ACTGGAATGG CACGAACGAC TTCATCTTCT ACGGCAAGGG GCGTGCGTTC AATACCAACC AGCGAGCCGA TATGGAGGTG TCCATGTTGT GCCTGCATTT GCTCCAAGTC TCCATGGTGT ACATCAATAC CTTGCTCATT CAGGAGGTAT TGCGGGAGCC AGCGTGGGCG AATCGGTTAA CGCCCGATGA TCTGCGGGCA CTTACGCCGC TGATCTACAG TCATGTCAAT CCCTTTGGTG TGTTTCTGCT CGACCTCTCG CAGCGATTGC CGCTCAAACC GATGCGATTG GCGGCCTGA
|
Protein sequence | MISLLDMLKE TDLRVGVTTD FHTSTAREHL DRATLQRRLL LCCYGLGTNI GLKRVCAGSP GDQHKDLAYV RRRFLLRDQL RNGIAKVVNA LFDARLPQIW GEGTTACASD SKLFGAWDQN LMTDWHPRHR AAGVKIYWHV DKKAACIYSQ LKHPFASEVA AMMEGLLHHN TTMTVERNYV DTHGQSEIAF GFCHVLGFTL MPRFKAIHRQ KLYRPERGNR TAYPNLQPVL QRPINWERIR REYDQIIKYA TALRLRTAET DAILRRFSRR NFQHPTFKAL LELGRAIKTI FLCQYLHSED MRREIHEGLQ VVENWNGTND FIFYGKGRAF NTNQRADMEV SMLCLHLLQV SMVYINTLLI QEVLREPAWA NRLTPDDLRA LTPLIYSHVN PFGVFLLDLS QRLPLKPMRL AA
|
| |