Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0419 |
Symbol | |
ID | 4711255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 487062 |
End bp | 488156 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639854877 |
Product | transposase, IS4 family protein |
Protein accession | YP_001002010 |
Protein GI | 121997223 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGGTG AGGATGTCGA GCAACAGCAC ATGTTCAGCT ACGTCAGCCT CGAAGATCGC ATCCCCAAGA ATCACCCTCT TCGCGAGATC CGTGTCCTCG TTGATCGGCT GCTCGACTCG ATCTCGGACG AGCTGGACGC GGTGTACTCG GGCACCGGTC GGCCGTCGAT CCCGCCGGAG CGACTGATCC GGGCGCTTTT GCTTCAGGCC CTGTACTCGA TCCGCTCGGG GCGGCTGCTC ATGGAGCAGC TCGACTGCAA CCTCCTGTTC CGCTGGTTCG TCGGGCTCCA GGTCGATGAC CCCGTCTGGC ACCCGACCAC CTTCACCAAG AGCCGGGACC GCCTGATCGA GGCGGAGGTT GCGCAGCGCC TGCTGCGCTC GTTCACGGAA CTCGAGGAGG TTCGGCCCCT GCTCTCCGAC GAGCACTTCT CGGTCGATGG CACGCTCATC GAGGCCTGGG CCTCGATGAA GAGCTTCAAG CCGATCAGCC AGGACGGCGA CAGCTCGGGT GATGATGACT TGCAGAGCGG CGGGCGCAAT CCGACGGTCA ACTTCCGGGG TCAGCAGCGG CGCAACGATA CCCACGCCTC GAGCACGGAC CCGAATGCTC GGCTCTACCG CAAGGGCCAG GGGCAGCCTG CGCGGCTTTG CTATATCGGC CATGCCCTGA TGGAGAACCG TCATGGGCTC ATCGTCGATA CCCGCCTAAC CCAGGCCTGT GGCACCGCCG AGCGGGAGGC GGCCCTGGAG ATGATCCGCG AGATCCCTGC AGAGCGAGGC CGGCTGACGC TTGCTGGCGA CGCCGGCTAC AACACCCGAG ATTTCGTCCA GGCCCTGCGC GAGTACGAGG TTACGCCCCA TGTTGCCGAG AAGCGCCGGT TCAATGCTGT GGATGGTCGG ACGACGCGCC ACCCCGGCTA TGCGGTCAGC CAGCGCATCC GCAAGCGCGT CGAGGAGTTC TTCGGCTGGT CCAAGACCGT CGGTGGCCTT CGCAAGACCC GCTTCATCGG ACCAGACAGG GTGGGGTGGG ACTTCGGGTT CCATGCCCTG GCCTACAACC TGGTCCGCAC ACCCAAACTG CTCGGGGCTG GCTGA
|
Protein sequence | MRGEDVEQQH MFSYVSLEDR IPKNHPLREI RVLVDRLLDS ISDELDAVYS GTGRPSIPPE RLIRALLLQA LYSIRSGRLL MEQLDCNLLF RWFVGLQVDD PVWHPTTFTK SRDRLIEAEV AQRLLRSFTE LEEVRPLLSD EHFSVDGTLI EAWASMKSFK PISQDGDSSG DDDLQSGGRN PTVNFRGQQR RNDTHASSTD PNARLYRKGQ GQPARLCYIG HALMENRHGL IVDTRLTQAC GTAEREAALE MIREIPAERG RLTLAGDAGY NTRDFVQALR EYEVTPHVAE KRRFNAVDGR TTRHPGYAVS QRIRKRVEEF FGWSKTVGGL RKTRFIGPDR VGWDFGFHAL AYNLVRTPKL LGAG
|
| |