Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0784 |
Symbol | |
ID | 4709364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 870642 |
End bp | 872330 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855244 |
Product | transposase, IS4 family protein |
Protein accession | YP_001002363 |
Protein GI | 121997576 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5421] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGTAC GCGTCGTCCG ATCCGGTCCC CGCCGCTACG TCCGCCTCGT CGAGGGCTAC CGCGACGACA ACGGACGGGT GAAGCAACGC ACCGTCGCGA ACCTCGGGCG GGCCGACCAA CTCAGCGATG ACGAGGTGGA CGGCCTCATC GAGGGCCTGC AACGCGCTGT TGGGCGGTCT GAGCCCCGGA GAGCGGAACC GGAGTTTCAG CGTGCTCGCG CCTTCGGAGA CCTTTGGACG CTCCATCAGC TCTGGCAGGA ACTCGGCCTG GATGACGCCC TTCGCAAGGC GCTGCGCTCG TCGCGGCGCG AGTTCGACGC GGAGGCGCTG ATCCGCGCCA TGGTCTTCAA CCGGCTGGCT GCACCGCGGA GCAAGCGCGG CGTGCTCGAC TGGCTTCAGG AGGATGTCAG CCTCCCGGGT GTGGATTCTG AGCAACTGCA CCACGAGCAG CTGCTGCGCG CCATGGACGC GCTGATGAAC CACCCCGAGC GGGTCGAGCA GGCCATTGCC GGGCAGCTCA AGCCCCTGCT CGATCAGGAG CTCAGCGTCG TGTTCTGGGA CATCACCACG GTGCGCATCC ACGGTGTGGA GGAGGTCGCG GACGATCTGC GCCAGCGCGG CAAGAGCAAG GACACAGGCG GCGTTGCCCG CCAATTCGCC CTCGGGGTCG TGCAGACCTC CGATGGCCTG CCGATCGCCC ACGAGGTCTT CGAGGGCAAT GTCGCGGAGA CGCGCACCCT GGCACCCATG CTCCAGCGCC TGCTCAGCCG CTACCCGATC AGGCGTGTGG CCGTGGTCGC GGATCGGGGG CTGCTCAGCC TCGACAATGT CGACACGCTG GAGGCGCTCA GCGCGCAGCA CGACCTGGCT ATCGATTACA TCCTGGCGGT GCCCGGACGC CGGTATGCGG AGTTCACCGG CTTGATGAGC GAGCTGCAGC CGCAGCTCGA GAAGCAGGCG GCAGAGCATG AGGGCGAAGC CGTCACCGAG ACCACTTGGC AGGGGCGCCG GCTGATTGTG GCGCACAACC CGCAACGGGC TGCCGAACAG CAGGCCTCGC GCCGGCACAA GATCCACCGG CTAGAGCAGA TCGGCGAGGC CCTGGCCCAG CGACTCGACA ACCAGGATGC GGGAAAGCCC GGCCGCGGGC GGCGCTCGAC TGATCGCAGC GCCCATCGGC GCTTCCATCA GCACGTCCTG CGCAGCAGCC TCTCGAGCAT CGTCAAGGCC GACCTCGGCG CCGAGCAATT CCGCTATGAC ATCGACGAAG AGGCGTGGGC GGCCGCCGAG CGGCTCGACG GCAAGCTGCT GCTGGTCACG AGCCTCCACG AGTACTCGCC GAGCGAGATC GCCGACCGCT ATCGGGCCCT TGCGGACATT GAGCGCGGCT TCCGCGTGCT CAAGAGCGAC ATCCAAATCG CGCCGGTCTA CCACCGGCTG CCGGAGCGCG TCCGTGCCCA CGCGCTCATC TGCTTCCTGG CCCTGGTTCT GCATCGCATC CTGCGCCGGC GTCTCAAGGC GGCTGGTAGC GGTTACTCAC CGGAGAACGC GTTACGGGCG CTGCGCCGTA TACAGCGCCA CAAAGTTCAC CTCGCCGGGC AGGACTACGA GCGGGTGACC AAGCCGACAC CGGAGCAACT CCAGCTCTTC GACGAGCTAG GTGTAGAGGT GCCTCAGCAC CCGGCTTGA
|
Protein sequence | MYVRVVRSGP RRYVRLVEGY RDDNGRVKQR TVANLGRADQ LSDDEVDGLI EGLQRAVGRS EPRRAEPEFQ RARAFGDLWT LHQLWQELGL DDALRKALRS SRREFDAEAL IRAMVFNRLA APRSKRGVLD WLQEDVSLPG VDSEQLHHEQ LLRAMDALMN HPERVEQAIA GQLKPLLDQE LSVVFWDITT VRIHGVEEVA DDLRQRGKSK DTGGVARQFA LGVVQTSDGL PIAHEVFEGN VAETRTLAPM LQRLLSRYPI RRVAVVADRG LLSLDNVDTL EALSAQHDLA IDYILAVPGR RYAEFTGLMS ELQPQLEKQA AEHEGEAVTE TTWQGRRLIV AHNPQRAAEQ QASRRHKIHR LEQIGEALAQ RLDNQDAGKP GRGRRSTDRS AHRRFHQHVL RSSLSSIVKA DLGAEQFRYD IDEEAWAAAE RLDGKLLLVT SLHEYSPSEI ADRYRALADI ERGFRVLKSD IQIAPVYHRL PERVRAHALI CFLALVLHRI LRRRLKAAGS GYSPENALRA LRRIQRHKVH LAGQDYERVT KPTPEQLQLF DELGVEVPQH PA
|
| |