Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0479 |
Symbol | |
ID | 8533606 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 505838 |
End bp | 506875 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646382861 |
Product | transposase IS4 family protein |
Protein accession | YP_003262381 |
Protein GI | 261855098 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3039] Transposase and inactivated derivatives, IS5 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGC GTAGTGCCAT TAAGACGGAC CTGTTTGCTG ATACCCATCA CCGAGATAAG CTTGACACGT TGGGCGACCC CTTGGCAGAA ATCGAAGCCT GCATCGACTT TGCCGCCCTG GCGGCAGAAG TCGATCGCAT TGCGCCGCGG CCCGTCAGTG TTCAAGGCGG TCGTCCGCCG TACCCGACCG AGACCATGGT GCGCATTCTG GTCTTGAAAC GCCTGTACAA CCTGTCCGAC GAACAGATGG AATACCAACT GCTTGACCGC ATGAGCTACA AGCGCTTCTG TGGTCTGTCC CAGGCAACCA ACATCCCTGA TCGCACCACC GTGTGGACCT TCGAGAACCG GATCGGCGAA GTCGGAGCCC AGGTCATCTT TGATGGCGTC ACCACACAAC TCTTGAAGAA GGGCTTTATC GCCCGTGGTG GCCAGATCAT TGATGCTACG CTGGTGCCAG CGCCGAAGCA GTCTTTTTCC AAGGAAGACA AGGAACAGCT CAAAGCGGAT GCGATGCCGG CCGACTGGCA ACCGGGCAAG CGCCGTCAAA AAGACCTGGA TGCGACCTGG ACCAAAAAGC ATGGCAAGAG CCAGCACGGC TACAAGCTTT CAGTGAATGT TGACAAGAAG CACAAGTTCA TCCGCAAAAT CGTGACGGAC ACGGCCAGTA CCCACGACAG TCAGCATTTC GATGCAGTGA TCGATCCGGC CAATACGAGT CGGGATGTCT ACGCTGATCG GGGTTATCCG TCCGAAGAAC GCGAACAATG GCTCAAGGCA AATGGATACC GGAACCGGAT TCAGCGTAAG GGCAAACGGA ACAAGCCGTT ATCCGAAGCT CAACAGGGAC GCAACCATCG CATCGCCAAA ACACGCGCCA GGGTCGAGCA TGTGTTCGGT GCCATTGAGC AGATGGGGGG CAAGCTGCTG CGCACCATTG GTCAGGCGCG GGCAAACTTT GCGATGACAA TGATGGCGGC CTGCTACAAC CTGAAGCGGC TGGCGTATTT CCAGCGAGTG GGCATCACGG CTTTCTGA
|
Protein sequence | MKKRSAIKTD LFADTHHRDK LDTLGDPLAE IEACIDFAAL AAEVDRIAPR PVSVQGGRPP YPTETMVRIL VLKRLYNLSD EQMEYQLLDR MSYKRFCGLS QATNIPDRTT VWTFENRIGE VGAQVIFDGV TTQLLKKGFI ARGGQIIDAT LVPAPKQSFS KEDKEQLKAD AMPADWQPGK RRQKDLDATW TKKHGKSQHG YKLSVNVDKK HKFIRKIVTD TASTHDSQHF DAVIDPANTS RDVYADRGYP SEEREQWLKA NGYRNRIQRK GKRNKPLSEA QQGRNHRIAK TRARVEHVFG AIEQMGGKLL RTIGQARANF AMTMMAACYN LKRLAYFQRV GITAF
|
| |