Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1205 |
Symbol | |
ID | 8534358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1308489 |
End bp | 1311455 |
Gene Length | 2967 bp |
Protein Length | 988 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 646383595 |
Product | transposase Tn3 family protein |
Protein accession | YP_003263088 |
Protein GI | 261855805 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGTC GTTCAATCCT GTCCGCCGCC GAGCGGGAAA ACCTGCTGGC GTTGCCGGAC TCCAAGGACG ACCTGATCCG ACATTACACA TTCAGCGATA CCGACCTCTC GATCATCCGA CAGCGGCGCG GGCCAGCCAA TCGGCTGGGC TTCGCGGTGC AGCTCTGCTA CCTGCGCTTT CCAGGCGTCA TCCTAGGCGT CGATGAGCCG CCGTTCCCGC CCTTGTTGAA GCTGGTCGCC GAGCAGATCA AGGTCGGCGT CGAAAGCTGG GACGAGTACG GCCAGCGGGA GCAGACCCGG CGCGAGCACC TGGTCGAGCT GCAAACCGTG TTCGGTTTCC GGCCCTTCAC CATGAGCCAT TACCGGCAGG CCGTCCAGAT GCTGACCGAG CTGGCCATGC AAACCGACAA GGGCATCGTG CTGGCCAGTG CCTTGATCGA GCACCTGCGG CGGCAGTCGG TCATTCTGCC CGCGCTCAAC GCCGTCGAGC GGGTGAGTGC CGAGGCGATC ACCCGCGCCA ACCGGCGCAT CTACGACACC TTGGCCGAAC CACTGGCGGA CGCGCATCGC CGTCGCCTTG ATGACTTGCT CAAGCGCCGG GACAACGGCA AGACGACCTG GCTGGCCTGG CTGCGCCAGT CACCGGCCAA GCCCAATTCG CGGCATATGC TCGAACACAT CGAACGCCTC AAGGCATGGC AGGCACTCGA CCTGCCTTCC GGCATCGAGC GGCTGGTTCA CCAGAACCGG CTGCTCAAGA TCGCCCGCGA GGGTGGACAG ATGACGCCCG CCGACCTGGC CAAGTTCGAG GCGCAGCGGC GCTACGCGAC CCTGGTGGCG CTGGCCATCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA CGACCGCATC CTGGGCAAGC TGTTCAATGC CGCCAAGAAC AAGCATCAGC AGCAATTCCA GGCATCCGGC AAGGCCATCA ACGCCAAGGT GCGGCTGTTC GGGCGCATCG GCCAGGCGCT GATCGAGGCC AAGCAATCGG GTCGCGATCC GTTCGCCGCC ATCGAGGCCG TCATGTCCTG GGACGCCTTC GCCGAGAGCG TCACCGAAGC GCAGAAGCTC GCGCAGCCCG AGGATTTCGA TTTCCTGCAC CGCATCGGCG AGAACTACGC CACGCTGCGC CGCTACGCGC CGGAATTCCT TGCCGTGCTC AAGCTGCGGG CCGCGCCCGC CGCCAAGGAC GTGCTCGACG CCATCGAAGT GCTGCGCGGC ATGAACAGCG ACAACGCCCG CAAGGTGCCC GCCGACGCGC CGACCGACTT CATCAAGCCA CGCTGGCAGA AGCTGGTGAT GACCGACACC GGCATCGACC GGCGTTACTA CGAGCTGTGC GCACTATCGG AGCTGAAGAA CGCACTGCGC TCGGGCGACA TCTGGGTGCA GGGATCGCGC CAGTTCAAGG ACTTCGAGGA CTACCTGGTG CCGCCCGCGA AATTCGCCAG CCTCAAGCTG GCCAGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGATC AGTACCTGCA TGAACGGCTG ACGCTACTGG AAACGCAGCT TGCCACCGTC AACCGCATGG CAGCGGCCAA TAACCTGCCG GATGCCATCA TCACCGAGTC GGGCCTGAAG ATCACGCCGC TGGATGCGGC GGTGCCCGAC ACCGCGCAGG CCCTGATCGA CCAGACGGCG ATGATCCTGC CGCACGTCAA GATCACCGAA CTGCTGCTGG AGGTGGACGA GTGGACAGGC TTTACCCGTC ACTTCGCACA CCTGAAGTCA GGCGACCTGG CCAAGGACAG GAACCTGCTG CTGACTACTA TCCTGGCCGA CGCGATCAAC CTGGGCCTGA CCAAGATGGC CGAGTCCTGC CCCGGCACGA CCTACGCCAA GCTCGCCTGG CTGCAAGCCT GGCACATCCG CGACGAAACT TACTCGACGG CGCTGGCCGA GCTGGTCAAC GCGCAGCTCC GCCACCCGTT CGCCGAGCAT TGGGGCGACG GCACCACGTC ATCGTCAGAC GGCCAGAATT TTCGCACCGG CAGCAAAGCC GAGAGCACCG GCCACATCAA CCCGAAATAC GGCAGCAGCC CAGGGCGGAC GTTCTACACC CACATCTCCG ACCAGTACGC GCCGTTCCAC ACCAAGGTGG TCAATGTCGG CGTGCGTGAC TCGACCTACG TCCTCGACGG GCTGCTGTAC CACGAATCCG ACCTGCGCAT CGAGGAGCAC TACACCGACA CGGCAGGTTT CACCGATCAC GTCTTCGCGC TGATGCACCT CTTGGGCTTC CGCTTCGCCC CGCGCATCCG CGACCTGGGC GACACCAAGC TCTACATCCC GAAGGGTGAT GCCACCTACG AGGCATTGAA ACCGATGATC GGCGGCACCC TCAACATCAA GCACGTCCGC GCCCATTGGG ACGAAATCCT GCGGCTGGCC ACGTCGATCA AGCAGGGGAC GGTGACGGCC TCCCTCATGC TCAGGAAGCT CGGCAGCTAC CCGCGCCAGA ACGGCCTGGC CGTCGCGCTG CGCGAGTTGG GCCGCATTGA GCGCACGCTG TTCATCCTGG ACTGGCTGCA AAGCGTCGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC AAGGGCGAGG CGCGCAACGC GCTGGCCCGT GCCGTGTTCT TCAACCGCCT TGGTGAAATC CGTGACCGCA GTTTCGAGCA GCAGCGCTAC CGCGCCTCCG GCCTCAATCT GGTAACGGCC GCCATCGTGT TGTGGAATAC GGTCTATCTG GAGCGGGCCG CGAACGCCCT GCGTGTCCAC GGCCAGACTG TTGATGACGG CCTATTGCAG TATCTGTCGC CGCTGGGCTG GGAACACGTC AACCTGACCG GCGATTACCT CTGGCGCAAC AGCGCCAAGA TCGGCGCAGG CAAGTTCAGG CCGCTACGGC CACTGCATCC GGCTTAG
|
Protein sequence | MPRRSILSAA ERENLLALPD SKDDLIRHYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF PGVILGVDEP PFPPLLKLVA EQIKVGVESW DEYGQREQTR REHLVELQTV FGFRPFTMSH YRQAVQMLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERVSAEAI TRANRRIYDT LAEPLADAHR RRLDDLLKRR DNGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPS GIERLVHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQSGRDPFAA IEAVMSWDAF AESVTEAQKL AQPEDFDFLH RIGENYATLR RYAPEFLAVL KLRAAPAAKD VLDAIEVLRG MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSELKNALR SGDIWVQGSR QFKDFEDYLV PPAKFASLKL ASELPLAVAT DCDQYLHERL TLLETQLATV NRMAAANNLP DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFAHLKS GDLAKDRNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YSTALAELVN AQLRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYAPFH TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG DTKLYIPKGD ATYEALKPMI GGTLNIKHVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRVH GQTVDDGLLQ YLSPLGWEHV NLTGDYLWRN SAKIGAGKFR PLRPLHPA
|
| |