Gene Hneap_1205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1205 
Symbol 
ID8534358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1308489 
End bp1311455 
Gene Length2967 bp 
Protein Length988 aa 
Translation table11 
GC content65% 
IMG OID646383595 
Producttransposase Tn3 family protein 
Protein accessionYP_003263088 
Protein GI261855805 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGCGTC GTTCAATCCT GTCCGCCGCC GAGCGGGAAA ACCTGCTGGC GTTGCCGGAC 
TCCAAGGACG ACCTGATCCG ACATTACACA TTCAGCGATA CCGACCTCTC GATCATCCGA
CAGCGGCGCG GGCCAGCCAA TCGGCTGGGC TTCGCGGTGC AGCTCTGCTA CCTGCGCTTT
CCAGGCGTCA TCCTAGGCGT CGATGAGCCG CCGTTCCCGC CCTTGTTGAA GCTGGTCGCC
GAGCAGATCA AGGTCGGCGT CGAAAGCTGG GACGAGTACG GCCAGCGGGA GCAGACCCGG
CGCGAGCACC TGGTCGAGCT GCAAACCGTG TTCGGTTTCC GGCCCTTCAC CATGAGCCAT
TACCGGCAGG CCGTCCAGAT GCTGACCGAG CTGGCCATGC AAACCGACAA GGGCATCGTG
CTGGCCAGTG CCTTGATCGA GCACCTGCGG CGGCAGTCGG TCATTCTGCC CGCGCTCAAC
GCCGTCGAGC GGGTGAGTGC CGAGGCGATC ACCCGCGCCA ACCGGCGCAT CTACGACACC
TTGGCCGAAC CACTGGCGGA CGCGCATCGC CGTCGCCTTG ATGACTTGCT CAAGCGCCGG
GACAACGGCA AGACGACCTG GCTGGCCTGG CTGCGCCAGT CACCGGCCAA GCCCAATTCG
CGGCATATGC TCGAACACAT CGAACGCCTC AAGGCATGGC AGGCACTCGA CCTGCCTTCC
GGCATCGAGC GGCTGGTTCA CCAGAACCGG CTGCTCAAGA TCGCCCGCGA GGGTGGACAG
ATGACGCCCG CCGACCTGGC CAAGTTCGAG GCGCAGCGGC GCTACGCGAC CCTGGTGGCG
CTGGCCATCG AGGGCATGGC CACCGTCACC GACGAAATCA TCGACCTGCA CGACCGCATC
CTGGGCAAGC TGTTCAATGC CGCCAAGAAC AAGCATCAGC AGCAATTCCA GGCATCCGGC
AAGGCCATCA ACGCCAAGGT GCGGCTGTTC GGGCGCATCG GCCAGGCGCT GATCGAGGCC
AAGCAATCGG GTCGCGATCC GTTCGCCGCC ATCGAGGCCG TCATGTCCTG GGACGCCTTC
GCCGAGAGCG TCACCGAAGC GCAGAAGCTC GCGCAGCCCG AGGATTTCGA TTTCCTGCAC
CGCATCGGCG AGAACTACGC CACGCTGCGC CGCTACGCGC CGGAATTCCT TGCCGTGCTC
AAGCTGCGGG CCGCGCCCGC CGCCAAGGAC GTGCTCGACG CCATCGAAGT GCTGCGCGGC
ATGAACAGCG ACAACGCCCG CAAGGTGCCC GCCGACGCGC CGACCGACTT CATCAAGCCA
CGCTGGCAGA AGCTGGTGAT GACCGACACC GGCATCGACC GGCGTTACTA CGAGCTGTGC
GCACTATCGG AGCTGAAGAA CGCACTGCGC TCGGGCGACA TCTGGGTGCA GGGATCGCGC
CAGTTCAAGG ACTTCGAGGA CTACCTGGTG CCGCCCGCGA AATTCGCCAG CCTCAAGCTG
GCCAGCGAAT TGCCGCTGGC CGTGGCCACC GACTGCGATC AGTACCTGCA TGAACGGCTG
ACGCTACTGG AAACGCAGCT TGCCACCGTC AACCGCATGG CAGCGGCCAA TAACCTGCCG
GATGCCATCA TCACCGAGTC GGGCCTGAAG ATCACGCCGC TGGATGCGGC GGTGCCCGAC
ACCGCGCAGG CCCTGATCGA CCAGACGGCG ATGATCCTGC CGCACGTCAA GATCACCGAA
CTGCTGCTGG AGGTGGACGA GTGGACAGGC TTTACCCGTC ACTTCGCACA CCTGAAGTCA
GGCGACCTGG CCAAGGACAG GAACCTGCTG CTGACTACTA TCCTGGCCGA CGCGATCAAC
CTGGGCCTGA CCAAGATGGC CGAGTCCTGC CCCGGCACGA CCTACGCCAA GCTCGCCTGG
CTGCAAGCCT GGCACATCCG CGACGAAACT TACTCGACGG CGCTGGCCGA GCTGGTCAAC
GCGCAGCTCC GCCACCCGTT CGCCGAGCAT TGGGGCGACG GCACCACGTC ATCGTCAGAC
GGCCAGAATT TTCGCACCGG CAGCAAAGCC GAGAGCACCG GCCACATCAA CCCGAAATAC
GGCAGCAGCC CAGGGCGGAC GTTCTACACC CACATCTCCG ACCAGTACGC GCCGTTCCAC
ACCAAGGTGG TCAATGTCGG CGTGCGTGAC TCGACCTACG TCCTCGACGG GCTGCTGTAC
CACGAATCCG ACCTGCGCAT CGAGGAGCAC TACACCGACA CGGCAGGTTT CACCGATCAC
GTCTTCGCGC TGATGCACCT CTTGGGCTTC CGCTTCGCCC CGCGCATCCG CGACCTGGGC
GACACCAAGC TCTACATCCC GAAGGGTGAT GCCACCTACG AGGCATTGAA ACCGATGATC
GGCGGCACCC TCAACATCAA GCACGTCCGC GCCCATTGGG ACGAAATCCT GCGGCTGGCC
ACGTCGATCA AGCAGGGGAC GGTGACGGCC TCCCTCATGC TCAGGAAGCT CGGCAGCTAC
CCGCGCCAGA ACGGCCTGGC CGTCGCGCTG CGCGAGTTGG GCCGCATTGA GCGCACGCTG
TTCATCCTGG ACTGGCTGCA AAGCGTCGAG CTGCGCCGCC GCGTGCATGC CGGGCTGAAC
AAGGGCGAGG CGCGCAACGC GCTGGCCCGT GCCGTGTTCT TCAACCGCCT TGGTGAAATC
CGTGACCGCA GTTTCGAGCA GCAGCGCTAC CGCGCCTCCG GCCTCAATCT GGTAACGGCC
GCCATCGTGT TGTGGAATAC GGTCTATCTG GAGCGGGCCG CGAACGCCCT GCGTGTCCAC
GGCCAGACTG TTGATGACGG CCTATTGCAG TATCTGTCGC CGCTGGGCTG GGAACACGTC
AACCTGACCG GCGATTACCT CTGGCGCAAC AGCGCCAAGA TCGGCGCAGG CAAGTTCAGG
CCGCTACGGC CACTGCATCC GGCTTAG
 
Protein sequence
MPRRSILSAA ERENLLALPD SKDDLIRHYT FSDTDLSIIR QRRGPANRLG FAVQLCYLRF 
PGVILGVDEP PFPPLLKLVA EQIKVGVESW DEYGQREQTR REHLVELQTV FGFRPFTMSH
YRQAVQMLTE LAMQTDKGIV LASALIEHLR RQSVILPALN AVERVSAEAI TRANRRIYDT
LAEPLADAHR RRLDDLLKRR DNGKTTWLAW LRQSPAKPNS RHMLEHIERL KAWQALDLPS
GIERLVHQNR LLKIAREGGQ MTPADLAKFE AQRRYATLVA LAIEGMATVT DEIIDLHDRI
LGKLFNAAKN KHQQQFQASG KAINAKVRLF GRIGQALIEA KQSGRDPFAA IEAVMSWDAF
AESVTEAQKL AQPEDFDFLH RIGENYATLR RYAPEFLAVL KLRAAPAAKD VLDAIEVLRG
MNSDNARKVP ADAPTDFIKP RWQKLVMTDT GIDRRYYELC ALSELKNALR SGDIWVQGSR
QFKDFEDYLV PPAKFASLKL ASELPLAVAT DCDQYLHERL TLLETQLATV NRMAAANNLP
DAIITESGLK ITPLDAAVPD TAQALIDQTA MILPHVKITE LLLEVDEWTG FTRHFAHLKS
GDLAKDRNLL LTTILADAIN LGLTKMAESC PGTTYAKLAW LQAWHIRDET YSTALAELVN
AQLRHPFAEH WGDGTTSSSD GQNFRTGSKA ESTGHINPKY GSSPGRTFYT HISDQYAPFH
TKVVNVGVRD STYVLDGLLY HESDLRIEEH YTDTAGFTDH VFALMHLLGF RFAPRIRDLG
DTKLYIPKGD ATYEALKPMI GGTLNIKHVR AHWDEILRLA TSIKQGTVTA SLMLRKLGSY
PRQNGLAVAL RELGRIERTL FILDWLQSVE LRRRVHAGLN KGEARNALAR AVFFNRLGEI
RDRSFEQQRY RASGLNLVTA AIVLWNTVYL ERAANALRVH GQTVDDGLLQ YLSPLGWEHV
NLTGDYLWRN SAKIGAGKFR PLRPLHPA