Gene Hneap_0616 details

Gene Information

Locus tag: Hneap_0616
Symbol:
ID: 8533751
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 661782
End bp: 664133
Gene Length: 2352 bp
Protein Length: 783 aa
Translation table: 11
GC content: 56%
IMG OID: 646383004
Product: Type I site-specific deoxyribonuclease
Protein accession: YP_003262516
Protein GI: 261855233
COG category: [V] Defense mechanisms
COG ID: [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
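
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive positions and that the last codon of the CDS is a stop codon.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 661782
end_bp = 664133

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 2352 783
```

Both results agree with the Gene Length (2352 bp) and Protein Length (783 aa) fields.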


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGAAG CCGAAACCCG CGCCGAACAT ATCGACCCTG CCTTGGCTGC TGCCGGATGG 
GGCGTGGTGG CCGGTTCACG CATTCGGCGA GAATACCCGA TTACCTTGGG CCGCATTGAA
GGCGCGGGTA AACGCGGCAA GGCGCTGACG GCCGATTATG TGCTGGAGTA CCGCAATACC
AAGTTGGCGG TGGTCGAGGC CAAGGCGTGG AATAAACCAC TCACCGAGGG CGTGGGCCAA
GCCAAGGATT ATTCCGGCAA ACTCGCCATC CGTTTTGCGT ATGCCACGAA CGGTCAGGGC
ATTTATGGCA TCGATATGGA ATCCGGCGTC GAAGCCGAGC TCGCAAATTA TCCATCCCCG
GATGAATTGT GGGCGCGCAC CTTTGCCAGC CAGAATATTT GGCGTGATCG CTTCGCGGAG
GTGCCGTTCG AGGATCGGGG CGGGTACTTT CAAAGTCGGT ATTATCAAGA CATTGCCATC
GAGCGCGTGC TGGCGGCAAT CGCCGATCAT CAATCGCGCA TTTTGCTCAC CTTGGCGACC
GGTACGGGCA AAACCTTTAT TGCCTTTCAA CTGGCGTGGA AGCTGTTTCA TAGCCGCTGG
AACTTGCGCG ACTGGCAGCG TGAAGCCGAA CCGAGCCGCC GCCCGCGTAT TTTGTTTTTA
GCCGACCGCA ACATTCTCGC CAATCAGGCC TTCAATGCCT TTTCGGCGTT CCCGGAAGAT
GCGTTGGTAC GGATTGATCC TGCCGATATT CGCAAGCAGG GAAGGGTGCC GAAAAACGGC
AGTCTGTTTT TCACGATTTT CCAGACATTC ATGAGTGGGC AGGATGCCGA AGGCCAGCCT
GCACCGTACT TTGGCGATTA CCCGCCGGAT TTTTTCGATT GCATCATTAT CGACGAGTGC
CATCGCGGTG GCGCGAACGA TGAAAGCAAC TGGCGCGGCA TTCTGGCGTA TTTCGCGCCC
GCCGTGCAGC TTGGCTTGAC CGCCACGCCC AAGCGCAAAG ACAACGTGGA TACCTATCAA
TACTTCGGCG AGCCGGTGTT TGTGTATTCA TTGAAGGACG GCATCAATGA TGGTTTTTTG
ACTCCGTTCC GAGTGAAGCA AATCGCCACC ACGCTCGATG AATATGTGTA CACGCCCGAT
GACACGCTGG TGGAAGGCGA GATTGAAGCG GGCAAGCGCT ACGAAGAAGC CGACTTCAAC
AAGATCATCG AGATCAAGGA ACGTGAGCAA AAGCGTGTCG AGATTTTCAT GGCGCAAATT
GACCAGCGCG AGAAAACCAT CGTGTTTTGT GCCACCCAAG AACATGCCCT GGCCGTGCGG
GATTTGATCA ACCAGATCAA GTCCAGCAGC AACCCCGATT ACTGCCAACG GGTAACCGCC
AATGATGGTG CGCGGGGTGA ACACTATCTG CGCGATTTTC AGGACAACGA GAAAACTATC
CCGACGATCC TGACCACATC GCAAAAGCTC TCGACCGGCG TGGACGCCCG CAACGTGCGC
AATATCGTGC TGATGCGCCC CGTCAATTCG ATGATCGAAT TCAAACAGAT TATCGGGCGC
GGCACGCGGC TGTATGACGG CAAGGATTAC TTCACCATCT ATGATTTCGT GAAGGCGCAC
CATCACTTCA ATGACCCCGA ATGGGACGGC GAGCCGCTGG AACCAGAGCC AACCGACCCC
CGCCCACCCC AACCGCCGAG TGAACCAACC CCGCCCGATG GCGTGCGTGA ACCCAGTTCG
TCTTATGAGC GCAAGCCCAA AGTGAAAGTG CAGCTTTCCG ATGGCAAGGC CCGCACCATC
CAGCACATGA TGAGCACGAG CTTCTGGCAC CCGGACGGTA CACCGATGTC TGCCCAGCAG
TTCATGGAAT CGCTGTTTGG TCGCTTGCCG GAGTTTTTCA AGGACGAAGA CGAATTACGG
GCCCTGTGGA GTGACCCCGA AACCCGCAAA CGCTTGCTCG AAGGGCTGGC CGAAAAAGGC
TTCGGTACGG ATCAACTGCG GGAAATGCAA AAGATCATCG ATGCACAAAA TAGTGATCTG
TTCGATGTGT TGGCTTATGT GGCCTACGCC CAAACGCCGC TGAGCCGGGA AGATCGCGCG
GATCGTGCCA TGGCGCTCAT CAGCAGCCAC TTCAACAGCA AACAGCAAGT GTTTCTGGAT
TTCGTGCTTT CGCAGTACAT CAGCGTGGGG GTGGAGGAGT TGGACAAAAC CAAACTCGGC
AGTTTGCTCC GCCTGAAATA CCACGACTCC ATCAACGATG CTATCGCCGA CCTCGGCAAG
CCCGATGAAA TCGGCCAGAT GTTTAGCGGG TTTCAGAAGT TTTTGTATCA GCCAGTGCAA
GCGAAGGTTT AG
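
The gene sequence is listed in blocks of 10 bases, 60 per line, so it needs to be joined before any calculation. A minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field follows. The function names are hypothetical, and how the record's 56% figure was rounded is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAACGAAG CCGAAACCCG CGCCGAACAT ATCGACCCTG CCTTGGCTGC TGCCGGATGG"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full listing (all lines, pasted as one string) to gc_fraction gives the value for the whole gene.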
 
Protein sequence
MNEAETRAEH IDPALAAAGW GVVAGSRIRR EYPITLGRIE GAGKRGKALT ADYVLEYRNT 
KLAVVEAKAW NKPLTEGVGQ AKDYSGKLAI RFAYATNGQG IYGIDMESGV EAELANYPSP
DELWARTFAS QNIWRDRFAE VPFEDRGGYF QSRYYQDIAI ERVLAAIADH QSRILLTLAT
GTGKTFIAFQ LAWKLFHSRW NLRDWQREAE PSRRPRILFL ADRNILANQA FNAFSAFPED
ALVRIDPADI RKQGRVPKNG SLFFTIFQTF MSGQDAEGQP APYFGDYPPD FFDCIIIDEC
HRGGANDESN WRGILAYFAP AVQLGLTATP KRKDNVDTYQ YFGEPVFVYS LKDGINDGFL
TPFRVKQIAT TLDEYVYTPD DTLVEGEIEA GKRYEEADFN KIIEIKEREQ KRVEIFMAQI
DQREKTIVFC ATQEHALAVR DLINQIKSSS NPDYCQRVTA NDGARGEHYL RDFQDNEKTI
PTILTTSQKL STGVDARNVR NIVLMRPVNS MIEFKQIIGR GTRLYDGKDY FTIYDFVKAH
HHFNDPEWDG EPLEPEPTDP RPPQPPSEPT PPDGVREPSS SYERKPKVKV QLSDGKARTI
QHMMSTSFWH PDGTPMSAQQ FMESLFGRLP EFFKDEDELR ALWSDPETRK RLLEGLAEKG
FGTDQLREMQ KIIDAQNSDL FDVLAYVAYA QTPLSREDRA DRAMALISSH FNSKQQVFLD
FVLSQYISVG VEELDKTKLG SLLRLKYHDS INDAIADLGK PDEIGQMFSG FQKFLYQPVQ
AKV
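
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the method used to produce this record. It uses the standard codon assignments, which translation table 11 shares with table 1, and it does not handle the alternative start codons that table 11 allows (not needed here, since this gene starts with ATG). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence from this record.
print(translate("ATGAACGAAG CCGAAACCCG CGCCGAACAT"))  # MNEAETRAEH
print(translate("GCGAAGGTTT AG"))                     # AKV
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence.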