Gene Hneap_0614 details


Gene Information

Locus tag: Hneap_0614
Symbol:
ID: 8533749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 658845
End bp: 660050
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 50%
IMG OID: 646383002
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003262514
Protein GI: 261855231
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
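
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 658845
END_BP = 660050
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)
print(computed_length == GENE_LENGTH_BP, computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the listed Gene Length and Protein Length.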


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGGA AGGTCGTACC ACTTAAGGAT CTTTTTCAAA TTGGATCTAG CAAGCGTGTT 
TTGAAGTCAC AGTGGAAAGC AGAAGGAGTG CCTTTCTATC GCGGACGCGA GGTCACGCGA
TTAGCAATGG ATGGCTTCGT AGACAACGAG CTGTTTATTT CTGAGGCTCA TTATGCAGAG
CTTGCAAATC AGTATGGAGC TCCGAGAACT GACGATATAG TCATCACAGC GATCGGAACT
ATTGGCAATT CGTACATCGT CCAGGATGGC GACAGGTTCT ATTTCAAGGA TGCCAGCATC
CTTTGGATGA AGAGAATCAG CGATGTCAGT AGCAAGTTCG TCAATTTTTG GTTGAAATCC
ACCATGTTTC TCGATCAACT GGATCATGGG AATGGGGCAA CCGTAGACAC GCTGACGATT
CAAAAACTCC AGAGCGTCCA GATATGGGTT CCCCCCATTG CCGAACAACA CCGCATTGTC
TCCATTCTCG ACGAAGCCTT TGAAGGCATC GCCAAAGCCC GAGCCCATGC CGAACAGAAC
CGCCAGAACG CCCGCGCCTT GTTTGAAAGC CACCTGCAAT CCGTGTTCAC GCAGCGGGGT
GAGGGGTGGG CGGAAAAGTC GCTTGAGGAA GTGGTAGATG CGCAATGCAC ACTTTCATAT
GGCATCGTTC AGCCGGGTCA CGAATACGCT AAAGGAATGC CGATTGTTCG TCCTACGGAC
TTGACGGCAA AATTGATTAC GCTTAACGGA TTGAAACGTA TCGACCCAAA GCTGGCCGAT
GGCTATCGCA GAACTACGCT GCGTGGCGGC GAACTTCTGC TCTGTGTTCG AGGAAGTACC
GGAGTGTTGG CGGTCACATC CTCAGAACTT GCTGGCGCTA ACGTAACGCG CGGCATAGTT
CCGATCATGT TTGATCCATC GTTACTTAGC CAAGATTTTG GCTATTTCCT GATGACTTCA
GAGGCAGTGC AGAGCCAAAT CCGCATCAAA ACTTATGGAA CAGCGCTAAT GCAAATAAAC
ATTGGGGATT TGAGAAAAAT TGCTGTCTCA TTTCCTCCGC TAAAGGAACA GGAAAGGATG
ACGGCACAAC TCGAAGAGTT GTCTGCCGAA ACCCAACGCC TGGAATCAAT CTACCAACAA
AAACTCGCTG CCCTCGATGA ACTGAAAAAA TCCCTGCTGC ATCAAGCCTT CTCCGGCTCA
CTTTAG
 
Protein sequence
MKGKVVPLKD LFQIGSSKRV LKSQWKAEGV PFYRGREVTR LAMDGFVDNE LFISEAHYAE 
LANQYGAPRT DDIVITAIGT IGNSYIVQDG DRFYFKDASI LWMKRISDVS SKFVNFWLKS
TMFLDQLDHG NGATVDTLTI QKLQSVQIWV PPIAEQHRIV SILDEAFEGI AKARAHAEQN
RQNARALFES HLQSVFTQRG EGWAEKSLEE VVDAQCTLSY GIVQPGHEYA KGMPIVRPTD
LTAKLITLNG LKRIDPKLAD GYRRTTLRGG ELLLCVRGST GVLAVTSSEL AGANVTRGIV
PIMFDPSLLS QDFGYFLMTS EAVQSQIRIK TYGTALMQIN IGDLRKIAVS FPPLKEQERM
TAQLEELSAE TQRLESIYQQ KLAALDELKK SLLHQAFSGS L
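
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons. The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first three 10-base blocks and the final block of the gene sequence are used here.

```python
# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_blocks = "ATGAAAGGGA AGGTCGTACC ACTTAAGGAT"  # start of the gene sequence
last_block = "CTTTAG"                              # end of the gene sequence

print(translate(first_blocks))  # MKGKVVPLKD
print(translate(last_block))    # L
```

The first ten residues match the start of the protein sequence, and the final block gives the terminal L followed by a TAG stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.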