Gene Hneap_1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1707 
Symbol 
ID8534865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1834665 
End bp1836878 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content56% 
IMG OID646384091 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003263579 
Protein GI261856296 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTAC GCCACACTCT CTATACAAAA TGCGCACTTG GCCTGCTGTG TGCGCTCGCC 
CTCAACCTGC CCGCCCAGGC CGGCCTGCCC GTTTCAACAA CGAATATCGC ACTTCCGGCA
CCCTCTGCCG CCAGCTCATA CAAGCCCGAC ACAGTGGTCC CCAACGAGGT TCTGGTTAAA
TTCGGCAGCA CGCTATCGGC ACAGAACATG ACCCCCACGC TGAATAGCAT GGCGAGCAGC
GTCAAGCAGA TCAACCGTTC GGGTTTGACC TTGGTCAAAC TCACGCCCGG CACCGAAACG
ATTTCATCGG CGATGGCCAC GTTACGCGCC ATGCCGGGTG TGATTTCCGT CCAACCGAAC
TTCATTTACC ACAGCACAGC CCTTCCCAAC GATCCCGAGA TTGGCCAACA GTGGGCGCTG
AAAAATATAG GGCAGACTGT CTCGAACGCG ACCTATGCCA CCAGTAATCC GGGCACACCT
GGGGATGACA TTGATGCAGA AAGCGCCTGG CAATACCAAT CCGACTGCTC GTCGGTCACG
GTCGCCGTCG TCGATACAGG TATCAACTAC ACCCAGCAGG ATCTGGTCAA CAGCATGTGG
AATGGTGGGA CTCAATACCC ACACCACGGC TATGATTTTG TCGATAACGA CAATGATCCC
TATCCCACAA CCGGTGATGA ACTCCATGGC ACCCATGTGG CAGGCATTAT CGGTGCTGAG
GGCAACAATG GGATTGAAGG CTCCGGCGTC TGCCAGAAAG CCAGCATCAT GTCCGTGCGC
TCGCTGGACT CATCAGGCGG CAGCACAGCA AGTGTAGTTC AAGGCGTTTA CTTTGCCATT
GATCATGGCG CTCGAATCAT CAATATGAGT CTGGGTGGCA GCGGCGGTTT CGACCAAGAC
TTCTCCGACG CAATCAGCTA CGCCCAAAGC AAGGGGGCGC TGGTCGTTGT TGCTGCAGGC
AATGGCGATG CAAATGGAAA TGGCGTTGAC GTCGATCAAA CCCCTTTTTA TCCCTGCGCT
TTCCCGCAAG ATAACCTGAT CTGCGTCGCT GCGCTGGATC AATCGTTCCA GCTCGCCAGC
TTTTCTAATT ACGGCGCCAC CAGCGTGGAT GTGGGTGCGC CGGGAACGAA TATTCTCAGC
ACGTTTGCAG GACCGACTCT GACAACGGAT TTTTCCAGCG GTTGGACGGC CAGCCTGGGT
ACCAGCACGG GCTGGGGTTA CGGGAAAACC ACATCCGGTA TCCCGATACT TGTTAATCCG
GTTGATTACG GCAAGAGCAA TTATGCACCG AGTACAGATG ATCGTATCTG GACCGACTTT
ACGTTTGCAC CCGGTACACA ACACGTTGCT TTAAATTATT ACCTGCAAGG GCGCATGGCT
ACGGGTGATT ATCTCAATTC TGGCGTCGCC GTCGGAAGTA ATACGGATCC GTTTGGAAGC
AGTGGCACCC AACTCCAACA CGAAACGGAC ACCCTCAGCT CCCCAGCGGC TGCGTACCCG
ATTGATCAGT GTGCTGGCAA AACGTGTTCG ATCGGCTTCC AGTTGACCAG CACGCCCGTA
AGTGCAGGCG ACACCGGCCC GCTGATCGCT TTTTTCGAGC TCAATACCGT TGCAACCAAC
ACCCATGAGA TGGGTATTGA GAACGGCACA TCAATGGCGG CGCCCGTCGT CTCCGGCATT
GCCGCCTTGT TGATGGCCTC CGACCCGGCC GCCAGCGATA TCGACGTCGT CCAGGCAATC
AAAAACTCAG GCATACCGGT GCCTACCCTT TCGGGTGTCA CCACAACGGG CAAGGCTGTC
AATGCCATGC GCGCGCTTGC CAATCTGCAC CTGAGTGTTA CTGGCCTTGC TGACCAAACG
GGTACTGCCG GACAACCGCT TTCTGTCACA TTCTCCATCA GCGGCTTGAA CGCGCTAGCT
GTTTCCGCGA GCAGTAGCAA TACTTCTGTA TTGGCCAACA CCGCAATTAC AGGTCAAAAC
AGTTGCACAC AGACAGGGGG CTGTACGCTC CAGCTTCTCC CGGCAATGGG TGGTACTTCT
ACGATCTATG TGACCGTAAG CGACACATTC GGCCAACAGT CGACGGGCAG TTTCCTGCTG
ACAGTTCCCT CTTCCGGTGG TGGTGGCGGC GGCAGCATGA ACTGGACTTT CCTGCTGGTA
CTGGCCGTCA TTCTGGCTGC CGGTCAATGG CGCAGGCGAG GGTTGGAATC ATGA
 
Protein sequence
MTLRHTLYTK CALGLLCALA LNLPAQAGLP VSTTNIALPA PSAASSYKPD TVVPNEVLVK 
FGSTLSAQNM TPTLNSMASS VKQINRSGLT LVKLTPGTET ISSAMATLRA MPGVISVQPN
FIYHSTALPN DPEIGQQWAL KNIGQTVSNA TYATSNPGTP GDDIDAESAW QYQSDCSSVT
VAVVDTGINY TQQDLVNSMW NGGTQYPHHG YDFVDNDNDP YPTTGDELHG THVAGIIGAE
GNNGIEGSGV CQKASIMSVR SLDSSGGSTA SVVQGVYFAI DHGARIINMS LGGSGGFDQD
FSDAISYAQS KGALVVVAAG NGDANGNGVD VDQTPFYPCA FPQDNLICVA ALDQSFQLAS
FSNYGATSVD VGAPGTNILS TFAGPTLTTD FSSGWTASLG TSTGWGYGKT TSGIPILVNP
VDYGKSNYAP STDDRIWTDF TFAPGTQHVA LNYYLQGRMA TGDYLNSGVA VGSNTDPFGS
SGTQLQHETD TLSSPAAAYP IDQCAGKTCS IGFQLTSTPV SAGDTGPLIA FFELNTVATN
THEMGIENGT SMAAPVVSGI AALLMASDPA ASDIDVVQAI KNSGIPVPTL SGVTTTGKAV
NAMRALANLH LSVTGLADQT GTAGQPLSVT FSISGLNALA VSASSSNTSV LANTAITGQN
SCTQTGGCTL QLLPAMGGTS TIYVTVSDTF GQQSTGSFLL TVPSSGGGGG GSMNWTFLLV
LAVILAAGQW RRRGLES