Gene Hneap_1232 details

Gene Information

Locus tag: Hneap_1232
Symbol:
ID: 8534386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 1334939
End bp: 1336348
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 56%
IMG OID: 646383623
Product: peptidase U32
Protein accession: YP_003263115
Protein GI: 261855832
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
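
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both assumptions fit the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1334939, 1336348)
print(length)                     # 1410
print(protein_length_aa(length))  # 469
```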


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0266585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGC CTGAATTGCT ATCTCCTGCC GGTACGCTCA AATCGATGCG TTACGCCTTT 
GCCTACGGTG CCGATGCCGT TTATGCGGGG CTGCCACGGT ATAGCCTGCG GGTGCGTAAC
AATGATTTTC TTGAAGACAA TCTGCGAATC GGTATTGATG AAGCGCATGC GGCGGGCAAG
CAATTTTTTA TGACGGTGAA TCTGTCGCCG CACAATGCAA AGCTCAGAAC ATTCATCAAG
GACATGACCC CGCTGGTCGA GATGCAGCCG GATGCGTTCA TCATGGCCGA CCCCGGCCTG
ATCATGATGG TGCGTCAGCA GTGGCCTGAT CTCCCGATCC ATCTTTCCGT ACAGGCCAAT
ACCGTCAATT GGGCGACAGT CCAGTTCTGG AAAAACGCAG GTATTTCGCG GGTGATTCTC
TCGCGTGAAC TCTCTCTGGA CGAGATCAGC GAGATCCGTG ATCGATGCCC GGATATGGAG
CTGGAAGTAT TCGTGCACGG CGCTCTGTGC ATCGCCTACT CTGGGCGTTG CCTGCTTTCC
GGCTACTTCA ATCATCGTGA TCCGAATCAG GGCACTTGCA CCAACGCCTG CCGCTGGGAA
TACAAAACCG AGGCCGCAAC GGAAGATCTC TCCGGTGTCT ATCGCCCGGC CGAATCGTCC
ATTATTCCGC TCGATGCCGT CGGCGGCACG ACTGCCTTGG AGGGCTTTGA TTTCTCGGGC
GCCGAACAAT CATTCGGCCC GACGGGAAGC GATAACCGCC ATCCCAAGGC CAACGATGTG
TACTTCATCG AAGAGCCGAA TCGACCGGGC GAATTGATGC CAATCGAAGA AGACGAGCAC
GGCACGTATA TCCTCAACTC ACGTGATTTG CGCGCGGTGG AACATGTCCA GGCACTGACG
CGCATCGGTG TGGATTGCCT TAAAATCGAG GGCCGGACCA AATCGCATTA CTACGTTGCC
CGAACCGCGC AAGTGTATCG CCAGGCCATC GATGATGCGG TGGCCGGTCG GCCCTTCGAC
CCGAATCTGA TAGCCGAACT CGATAACCTC GCACATCGCG GTTATACCGA CGGATTCTTC
GAACGGCATC ACACCAAGGA ATACCAGAAC TACATCGAGG GCGTTTCCAA ACACCGCGAG
CAGCAGTTTG TGGGCGAGAT CACCTCGGTG CAGGGCGATT GGGCTGAGGT GGATATCAAG
AACAAGTTGG CTGTCGGTGA TTCGGTCACC TTCATGCTGC CCGACGGCAA CCGTATCGAA
CAACTGGCTG ATATGCAGAC GCTTGAAGGT GCCTCTATGA CCGAAGCGCC GGGTGGTGGC
TACAAGGTTC GGCTCAAGTT GCCGGCAGGC GCCGGAGCAT TTGGCGACCG TCTGAAATAT
GGTCTTATTG CCAAACATCT GAACGGCTGA
 
Protein sequence
MKAPELLSPA GTLKSMRYAF AYGADAVYAG LPRYSLRVRN NDFLEDNLRI GIDEAHAAGK 
QFFMTVNLSP HNAKLRTFIK DMTPLVEMQP DAFIMADPGL IMMVRQQWPD LPIHLSVQAN
TVNWATVQFW KNAGISRVIL SRELSLDEIS EIRDRCPDME LEVFVHGALC IAYSGRCLLS
GYFNHRDPNQ GTCTNACRWE YKTEAATEDL SGVYRPAESS IIPLDAVGGT TALEGFDFSG
AEQSFGPTGS DNRHPKANDV YFIEEPNRPG ELMPIEEDEH GTYILNSRDL RAVEHVQALT
RIGVDCLKIE GRTKSHYYVA RTAQVYRQAI DDAVAGRPFD PNLIAELDNL AHRGYTDGFF
ERHHTKEYQN YIEGVSKHRE QQFVGEITSV QGDWAEVDIK NKLAVGDSVT FMLPDGNRIE
QLADMQTLEG ASMTEAPGGG YKVRLKLPAG AGAFGDRLKY GLIAKHLNG