Gene Hneap_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_2353 
Symbol 
ID8535517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp2519937 
End bp2521277 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content55% 
IMG OID646384727 
Productcarboxyl-terminal protease 
Protein accessionYP_003264209 
Protein GI261856926 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0201351 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGT GGTTACGTAA AACATCTGCC CTTGGCGTTG CCGCCGTTTT CGGGTTCTCT 
GTGGCGATTG CCGTTAACGC GCTTGCTGAC AAGGATCAGG CCACTGATTC CAACATTCCG
CTGAGCGAAT TGCGCACATT CACCGATGTT TTAACACGCG TAAAAGCGGA CTATGTCGAC
AATGTCACCG ACAAGACCCT GATGGACAAT GCCATTCGCG GCATGATCGA TCGCCTCGAC
CCGCATTCGA ATTATCTCGA TAAGGCCGAG TTCAAGGATC TGCGGGAAAC CACAACCGGC
AAATTCGGTG GCTTGGGATT ACAGGTCGGC ATGAAGGACA AGGTGATCAC CGTCATCTCG
CCAATCGACG ACACCCCGGC CCAAAAAGCT GGTATTAAGG CCGGCGACAG GATCGTCAAG
ATCAATGGTG AGTTCACCCA GGGTCTTGAC CTTGAGAAAG CAGTAAAACA GATGCGCGGC
GACCCGGGCA CGAAAATCAC GCTGACACTG GTTCGCGATG GCGTCGACAA ACCGTTTGAC
GTCACACTTG AGCGCGCCAT CATCAACGTC AAGTCGGTCA AGGCGCGCAT GCTTGATCCG
AACTTCGGCT ATGTGCGCAT CGCCCAATTC CAGTCCGACA CGACAGAACA GTTGCATGAT
GCGCTCAATC AACTGATCAA GGACAACGAC AACAAGCCAC TCAAAGGCTT GGTACTTGAT
CTGCGCAACA ATCCCGGCGG CGTGCTTCAG GCAGCAGTCG GCGTGGTGGA TACCTTCGTC
AACAAGGGTT TGATCGTATA TACCAAGGGT CGTGTCGAAG ATGCGCAGAT GAGCTTCAAG
GCGCATGAGG GCGACATGCT CAACGGCGCC CCCATCGTGG TGTTGGTGAA CGGCGGCTCG
GCGTCAGCAT CGGAAATCGT TGCCGGTGCT CTGCAAGATG ACAGCCGGGC ATTGATTGCC
GGTGAACGCA CCTTCGGCAA GGGCTCGGTT CAGTCCATCA TGCCATTGAC CAATGGCGGG
GCGCTGCGCC TGACCACGGC CCGCTACTTC ACGCCCTCGG GCCGCTCAAT TCAGGGCGAG
GGCATCAAGC CGGACGTGGA AGTGCATCAG TTGAAAGTTA CCGATATCGA CAAGGTGTTC
TCCATCAAGG AAGCTGATCT GGCCGGTCAC ATCAGCAACC CCACCAAGCC AGATCAAAAA
CCGGCTACCC AGCCGATCAA GCAAATGATC GACTCCGACG GTAAGCCGTT GGTAGAAACC
GATTATCAGC TCTACGAGGC ACTCAATCTA CTCAAGGGCA TGTCCATCGT GGCGCATCGA
GATCGGGATT CCGCACAGTA A
 
Protein sequence
MAQWLRKTSA LGVAAVFGFS VAIAVNALAD KDQATDSNIP LSELRTFTDV LTRVKADYVD 
NVTDKTLMDN AIRGMIDRLD PHSNYLDKAE FKDLRETTTG KFGGLGLQVG MKDKVITVIS
PIDDTPAQKA GIKAGDRIVK INGEFTQGLD LEKAVKQMRG DPGTKITLTL VRDGVDKPFD
VTLERAIINV KSVKARMLDP NFGYVRIAQF QSDTTEQLHD ALNQLIKDND NKPLKGLVLD
LRNNPGGVLQ AAVGVVDTFV NKGLIVYTKG RVEDAQMSFK AHEGDMLNGA PIVVLVNGGS
ASASEIVAGA LQDDSRALIA GERTFGKGSV QSIMPLTNGG ALRLTTARYF TPSGRSIQGE
GIKPDVEVHQ LKVTDIDKVF SIKEADLAGH ISNPTKPDQK PATQPIKQMI DSDGKPLVET
DYQLYEALNL LKGMSIVAHR DRDSAQ