Gene Hneap_0040 details


Gene Information

Locus tag: Hneap_0040
Symbol:
ID: 8533153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 46091
End bp: 47545
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 54%
IMG OID: 646382419
Product: protease Do
Protein accession: YP_003261953
Protein GI: 261854670
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
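
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name expected_lengths is hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon, which the record does not state explicitly.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the terminal stop codon
    return gene_len, protein_len


gene_len, protein_len = expected_lengths(46091, 47545)
print(gene_len, protein_len)  # 1455 484
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.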


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAACCC AGATTCTTGT TGAAGCCATT CGTGCAGGCT TGCCGCGCCG TGGACGGCTT 
TTGATGGCCG CCGCTCTGAT TGCCACACCG TTGATGGCGA TGACACCGGC GATCAGTTTT
GCCGACAACG GCGTCCCCGA TTATGTGCAG TTGGTAAAAC AGGCCAGCCC GTGGGTGGTC
AATATCAGCA GCGTGAGCAA TCCCAAAACC CAAGAAGCTT TTAACAACGG CGAAATGCCA
ACCTTTCCGC CAGGACCTGC GGGCGATATG TTCCGGCATT TTTTCCAAGA ACAAATGCCG
CAAATGAAGC GTGAACCGAT TCGCTCTCTG GGTTCGGGTT TTATTATTTC CGCCGATGGC
TACATTCTTA CCAACGCGCA TGTAGTCAAC GGCGCGGACA AAATCACGGT GCGATTGCCC
GATCAGCAAA CCTACAAGGC CAAAGTGATC GGCAAAGACA AACGCACCGA CATCGCGTTG
CTGAAAATCG ATGCGAAAAA TCTGCCTGTT GCCCCCATTG GCAACTCGGA TAATATCCAA
GTGGGCGAAT GGGTTCTTGC CATTGGCGAG CCTTTCGGGC TCGATCACAC CGCAACGCAC
GGCATCGTGT CTGCCCTGGG CCGCGATTTG CCCGATGAGA GCTACGTGCC CTTCATTCAA
ACCGATGCGC CCGTCAATCC GGGCAACTCG GGCGGTCCAT TGATCAATGC TAACGGCAAA
GTCATCGGCA TCAATTCGCA GATTTATACG AAATCTGGCG GGTTTATGGG GATCTCGTTT
GCGATTCCGA TCAATGTTGC CATGAACGTG GTCGATCAGA TCAAGTCTAC CGGTCATGTG
ACGCGAGGCT ATTTGGGCGT GTTGATCCAG CCGGTCACCT ACGATTTGGC GCAATCGTTC
GGTCTGGATA CCACCAAAGG CGCACTGGTG GCTAAGGTGG AGCCCAACAC ACCGGCGGCC
AAAGCAGGTC TAAAATCGGG CGATATCATT CTCAAGTTTA ACGGCAGCGA GATCAAACAC
TCCGGCGAAT TGCCCATCAT GGTTGGCATG TCGCCGATTG GCAAACCGGC CACCCTCACC
TTGATGCGCG ATGGCAAGCA GATGGAGCTT AATGTCACCA TCGAAAAGCT CGACAAGAAA
GCACTCGAAG CTGAATCGGG CACCAGTGAA GCCATTGAAA AAATGGGTCT TCAGGTCACC
GAACTCTCCC CAGACGAACT GCAGCAGCTG AACATCAAAT ACGGCATAAA AGTGAAAAGC
GTCAAAAATG ACTCGCAATT CGCATCGGTT ATTGCCCCCG GCGACATCCT GCTGGAAGTC
AATCGGATGC CGATGAAGTC TGCGACCGAT CTGAAAAAAG CGCTCGACAG TGCGCCGAAG
GACCGACCGA TTGCCATCCG GCTGTTGCGG GATGGTCAAC CGTTATTCAT GGCGGTTCAA
CTCGGTACTC AGTAA
 
Protein sequence
MRTQILVEAI RAGLPRRGRL LMAAALIATP LMAMTPAISF ADNGVPDYVQ LVKQASPWVV 
NISSVSNPKT QEAFNNGEMP TFPPGPAGDM FRHFFQEQMP QMKREPIRSL GSGFIISADG
YILTNAHVVN GADKITVRLP DQQTYKAKVI GKDKRTDIAL LKIDAKNLPV APIGNSDNIQ
VGEWVLAIGE PFGLDHTATH GIVSALGRDL PDESYVPFIQ TDAPVNPGNS GGPLINANGK
VIGINSQIYT KSGGFMGISF AIPINVAMNV VDQIKSTGHV TRGYLGVLIQ PVTYDLAQSF
GLDTTKGALV AKVEPNTPAA KAGLKSGDII LKFNGSEIKH SGELPIMVGM SPIGKPATLT
LMRDGKQMEL NVTIEKLDKK ALEAESGTSE AIEKMGLQVT ELSPDELQQL NIKYGIKVKS
VKNDSQFASV IAPGDILLEV NRMPMKSATD LKKALDSAPK DRPIAIRLLR DGQPLFMAVQ
LGTQ
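
The sequences above are printed in space-separated blocks of 10, so they need to be cleaned before further use. The following is a minimal sketch of how that could be done, together with a GC-content calculation and a translation of the gene sequence. The helper names clean, gc_percent, and translate are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs only in the start codons it permits, and since this gene starts with ATG, no special start handling is assumed.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon.

    Assumption: the sequence contains only A, C, G, T; an ambiguous base
    such as N would raise KeyError.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_block = clean("ATGCGAACCC AGATTCTTGT TGAAGCCATT")
print(translate(first_block))  # MRTQILVEAI
```

The translated fragment matches the first 10 residues of the protein sequence listed in this record. Passing the full cleaned gene sequence to gc_percent and translate would allow the GC content and Protein Length fields to be checked in the same way.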