Gene Hhal_1376 details


Gene Information

Locus tag: Hhal_1376
Symbol:
ID: 4711356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1483363
End bp: 1485453
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 70%
IMG OID: 639855843
Product: oligopeptidase B
Protein accession: YP_001002945
Protein GI: 121998158
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1770] Protease II
TIGRFAM ID:
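
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1483363
end_bp = 1485453

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2091
print(protein_length)   # 696
```

Under these assumptions the computed values match the listed Gene Length (2091 bp) and Protein Length (696 aa).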


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.175482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGT CGACGCCGCA GCCGCCCCGG GCCGCCGAGA CCCCCGAGGT GATCGAACGC 
TTCGGGGTGC GCCGCACCGA CCCCTACGCC TGGTTGCGCG ATCCGAACTG GCGGGAGGCG
ATGCTCGACC CGCAGCAGCT GCAGTCGTCC ATCCGTGCCC ACCTGGAGGC CGAGAACGCC
TACACCGAGG CGGTGCTGCA GCCGGTGGCC GGGCTGCGTG AGCGGCTTTT CGCCGAGCTG
CGCGGGCGGA TCAAGGAGGA GGACGCCTCG GTCCCCGATC CCGACGGCGC CTACGAGTAC
TATGTCCGCT TCCGCGCCGG CGGGCAGCAC CCGGTGGCCT GCCGTCGGCC TCGCGGGGGC
GGGGCCGAGG AGGTGCTGCT CGATGGCGAC GCCCTGGCCG AGGGGCACGC TTACTTCCAG
CTCGGCGACT GGGCGCACAG CGACGACCAC CGCTACCTGG CCTACGCCGT GGATACCAGC
GGTGCCGAGG CGTACACCAT TCGCTTTCGG GATCTTGCCG GTGGGGCCGA TCTGCCGGAT
GCCCTGGAGC AGGCCCGCGG CGACTTCGTC TGGGCCGGCG ACGGCCAGAC CCTGCTCTAT
ACGGTGCTCG ATGACGAGCA CCGGCCGCGC TGGGTTTATC GCCACTGTCT TGGCACCGAT
TCGGCCAGCG ACGCGCTGGT CTACGAGGAG ACTGATCCGG GGTTCTTCGT CGGCCTGGAC
CGCACCGAGA GCCGGCGCTA CGTGCTCATT GAGACCCACG ATCACACCAC CTCGGAGGTC
CACGCTGCGC TGGCCGACGA CCCGGCGGCC GGCTTTCGCT GCCTCATGCC CCGCGAGCGC
GGGGTGGAGT ACGCAGTCAG CGACAGCGGC GACCGCTGGC TGATCCTGAC CAACCGGCAG
GCGCAGGACT TCCGCATCGT CCAGGCGCCC CTGGAAGCCC CGGAACCGGG GCATTTCGAA
GAGGTCGTGC CCCACCGCCC CGGGGTGCTG ATCCACGATA TGCTGCTGTT CCGGGAGCAC
CTGGTGCGCC TGGAGAGCGA GGATGCGCTG CCGCGCATCG TGGTCCGTCG TCTGGGCGAC
GGCGCCGAGC ACAGCGTCGC CTTCGACGAG GCGGCCTACG CGCTGGGCAT CTCGGGCGGC
TTCGAGTACG ACACCACCAC CCTGCGCTTC AGCTACTCGT CGTTGACGAC GCCGGGCCGG
GTCTACGACT ACGACATGGA GACCCGGGTG CGCAGCTTGC GCAAGGAGCA GGAGATCCCT
AGCGGGCACG ATCCGGCCGC CTACGTGGCG CACCGCATCG AGGCCACGGC CCCGGATGGT
GAGCGGGTGC CGATCTCGCT CGTCCACCGC GCGGATCTGC AGCCGGGCCC GGATACCCCG
CTGTGGCTCA ACGGCTACGG TGCCTACGGC ATCAGCCAGC CGGCGGCCTT CTCGCCGCAT
CGACTGTCGC TGGTCGACCG CGGCTTCGTC TTCGCCATTG CTCACGTGCG CGGCGGCAAG
GAACGCGGCT ACCGCTGGTA CGACGCCGGC AAGCTCGAGC ACAAGCCGAA TACCTTCAGC
GACTACATCG CCTGCGCCGA GCACCTGATC GACGCCGGTT ACACCGGGGC CGGGCGGGTG
GTGGTCCACG GCGGCTCGGC CGGCGGCATG CTCGTCGGCG CCGTGCTCAA TCAGCGCCCG
GAGCTCTTCG GGGCCGCGGT GGCGGATGTG CCCTTTGTGG ATGTGCTCAA CACCATGAGC
GACCCGAGCC TGCCGCTCAC CCCGCCGGAG TGGCCGGAGT GGGGCAACCC CATCGAGGAC
GAGCGGGCTT TCCATACCAT CCTCGGCTAC TCGCCCTACG AGAACATCCA GGCGCAGGCC
TATCCGCCCA TCCTGGCCAC CGCCGGGGTC TCGGATCCGC GGGTGACCTA CTGGGAGCCG
GCCAAGTGGG TGGCCCGTCT GCGTGCCCTG AAGACCGACG ACAACCCGCT GCTGCTGCAG
ACCAACATGA GCGCCGGCCA CGCCGGGCCG GGCGGTCGCT TCGACTACCT GGAGGAGGCG
GCCTTGCGCT TTGCCTTCGT GCTCTGGGTC TTCGGCCGGG CCGTGGGCTG A
 
Protein sequence
MTESTPQPPR AAETPEVIER FGVRRTDPYA WLRDPNWREA MLDPQQLQSS IRAHLEAENA 
YTEAVLQPVA GLRERLFAEL RGRIKEEDAS VPDPDGAYEY YVRFRAGGQH PVACRRPRGG
GAEEVLLDGD ALAEGHAYFQ LGDWAHSDDH RYLAYAVDTS GAEAYTIRFR DLAGGADLPD
ALEQARGDFV WAGDGQTLLY TVLDDEHRPR WVYRHCLGTD SASDALVYEE TDPGFFVGLD
RTESRRYVLI ETHDHTTSEV HAALADDPAA GFRCLMPRER GVEYAVSDSG DRWLILTNRQ
AQDFRIVQAP LEAPEPGHFE EVVPHRPGVL IHDMLLFREH LVRLESEDAL PRIVVRRLGD
GAEHSVAFDE AAYALGISGG FEYDTTTLRF SYSSLTTPGR VYDYDMETRV RSLRKEQEIP
SGHDPAAYVA HRIEATAPDG ERVPISLVHR ADLQPGPDTP LWLNGYGAYG ISQPAAFSPH
RLSLVDRGFV FAIAHVRGGK ERGYRWYDAG KLEHKPNTFS DYIACAEHLI DAGYTGAGRV
VVHGGSAGGM LVGAVLNQRP ELFGAAVADV PFVDVLNTMS DPSLPLTPPE WPEWGNPIED
ERAFHTILGY SPYENIQAQA YPPILATAGV SDPRVTYWEP AKWVARLRAL KTDDNPLLLQ
TNMSAGHAGP GGRFDYLEEA ALRFAFVLWV FGRAVG
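
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not part of the original record, of how such text could be joined, translated and checked against the listed protein sequence. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs only in which codons may act as start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGAGT CGACGCCGCA GCCGCCCCGG GCCGCCGAGA CCCCCGAGGT GATCGAACGC"
dna = clean(first_line)

print(translate(dna))  # MTESTPQPPRAAETPEVIER
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full 2091 bp gene sequence to `gc_fraction` would allow a comparison with the GC content listed in the record.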