Gene Hhal_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1034 
Symbol 
ID4709768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1113567 
End bp1114562 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content70% 
IMG OID639855505 
Productpeptidase U32 
Protein accessionYP_001002612 
Protein GI121997825 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.159666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACTGC TCTGCCCGGC CGGTAACCTG ACCGCCCTGC GCGCCGCCGT GGACAACGGG 
GCCGATACGG TCTACATCGG CTTCCGGGAC GCCACCAATG CCCGCCACTT CCCCGGCCTC
AACTTCACCC CGGAGCAGGC CGCGCGCGGG GTCGAATACG CCCACCAGCG GGGCGTACGG
GTCCTTGTGG CCGTCAACTC CTACGTCCAG GCTGGCGGCT GGTCGCAGTG GCAGCGTTCC
ATCGACGAGG CGGCGCGTAT CGGCGCCGAC GCCGTCATTG TTGCCGACCT GGGCCTACTG
GAATATACCG CCGAGCAGTG GCCGGATCTC GGCCTGCACC TCTCCGTCCA GGCCTCCGCC
ACCACCCCCG AGGCCCTCGA CTTCTACAAA CGGTGCTACG GGGTCAGCCG TGCCGTACTG
CCCCGCGTGC TCTCCATCCA GCAGGTCGAG GCGCTGGCGG GCGACACCGA CGTCGAACTT
GAAGTCTTCG GCTTCGGCTC CCTGTGTGTC ATGGCCGAAG GGCGCTGCCT GCTCTCCTCC
TACGCCACCG GCGAGTCGCC CAACACCGTG GGTGCCTGCT CGCCGGCCTG GGCCGTGCAG
TGGCAGGAGA CCCCCGAGGG CCGCGAGGCG CGGCTGGGCG GGTTGTTGAT CGATCGCTTT
GCGCCGGACG AGCCGGCGGG CTACCCGACC ATCTGCAAAG GGCGTTACGA GGTCGAGGGG
GCCTTGGAGC ACGCCTTCGA GTCGCCCACC AGCCTCGAGA CCGCCGAGTT GCTGCCCCGG
CTCAAGCGGG CCGGTATCCA CGCGGTGAAG ATCGAAGGGC GGCAGCGCAG CCCGGCCTAC
GTCGGCAAGG TGACGCGCAT CTGGCGGCAG CTGATCGACC GTATCCCGGA GGCCGAGGGC
GACTACACCC CGGATCCGGC GCAGGTGGCG GCCCTGCGAG AGTTCTCCGA GGGGGCGACC
ACCACCCTCG GTCCCTACGA GCGCAATTGG CAGTGA
 
Protein sequence
MELLCPAGNL TALRAAVDNG ADTVYIGFRD ATNARHFPGL NFTPEQAARG VEYAHQRGVR 
VLVAVNSYVQ AGGWSQWQRS IDEAARIGAD AVIVADLGLL EYTAEQWPDL GLHLSVQASA
TTPEALDFYK RCYGVSRAVL PRVLSIQQVE ALAGDTDVEL EVFGFGSLCV MAEGRCLLSS
YATGESPNTV GACSPAWAVQ WQETPEGREA RLGGLLIDRF APDEPAGYPT ICKGRYEVEG
ALEHAFESPT SLETAELLPR LKRAGIHAVK IEGRQRSPAY VGKVTRIWRQ LIDRIPEAEG
DYTPDPAQVA ALREFSEGAT TTLGPYERNW Q