Gene Information |
Locus tag | Hhal_1034 |
Symbol | |
ID | 4709768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1113567 |
End bp | 1114562 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855505 |
Product | peptidase U32 |
Protein accession | YP_001002612 |
Protein GI | 121997825 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
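The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (which is consistent with the values in this record) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 1113567, End bp 1114562.
length = gene_length_bp(1113567, 1114562)
print(length, protein_length_aa(length))
```

For this record the sketch reproduces the listed Gene Length (996 bp) and Protein Length (331 aa).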
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.159666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACTGC TCTGCCCGGC CGGTAACCTG ACCGCCCTGC GCGCCGCCGT GGACAACGGG GCCGATACGG TCTACATCGG CTTCCGGGAC GCCACCAATG CCCGCCACTT CCCCGGCCTC AACTTCACCC CGGAGCAGGC CGCGCGCGGG GTCGAATACG CCCACCAGCG GGGCGTACGG GTCCTTGTGG CCGTCAACTC CTACGTCCAG GCTGGCGGCT GGTCGCAGTG GCAGCGTTCC ATCGACGAGG CGGCGCGTAT CGGCGCCGAC GCCGTCATTG TTGCCGACCT GGGCCTACTG GAATATACCG CCGAGCAGTG GCCGGATCTC GGCCTGCACC TCTCCGTCCA GGCCTCCGCC ACCACCCCCG AGGCCCTCGA CTTCTACAAA CGGTGCTACG GGGTCAGCCG TGCCGTACTG CCCCGCGTGC TCTCCATCCA GCAGGTCGAG GCGCTGGCGG GCGACACCGA CGTCGAACTT GAAGTCTTCG GCTTCGGCTC CCTGTGTGTC ATGGCCGAAG GGCGCTGCCT GCTCTCCTCC TACGCCACCG GCGAGTCGCC CAACACCGTG GGTGCCTGCT CGCCGGCCTG GGCCGTGCAG TGGCAGGAGA CCCCCGAGGG CCGCGAGGCG CGGCTGGGCG GGTTGTTGAT CGATCGCTTT GCGCCGGACG AGCCGGCGGG CTACCCGACC ATCTGCAAAG GGCGTTACGA GGTCGAGGGG GCCTTGGAGC ACGCCTTCGA GTCGCCCACC AGCCTCGAGA CCGCCGAGTT GCTGCCCCGG CTCAAGCGGG CCGGTATCCA CGCGGTGAAG ATCGAAGGGC GGCAGCGCAG CCCGGCCTAC GTCGGCAAGG TGACGCGCAT CTGGCGGCAG CTGATCGACC GTATCCCGGA GGCCGAGGGC GACTACACCC CGGATCCGGC GCAGGTGGCG GCCCTGCGAG AGTTCTCCGA GGGGGCGACC ACCACCCTCG GTCCCTACGA GCGCAATTGG CAGTGA
|
Protein sequence | MELLCPAGNL TALRAAVDNG ADTVYIGFRD ATNARHFPGL NFTPEQAARG VEYAHQRGVR VLVAVNSYVQ AGGWSQWQRS IDEAARIGAD AVIVADLGLL EYTAEQWPDL GLHLSVQASA TTPEALDFYK RCYGVSRAVL PRVLSIQQVE ALAGDTDVEL EVFGFGSLCV MAEGRCLLSS YATGESPNTV GACSPAWAVQ WQETPEGREA RLGGLLIDRF APDEPAGYPT ICKGRYEVEG ALEHAFESPT SLETAELLPR LKRAGIHAVK IEGRQRSPAY VGKVTRIWRQ LIDRIPEAEG DYTPDPAQVA ALREFSEGAT TTLGPYERNW Q
|
| |
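The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one possible way to do this in plain Python, not the method used by the database. It builds the codon table from the NCBI amino-acid string, which is the same for translation table 11 and the standard table (the two differ only in their permitted start codons, which this sketch does not model). The helper name `translate` is hypothetical; the input sequence is assumed to be given in coding orientation and in frame, as it is in this record.

```python
from itertools import product

# NCBI ordering: first, second and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, stop_at_stop: bool = True) -> str:
    """Translate an in-frame coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and stop_at_stop:
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed in the record.
print(translate("ATGGAACTGC TCTGCCCGGC CGGTAACCTG"))
```

Applied to the first 30 bases, this yields the first ten residues of the listed protein sequence (MELLCPAGNL); the final codons CAG TGA give the terminal Q followed by the stop.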