Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4178 |
Symbol | |
ID | 5736040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5327805 |
End bp | 5329490 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281333 |
Product | peptidase M3A and M3B thimet/oligopeptidase F |
Protein accession | YP_001546938 |
Protein GI | 159900691 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02289] oligoendopeptidase, M3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0341575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTGA AGATTGACCC ACGCGATTGG GCCACAGTTC AGCCCTACTA CGATTCACTT GCAGCCGAAG AGCTTACGGC TGCCAATGCA ACTGCTTGGT TAGGCCGTTG GAGTGAGTTG GAAGCCCATT TGCAAGAATT CGGCTTCAAG GCCTATCGGG CGATGACCGA AAATACGCAG GATCAGGCCG CCGAAGAGCT GTACCTCTAC ACAATTGAGG AGTTGACCCC GAAATCCAAG GTTGCCGCCC AAACCTTGAA AGAAAAAATT TTGGCGCTTG ATCCAACGCT GTTGCCAGCC AAAATGCAAG AAATGCTGCG GCGATTTCAT GCTGAAGCCA ATTTATTCCG CGAAGAAAAT GTGCCACTAT TGACCGAAGT TGAGCGCCTG AGTGCGGAAT TTAGCAAATT GATGGGTGCA ATGAGCGTTG AGTGGCAAGG CGAAACCCTG ACCATGCAAG CAGTCGTTAA GCTGTTTAGC GACCCAGATC GCAGTGTGCG CGAAGCTGCA TTTAAGGCTT ATCATAGCCG TTTTTTGCAA GACCGAGCAG CTTTCAACGA GATTTATCTC AAGCAACTTA AGCTTCGTCG CCAAATTGCC ACCAATGCAG GTAAAGCCAG CTATTTGGAT TATGTCTGGG ATCTCTATGG GCGCTTCGAT TATACGCCTG CCGATTGCCG CACCTTCCAC GATGCAATTG AACATGAAGT TGTGCCGTTG GTTGAAAAAT GGTTGGCAGT CCATCAAGCC GAATTAGGCG TTGATAGCTT GCGACCATGG GATATTTTGG TTGATTCGCA ATCACGCCCA GCTTTAACTC CATTCCAAAG CGCCGATGAA CTCGAAAGCG TCTGCCAAAC AATCTTCGAT CGCGTTGATC CTGAACTGGG AGCGCAATAC ACTCGCATGC GTGATGGCTG GCTTGATTTG GATTCACGGC CTGGTAAAGC ACCTGGTGGT TATTGTGGCG GGATGTTTGT TTCCAAAGTG CCCTACATTT TTATGAATGC GGTTGGCACC CATGATGATG TCCAAACCAT GTTGCACGAA GGTGGCCATG CTTTCCACTT TATGGAATCA TCGGCAACCA ACGATTTGAT TTGGAATTAC GATGGGCCAA CTGAATTTTG TGAAGTCGCT TCGATGGCGA TGGAATTGTT GGCAGCGCCC TATTTAGCCA ACGCTAAGGG CGGTTTCTAC AGCGAAACTG ATGCCCGTCG TGCTCGTGCA GAGCATCTTT GGAGTATGCT CAAATTCTTG CCCTATATGG CGACCGTCGA TGCCTTCCAA CATTGGGTCT ACATTGAAGC GCCTGAAGAT GTAACGGCTG AGCAGCTTGA TGCCGTGTGG AAAGACCTCT ATCAGCGCTT TATGAGTGGG GTGAATTGGG ATGGCTTCGA AGATGTACAG ACCACAGGCT GGCATCGCAA GCAGCATATT TTTGAAGCTC CGCTCTATTA TGTAGAGTAT GGCTTAGCCC AATTGGGGGC GCAACAAGTT TGGCGCAACG CTTTAGCTGA TCAAGCGCAG GCCGTAGCAC AATATCGCGC AGCCTTAGCC TTGGGCAATA CCAAACCACT GAGTCAATTA TTTGCAGCGG CTGGGGCAAA ATTTGCCTTC GATCGGGCAA CTGTCGGCGA ATTAGCCCGC TTAGTCGATC AGCATCTGAC CACACTCCGT GATTAA
|
Protein sequence | MSLKIDPRDW ATVQPYYDSL AAEELTAANA TAWLGRWSEL EAHLQEFGFK AYRAMTENTQ DQAAEELYLY TIEELTPKSK VAAQTLKEKI LALDPTLLPA KMQEMLRRFH AEANLFREEN VPLLTEVERL SAEFSKLMGA MSVEWQGETL TMQAVVKLFS DPDRSVREAA FKAYHSRFLQ DRAAFNEIYL KQLKLRRQIA TNAGKASYLD YVWDLYGRFD YTPADCRTFH DAIEHEVVPL VEKWLAVHQA ELGVDSLRPW DILVDSQSRP ALTPFQSADE LESVCQTIFD RVDPELGAQY TRMRDGWLDL DSRPGKAPGG YCGGMFVSKV PYIFMNAVGT HDDVQTMLHE GGHAFHFMES SATNDLIWNY DGPTEFCEVA SMAMELLAAP YLANAKGGFY SETDARRARA EHLWSMLKFL PYMATVDAFQ HWVYIEAPED VTAEQLDAVW KDLYQRFMSG VNWDGFEDVQ TTGWHRKQHI FEAPLYYVEY GLAQLGAQQV WRNALADQAQ AVAQYRAALA LGNTKPLSQL FAAAGAKFAF DRATVGELAR LVDQHLTTLR D
|
| |