Gene Information |
Locus tag | Haur_4682 |
Symbol | |
ID | 5736529 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5979222 |
End bp | 5981042 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281846 |
Product | peptidase M3A and M3B thimet/oligopeptidase F |
Protein accession | YP_001547441 |
Protein GI | 159901194 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
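
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon that is not part of the protein length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information table.
start_bp = 5979222
end_bp = 5981042
gene_length_bp = 1821
protein_length_aa = 606

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.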
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACCC AAACGCTCCC ACACTGGGAT CTTTCAGTTG TCTACCCCAG CCTAGACTCG CCTGAATTTG CCGCAGGCTT TCAACGGATC AAGCAACAAG TGAGCGATCT CAAAAGCCTG TTTGATCAAG CTGCTATCCA ACGTAGTGAA AACCAAGTGC TTGATTCCGC CACCATCGCG ACCTTTGAAA CCGTAACTAC TGCCTTCAAC CAAACCCAAG ATGCGATGAT GACCAATTTT AGCTATGTGA TGGGCTTTGT CGCCACCGAC ACCCGTGATA CGCTAGCCCA AGCTCGGTTT AGCGAAATGC AACAATTGGG CATGCAATTG CAACAATTGG GTCTGCGCTG GGTCGCTTGG ATTGGTGGCT TGGATATTGA TCAATTGATT GCTCAATCAA GTTTGGCCGC CGAGCATGCC TTTATTTTGC ACAAAACCAA GCAAGAAGCG TTGCACCTGA TGTCGCCAGC CGAAGAAGTG TTGGCAGTCG AACTCAATTT GAGCGGTGGC AGCGCTTGGA GCAAATTGCA CACGGTGGTT AGCTCACAGC TTAGCGTGCC AGTCACGCAT GCTGGCGAAA CCAAGCATAT GCCAATGAGC ATGGTACGCA ATTTGGCGTT TGATCCAAAT CGTGAAGTTC GGCGCAGCGC TTACGAGGCC GAATTGGCTG GCTGGAAAGG GGTCGAAACG CCCATGGCAG CGGCAATCAA CAGCATCAAA GGCCAAGTTA ACACGTTGGC TGGCCATCGT GGTTGGGCTA CAGCGTTGGA TGAGGCGCTG GCGGATAACA ATATTGATCG CCAAACCCTT GATGCAATGT TAACCGCCGC CCGCGAAACC TTCCCAGATT TCCGCCGCTA CTTGCGGGCC AAAGCACGGT TGGTGGGCAG CGAACGGTTG GCATGGTTCG ATCTCTTCGC GCCGATTGGC AACGATGTCA GCACTTGGGA ATACGCCGAA TCGGAAGCCT TCATTTTGGA GCATTTCGGC AGCTATTCAC AACGCTTGCG TGATTATGCC GCCCGCGCCT TCGATGAAAA CTGGATCGAT GCTGAGCCAC GGGCTGGCAA ACGTGATGGC GCGTTCTGTA TGCGTTTAGT TGGCGATCAA TCACGCATCC TGAGCAACTA TAAGCCCTCG TTTGCTGGAA TGCGCACCCT TGCACATGAG TTGGGTCACG GCTATCATAA CCTTAACTTA GCCGAAACCA CCCCATTGCA GCGCCAAATT CCGATGACCT TGGCTGAAAC TGCCAGCATT TTCTGTGAAA CGATTGTGCG CAACGCCGCC TTGGCTAAGG CTGATGCTGC AACCCAATTA GCAATCATCG AATCATCGTT GCAAAATAGC TGCCAGTTGG TTGTCGATAT CAGCAGCCGC TTTATTTTCG AGCAAAGCCT GTTCGAAGCC CGTCAAGCGC GTGAGCTAAG CGTGCAGGAA ATTAACGACC TAATGCTCAA GGCCCAAGCT GAAACCTATG GCGATGGCTT GGACGAACAG TACTACCATC CGTATATGTG GGCGGTCAAA GGTCATTACT ATAGCACTGA ACGCTCATTT TACAATTATC CCTATATGTT TGGCTTATTG TTCGGCTTAG GCTTGTATGC CCAGTATGTA GCAACACCCG ATGAGTTTCG GGCCAAATAC GATGATTTAT TGGCGGCTTC GGGCAAGCAT GACGCAGCAA CCTTGGCCGA ACGTTTTGGG ATTGATATTC GGACCCCCGA TTTTTGGCGG GCAAGTTTGG CCACCATCAA GGCCGATATT GACCGTTTCG TTGAGCTAAG CGAGCAGTTG 
TTGGATACCG CTGCCCACTA A
Protein sequence | MTTQTLPHWD LSVVYPSLDS PEFAAGFQRI KQQVSDLKSL FDQAAIQRSE NQVLDSATIA TFETVTTAFN QTQDAMMTNF SYVMGFVATD TRDTLAQARF SEMQQLGMQL QQLGLRWVAW IGGLDIDQLI AQSSLAAEHA FILHKTKQEA LHLMSPAEEV LAVELNLSGG SAWSKLHTVV SSQLSVPVTH AGETKHMPMS MVRNLAFDPN REVRRSAYEA ELAGWKGVET PMAAAINSIK GQVNTLAGHR GWATALDEAL ADNNIDRQTL DAMLTAARET FPDFRRYLRA KARLVGSERL AWFDLFAPIG NDVSTWEYAE SEAFILEHFG SYSQRLRDYA ARAFDENWID AEPRAGKRDG AFCMRLVGDQ SRILSNYKPS FAGMRTLAHE LGHGYHNLNL AETTPLQRQI PMTLAETASI FCETIVRNAA LAKADAATQL AIIESSLQNS CQLVVDISSR FIFEQSLFEA RQARELSVQE INDLMLKAQA ETYGDGLDEQ YYHPYMWAVK GHYYSTERSF YNYPYMFGLL FGLGLYAQYV ATPDEFRAKY DDLLAASGKH DAATLAERFG IDIRTPDFWR ASLATIKADI DRFVELSEQL LDTAAH
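
The gene and protein sequences above can be spot-checked against each other. The sketch below is a minimal Python example that translates the first 30 and last 21 bases of the listed gene sequence and compares them with the two ends of the listed protein sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA even though the strand is listed as minus) and that both fragments are in frame. The codon table holds the standard amino-acid assignments, which translation table 11 shares with table 1; the two tables differ only in their permitted start codons. The function and variable names are hypothetical.

```python
# Amino-acid assignments for codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Fragments copied from the Sequence section (assumed to be in frame).
gene_first_30 = "ATGACTACCC AAACGCTCCC ACACTGGGAT"
gene_last_21 = "TTGGATACCG CTGCCCACTA A"
protein_first_10 = "MTTQTLPHWD"
protein_last_6 = "LDTAAH"

start_matches = translate(gene_first_30) == protein_first_10
# The final codon TAA is a stop codon, written "*" in the translation.
end_matches = translate(gene_last_21) == protein_last_6 + "*"

print("start matches:", start_matches)
print("end matches:", end_matches)
```

The same `translate` function could be applied to the full gene sequence to compare all 606 residues, provided the sequence is pasted in without the line break.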