Gene Information |
Locus tag | Haur_4827 |
Symbol | |
ID | 5736672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6154274 |
End bp | 6156031 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281992 |
Product | peptidase M61 domain-containing protein |
Protein accession | YP_001547585 |
Protein GI | 159901338 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTGT CTGTTGTCCT TGATTACACC CTCGCGATTC CTCAACCTCA ACGCCATCTC ATCAACGTTA CATTGCGGAT TGAGGGTTTA ACGGGTTCTG AAACGTCGTT GCAATTGCCA GCATGGACAC CAGGCTCGTA TCTGCTCCGC GAATATGCCC GCCACGTGCG CTTTGTCGAG GCTCATACCG ACGGCCAAGC GCTAGCCATC CACAAAACTG ATCGCTTGAC ATGGCAGGTG CAAACCAATG GGGCCAGTGC GATCACGGTA ACCTATCAAG TATATGGCTA TGAATTGACC GTGCGCACCA ATCACATTGA TGCAACGCAC GCGCATATTG TGCCAGCCGC CACCTTGCTT TATTTGCCCG AATATCAGGA TTGTCGCTAT AACGTACATC TGGAACTGCC AAGTAGCTGG GAAGTTGCCA CGGGCTTGCC CAAATTGGCC GATGGTTGTT ATGGTACCAA CGATTTAGAT GAATTGATGG ATTGTCCGTT TGAGGTCGGT GAATTGCGCC GCTACCGCTT TGACGTAGCC GAAAAGGCTC ACGAGGTGGT GGTTTGGGGC CATGGCAACG AAGATATCGA GCAAGTGCTG GCCGACACGA AGCAAATTGT CGAAACTGAG TATGCCTTCT GGGGCGATTT ACCCTACGAT TATTATTTGT TTATTGTGTT GCTAGCTGGA GCCAATACTT ATGGCGGCTT AGAGCATCGT AATTCAACCT CGCTCTTGCT ACCGCGCCAT GTCTTCAAAC CAAGCAAAAC CTATGAACGC GCTCAAGGCC TAATTGCCCA TGAATTTTTT CATACCTGGA ATGTCAAACG GCTACGGGCT GCCCCACTCG GCCCCTTCGA TTACACCCGC GAGAATTACA CTCGCTTGTT GTGGGTGATG GAAGGCTTCA CCGAATACTA CACCGATTTG ATGTTGGTTC GCGCAGGCTT GATGACTCCC CAACGCTACC TTGAGCGCTT GGCCGATGAT ATTAGCACGC TGCAAAATAC TCCTGGCCGC TTGGTTCATA GCCTCAGCAG TTCCTCATTC GATGCTTGGA TCAAGTTCTA TCGACCTGAT GAATCAACGC CTAACACGAC AGTTTCCTAT TATCTCAAGG GTGGGTTGGC GGCATTGGTG CTCGATATGC AGTTGCGCGA GCAGAGTAAC GGTCAGCAAT CGCTTGATGA TTTGATTCGC TATCTCTATC AGACGTATCC AATCACTGGC CCGGGGATTC CCGAAGCCGA TGGCATGCAG CAAGCCCTGC AAGCACTCAC AGGCAGCGAT TGGAGTGACT ATTTCGCCAA GTATATCGAT GGATTAAGCG AATTACCCTA CGCCGAAGCC TTTGCCACCG TTGGCTTGCA AATGCAATGG AACTACAAAG ATCGTGATGC GCAAGGCAAT CCACGGCCCC AATTAGGGAT TCGCAGCAAA GCTGTTGATG GTCGGGTGCA AATTACCCAT GTGCTCGATG GCGGCGACGC TGATCGGGCT GGCTTGGCCG CTGGCGATGA ATTAATTGCG CTCGATGGTT GGCGCATCGA TGAAGATGGC TTGAACAAGC GGCTAGCCGA TTATCCAATT GGCGCAACCG TGCAATTAAG TTTCTTCCGC CGTGATGAAT TATTGCACGT GCCCGTTACA TTTAGCCAAC CCAACCCTGA TTTCTTGAGC CTAACCTTGG TGAGCCAACC AAGCGCCAGC CAACGCCAAC AAGCAGCGAC ATGGCTCGGT ACGCCATTGT TTAAATAA
|
Protein sequence | MPVSVVLDYT LAIPQPQRHL INVTLRIEGL TGSETSLQLP AWTPGSYLLR EYARHVRFVE AHTDGQALAI HKTDRLTWQV QTNGASAITV TYQVYGYELT VRTNHIDATH AHIVPAATLL YLPEYQDCRY NVHLELPSSW EVATGLPKLA DGCYGTNDLD ELMDCPFEVG ELRRYRFDVA EKAHEVVVWG HGNEDIEQVL ADTKQIVETE YAFWGDLPYD YYLFIVLLAG ANTYGGLEHR NSTSLLLPRH VFKPSKTYER AQGLIAHEFF HTWNVKRLRA APLGPFDYTR ENYTRLLWVM EGFTEYYTDL MLVRAGLMTP QRYLERLADD ISTLQNTPGR LVHSLSSSSF DAWIKFYRPD ESTPNTTVSY YLKGGLAALV LDMQLREQSN GQQSLDDLIR YLYQTYPITG PGIPEADGMQ QALQALTGSD WSDYFAKYID GLSELPYAEA FATVGLQMQW NYKDRDAQGN PRPQLGIRSK AVDGRVQITH VLDGGDADRA GLAAGDELIA LDGWRIDEDG LNKRLADYPI GATVQLSFFR RDELLHVPVT FSQPNPDFLS LTLVSQPSAS QRQQAATWLG TPLFK
|
| |
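The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene length includes the terminal stop codon, and that the gene sequence is listed 5' to 3' on the coding strand (the record is on the minus strand). The helper names `span_length`, `coding_residues`, and `translate` are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares.

```python
# Values copied from the record above.
START_BP = 6154274
END_BP = 6156031
GENE_LENGTH_BP = 1758
PROTEIN_LENGTH_AA = 585

# First 30 bases of the listed gene sequence and first 10 residues of the
# listed protein sequence, with the spacing between 10-character groups removed.
GENE_PREFIX = "ATGCCTGTGTCTGTTGTCCTTGATTACACC"
PROTEIN_PREFIX = "MPVSVVLDYT"

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate span (assumes 1-based, inclusive)."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

Under these assumptions the three checks agree with the record: the span from Start bp to End bp covers 1758 bp, 1758 bp corresponds to 585 residues plus a stop codon, and the opening codons of the gene sequence translate to the opening residues of the protein sequence.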