Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0492 |
Symbol | |
ID | 5732406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 574177 |
End bp | 575307 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277618 |
Product | peptidase M20 |
Protein accession | YP_001543271 |
Protein GI | 159897024 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0471333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGTC AGCTTTTAAC CTATGTTGAT GCTTGCCTAC CCGATCTGCT GGATGAAATG CGTCAATGGA TCGAAATTGA ATCGTTTACC CGTGATATTA CGGCGGTTTC TTGGATGGTC AACGTGGTTG GCGAGCGTTT GAGTAAGCTT GGTGCAAGTG TGCGCAAATA TAATGGCAAG CCCCAAGCCG ACCATTTGTT GGCAAGTTGG CCAGGCGAGG GCGAACCATT GCTAATTGTG GGCCATGTTG ATACCGTTTA TCCGCCAGGC ACGATTGATC AATTTCCGTT CCGCATCGAT GGCGATGTGG TGCGTGGGCC TGGAGTCAGC GATATGAAGG GCTGTATTTT GCTGACTTGC GCCGCCTTGC AAGCCTTACG CCACTTTAGC CGCTGGACCA GCCGCCCCTT GAAATTTTTA ATTACGACCG ATGAAGAGAT TGGTAGCCCA ACCTCCCGAC GGTATATTGA AGAACAGGCT CGCGGTTGTC GCGCAGCCTT GATTATCGAA TCAGCAGAAG AGGGTGGTTG GCTCAAAACA TGGCGCAAAA GTGTCAGTAT GTATGACTTA ACAATTACTG GCAAGCCCTC GCATGCGGGG GTAGCCCCGG AGCTTGGCAT TAGCGCGATT CACGAATTAA GCTACCAAAT TGGCCAGATT TTGCCCTTGG CGCGGCCTGA AATTGGCACA ACGATCAATA TTGGCAAAAT TAATGGTGGT ACTGCCACCA ATGTTGTAGC CGCCGAGGCC CATTGCACGA TCGATGTGCG GGCATTAAAA GTTGGCGAGG CGGAACGGGT TGATCAAGCG CTGCATCAAT TAGTGCCCCA TTTGGCTGGC GCAAAATTAA CTTTAGAAGG TGGCGTAAAT CGCCCAGCCA TGGAACAAAC GCCTGCCACA ATGGCATTAT ATGCTGCTGC CGAGCAAATT GCCAATCAAT TGGATTTGCC GATTAAAGCT AGTGGCACTG GCGGCGGTTC GGATGGCAAT TTCACGTCGG CGATCGGTGT GCCAACCCTC GATGGGCTTG GTGGCTGGGG CAGTGATTCG CATAGCTTCG ATGAATGGCT TTCGATCAGC CAATTTGCCC CACGGGCTGC CTTGCTGGCT CGTTTGATTG AGACATTGTA G
|
Protein sequence | MPRQLLTYVD ACLPDLLDEM RQWIEIESFT RDITAVSWMV NVVGERLSKL GASVRKYNGK PQADHLLASW PGEGEPLLIV GHVDTVYPPG TIDQFPFRID GDVVRGPGVS DMKGCILLTC AALQALRHFS RWTSRPLKFL ITTDEEIGSP TSRRYIEEQA RGCRAALIIE SAEEGGWLKT WRKSVSMYDL TITGKPSHAG VAPELGISAI HELSYQIGQI LPLARPEIGT TINIGKINGG TATNVVAAEA HCTIDVRALK VGEAERVDQA LHQLVPHLAG AKLTLEGGVN RPAMEQTPAT MALYAAAEQI ANQLDLPIKA SGTGGGSDGN FTSAIGVPTL DGLGGWGSDS HSFDEWLSIS QFAPRAALLA RLIETL
|
| |