Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2833 |
Symbol | |
ID | 5734714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3601368 |
End bp | 3602699 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279976 |
Product | peptidase M20 |
Protein accession | YP_001545599 |
Protein GI | 159899352 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.03601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTGCTC CCCTATCCAC TCTCGATGAA CTGATTTTGC ACTATGTCCG TGCCTTGGTA GCCTTACCAA GTGTTACTGG CGATGACCCA GCCTTGGAAC AGGCTGCGCC GACGATTGCT GATATATTGC GCGGCCTTGG GTTTCAGGTC AATGTGCACC CAACTGAAGG TGCACCGATT ATTTTGGCCC ATCGGCCTGG CAAAAGCGCT CAACGCTTGC TCTTTTTTAA TCACTACGAT GTGATGCCAG CTGGGGTCTG GCGTGATTGG TTTCATGAGC CGTTTACCTT GGCTGAGCGC GAAGGGCTGC TCTATGGGCG TGGCGTTGCC AACGATAAAG GCAATTTAGC TGCGCGAATT GCTGCTGTGG CTCAAATTTT GGCCGAAACT GGCGATCTCC CAGTTGGTGT GACGTTTTTA ATTGAAGGTG ACGGGCTGAG TGGTAGCCCA TCATTAGCTA ATTTGGTTGC CGATCAAGCT AATCATTTAA CTGCCGATTT GGTAGTTGGC TATGGTGGCA TGCTCGATCA AGCGCGTTTG CCCTACTTGT ATGCTGGGGT TCGTGGGCGC TTGTTGGTAC GCTTGCGAGC TGAGGGTGCT AAAATTCCGA TCGGTGCTGA TATGGCGACC AGTGTACCCA ACCCGGCTTG GCGCATCCTT TGGGCGGTCA ATCGGATTAA AAATGATAGC GAAGAAGTGA CGATTGATGG GTTTTATGAT GCGGTTGTGC CACCAAGCCG CGAGGCCAAT AAGCTCACTC GTGGCTTGCA GCTCGATGAG GAAACGCGAC TAAAAGCTTG GGGCATGCCG CAATTTTTAT TTGGCATGAC CGGGGCTGGT CTGGCTCGGG CTGAAACCTT CAACCCAACC TGTAATGTTG CCGGATTAAC GGTTGATACC GGTCATAGCC CCCCTCCAAC CATCCCTGCC AGTGCCGAAG TGCTGCTCGA TTTCAGTCTT GTGCCTGAGC AACGCCCAAC CGAAGTTGCC CGTTTGTTGC GTGAGCACCT CAATAGTGCC GATTTCCATG ATGTGCATTT GGAGATTATC AAGGGTGCCT ATCCGCCCGC AATGAATGCG TTGAGCACAC CATTACTCAA TAGTTTAGCG ACGGCAATTG AACAGGTTTA TGGCTCAACT CCGCAAATTG TGCCGCTTGC GCCATTCTCG GTGCCGCTAC ACCTTTTTAC CGCTGGGATG AATGTGCCAG CCGTTGCCTT GGGAATTCAA CGCCCCGATA GCAACGTTCG GGTCATCAAT GAACATATTC AGTTGGCCGA TCTCAAAGCG ATGGCTAGTT TGATTCAACA ACTGATTATC GGCCTTGGCT AA
|
Protein sequence | MVAPLSTLDE LILHYVRALV ALPSVTGDDP ALEQAAPTIA DILRGLGFQV NVHPTEGAPI ILAHRPGKSA QRLLFFNHYD VMPAGVWRDW FHEPFTLAER EGLLYGRGVA NDKGNLAARI AAVAQILAET GDLPVGVTFL IEGDGLSGSP SLANLVADQA NHLTADLVVG YGGMLDQARL PYLYAGVRGR LLVRLRAEGA KIPIGADMAT SVPNPAWRIL WAVNRIKNDS EEVTIDGFYD AVVPPSREAN KLTRGLQLDE ETRLKAWGMP QFLFGMTGAG LARAETFNPT CNVAGLTVDT GHSPPPTIPA SAEVLLDFSL VPEQRPTEVA RLLREHLNSA DFHDVHLEII KGAYPPAMNA LSTPLLNSLA TAIEQVYGST PQIVPLAPFS VPLHLFTAGM NVPAVALGIQ RPDSNVRVIN EHIQLADLKA MASLIQQLII GLG
|
| |