Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4949 |
Symbol | |
ID | 5736785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6276438 |
End bp | 6277535 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282116 |
Product | peptidase M20 |
Protein accession | YP_001547707 |
Protein GI | 159901460 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAATCC GTACAAGTTT GGTTGATCTG ACCACGCGCT TGGTGGCGAT TCCCAGTGTT TCCGCCGAAA AACGCGATTT GCAGCCAGTG ATCGATCTGG TGGTGGCTGA ATTAGCCGAT TATCCAGCGG CGTTGTTGCA TCATCGCGAT GCTAATGGCT ACCCAATGTT GGTGGTCAAT TTCAACCAAG AACTGCGCAG CGATCTTATT TTGAATGCCC ATTTGGATGT TGTGCCAGCC CGCCCTGAGC AATGGCACGC CTTCGAGCAT GATGGCAAAT TGTATGGTCG TGGCACGCAA GATATGAAGG GATCGGCGGC GGTCTACATT GAAATTATTA AAGAAATTGC CCAATTGCCT GCTGAGCAAC GCCCTAACGT AAGCTTTCAA TTTGTGACCG ATGAGGAAAT TGGCGGAGCA AATGGCACAG CCTTATTGCG TGATGAAGGC TGGCAGGCTA ATTTATTTAT TGCTGGCGAG CCGACCAACC TGAATATTTG TCATGGAGCC AAGGGCATTT TATGGCTGGC AGTTGAGCAA CCAGGCGTGC CAGCCCATGG TTCGCGGCCT TGGGAAGGCG TGAATCCGAT TGAGCGTTTG GCAAGTGGCC TTGGGCGTTT GTACGAATAT TATCCAACGC CTGCGCAAGA AATTTGGCGC ACTACGGTTA CGCCTTCGAT TATCAAAGGC GGCGATGCTG GCAATCGGAT TCCAGCCAAT GCCCAACTGA ATCTTGATAT TCGCTGGACA CCCGAAGAAG GTGCTGATGC GGTGATTGAT AACGTGAAGC AAGCCTTTGC AACGAGCAGC GAACCCAATC CCAATGTGCA GATTTTGCAT CGTGGCACGG CCCTAAATAC GCCAGCCGAG GAGCCAAACT TACAACGCAT TGTTGATGCA CAACAATCCA GCCTTGGTCG CCAAGCCCAA CTCTTCCGCG AGCATTTTGG CTCCGATGCC CGCTTCTACA GCGATGCCGG AATTCCAGCG GTCTGTTGGG GGCCAGAAGG TGCAGGCTTG CATACCGACG ACGAGTGGGT CAGCATCGAT GGCTTGGTCG ATTATTATCA GGCGGTCAAA ACCTTGTTGG GTATGTAG
|
Protein sequence | MSIRTSLVDL TTRLVAIPSV SAEKRDLQPV IDLVVAELAD YPAALLHHRD ANGYPMLVVN FNQELRSDLI LNAHLDVVPA RPEQWHAFEH DGKLYGRGTQ DMKGSAAVYI EIIKEIAQLP AEQRPNVSFQ FVTDEEIGGA NGTALLRDEG WQANLFIAGE PTNLNICHGA KGILWLAVEQ PGVPAHGSRP WEGVNPIERL ASGLGRLYEY YPTPAQEIWR TTVTPSIIKG GDAGNRIPAN AQLNLDIRWT PEEGADAVID NVKQAFATSS EPNPNVQILH RGTALNTPAE EPNLQRIVDA QQSSLGRQAQ LFREHFGSDA RFYSDAGIPA VCWGPEGAGL HTDDEWVSID GLVDYYQAVK TLLGM
|
| |