Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0004 |
Symbol | |
ID | 5736838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4561 |
End bp | 5760 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277125 |
Product | amidohydrolase |
Protein accession | YP_001542784 |
Protein GI | 159896537 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.209653 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAGTC GTCCAGATTT TCGGTATGTT GCTCACACGT TGGTCGAGCA ATTGATCACT GATCGCCGCG ATTTACACCA GCATCCTGAA CTTGGCTTCG AGGAGTTTCG TACCGCCAAA ATTGTGGCTG ATCGTTTGCG TGAGTTAGGC TACGAGGTTA CCGAGGGGGT TGCCACCACC GGCGTTTTAG GCCATATTCC GGCTCAGCCA GGCGGCAAAG TTGCCATGTT GCGCTTCGAC ATGGATGCCT TGCCAATCCA CGAGCAAAAC GATGTCGATT ACCGCTCAAC CATCGACGGC AAAATGCATG CTTGTGGCCA TGATGGCCAT GTTGCGATCG GCTTAGGCGT GGCCGCAGCC CTGATGCAAA ATCGCGAAGC GCTTGGCACA GGTGGGATTA AATTGCTATT CCAGCCTGCC GAAGAAGGCG GCGGCGGCGC TCAAAAGATG GTCGAAGCAG GCGCGATGCA AAATCCACGG CCTGATATTT CGCTTGGTTT GCATATTTGG GCACCCATGC CCTTGGGTAA AGCCAATGTG CGTTCAGGGC CAATTATGGC TTCTGCCGAT ACCTTTATCG TGGAAATTAC TGGCAAAGGT GGCCACGGCG CTCAGCCTGA AACTACCGTC GATTCGGTTT TGGTGGCTTC ACATATGGTC GTTGCGTTGC ATTCAATCGT TAGCCGCAAC GTTCACCCTG AACAGCCCGC AGTGCTTTCG GTTGGTTCGG TACAAGCTGG CACAGCTCAT AATATCATCG CCCACAACGC CACTCTAACT GGCACAATTC GCAGCTATGA CCCCGAAGCT CGCGAGCGCT TGAAACAACG AGTGCATGAA GTAGTGCAAG GCGTGGCGGC AACCTTTGGC GCAACCGCTA CCCTCAAATA CGATGAAATG TGCCCAGCAA CCATCTGCGA CCCTGCGGCA ACCGCCTTGG TACGTGGTGC AGCTGAAGCG ATTTTGGGCG CGGAGAACGT CGATGACAGC GTGCGCACCA TGGGTTCAGA AGATATGTCG GTGCTGTTGA ATGAAGTGCC TGGCTGCTAT TTCTTCTTGG GCGGGCAAAC CCTTGAGCGC GAGTTGGGCG CACATCCGCA TCATCACCCA GCATTTAGCT TCGATGAAGG CGTATTGCCC TTGGGCGTTG CCATTTTATG TGAAGCCGCA ACCCGCTATC TCAACGGGAG CAACGAATGA
|
Protein sequence | MASRPDFRYV AHTLVEQLIT DRRDLHQHPE LGFEEFRTAK IVADRLRELG YEVTEGVATT GVLGHIPAQP GGKVAMLRFD MDALPIHEQN DVDYRSTIDG KMHACGHDGH VAIGLGVAAA LMQNREALGT GGIKLLFQPA EEGGGGAQKM VEAGAMQNPR PDISLGLHIW APMPLGKANV RSGPIMASAD TFIVEITGKG GHGAQPETTV DSVLVASHMV VALHSIVSRN VHPEQPAVLS VGSVQAGTAH NIIAHNATLT GTIRSYDPEA RERLKQRVHE VVQGVAATFG ATATLKYDEM CPATICDPAA TALVRGAAEA ILGAENVDDS VRTMGSEDMS VLLNEVPGCY FFLGGQTLER ELGAHPHHHP AFSFDEGVLP LGVAILCEAA TRYLNGSNE
|
| |