Gene Haur_0004 details

Gene Information

Locus tag: Haur_0004
Symbol:
ID: 5736838
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4561
End bp: 5760
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 54%
IMG OID: 641277125
Product: amidohydrolase
Protein accession: YP_001542784
Protein GI: 159896537
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
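
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_len = span_length(4561, 5760)
print(gene_len, protein_length(gene_len))  # prints: 1200 399
```

Both values agree with the Gene Length and Protein Length fields listed for Haur_0004.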


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.209653
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAGTC GTCCAGATTT TCGGTATGTT GCTCACACGT TGGTCGAGCA ATTGATCACT 
GATCGCCGCG ATTTACACCA GCATCCTGAA CTTGGCTTCG AGGAGTTTCG TACCGCCAAA
ATTGTGGCTG ATCGTTTGCG TGAGTTAGGC TACGAGGTTA CCGAGGGGGT TGCCACCACC
GGCGTTTTAG GCCATATTCC GGCTCAGCCA GGCGGCAAAG TTGCCATGTT GCGCTTCGAC
ATGGATGCCT TGCCAATCCA CGAGCAAAAC GATGTCGATT ACCGCTCAAC CATCGACGGC
AAAATGCATG CTTGTGGCCA TGATGGCCAT GTTGCGATCG GCTTAGGCGT GGCCGCAGCC
CTGATGCAAA ATCGCGAAGC GCTTGGCACA GGTGGGATTA AATTGCTATT CCAGCCTGCC
GAAGAAGGCG GCGGCGGCGC TCAAAAGATG GTCGAAGCAG GCGCGATGCA AAATCCACGG
CCTGATATTT CGCTTGGTTT GCATATTTGG GCACCCATGC CCTTGGGTAA AGCCAATGTG
CGTTCAGGGC CAATTATGGC TTCTGCCGAT ACCTTTATCG TGGAAATTAC TGGCAAAGGT
GGCCACGGCG CTCAGCCTGA AACTACCGTC GATTCGGTTT TGGTGGCTTC ACATATGGTC
GTTGCGTTGC ATTCAATCGT TAGCCGCAAC GTTCACCCTG AACAGCCCGC AGTGCTTTCG
GTTGGTTCGG TACAAGCTGG CACAGCTCAT AATATCATCG CCCACAACGC CACTCTAACT
GGCACAATTC GCAGCTATGA CCCCGAAGCT CGCGAGCGCT TGAAACAACG AGTGCATGAA
GTAGTGCAAG GCGTGGCGGC AACCTTTGGC GCAACCGCTA CCCTCAAATA CGATGAAATG
TGCCCAGCAA CCATCTGCGA CCCTGCGGCA ACCGCCTTGG TACGTGGTGC AGCTGAAGCG
ATTTTGGGCG CGGAGAACGT CGATGACAGC GTGCGCACCA TGGGTTCAGA AGATATGTCG
GTGCTGTTGA ATGAAGTGCC TGGCTGCTAT TTCTTCTTGG GCGGGCAAAC CCTTGAGCGC
GAGTTGGGCG CACATCCGCA TCATCACCCA GCATTTAGCT TCGATGAAGG CGTATTGCCC
TTGGGCGTTG CCATTTTATG TGAAGCCGCA ACCCGCTATC TCAACGGGAG CAACGAATGA
 
Protein sequence
MASRPDFRYV AHTLVEQLIT DRRDLHQHPE LGFEEFRTAK IVADRLRELG YEVTEGVATT 
GVLGHIPAQP GGKVAMLRFD MDALPIHEQN DVDYRSTIDG KMHACGHDGH VAIGLGVAAA
LMQNREALGT GGIKLLFQPA EEGGGGAQKM VEAGAMQNPR PDISLGLHIW APMPLGKANV
RSGPIMASAD TFIVEITGKG GHGAQPETTV DSVLVASHMV VALHSIVSRN VHPEQPAVLS
VGSVQAGTAH NIIAHNATLT GTIRSYDPEA RERLKQRVHE VVQGVAATFG ATATLKYDEM
CPATICDPAA TALVRGAAEA ILGAENVDDS VRTMGSEDMS VLLNEVPGCY FFLGGQTLER
ELGAHPHHHP AFSFDEGVLP LGVAILCEAA TRYLNGSNE
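
The relationship between the gene and protein sequences in this record can be sketched in Python: strip the space-separated 10-character grouping, translate codon by codon, and compare the result with the listed protein. This is a minimal sketch that assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative, and only the first 60 bases and first 20 residues are used to keep the example short.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character grouping and line breaks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_gene_row = "ATGGCTAGTC GTCCAGATTT TCGGTATGTT GCTCACACGT TGGTCGAGCA ATTGATCACT"
first_protein_residues = "MASRPDFRYV AHTLVEQLIT"

print(translate(first_gene_row) == clean(first_protein_residues))  # prints: True
print(round(gc_fraction(first_gene_row), 3))
```

Applying `gc_fraction` to the full 1200 bp gene sequence, rather than to one row, is what would be compared against the GC content field of the record.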