Gene Haur_4629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4629 
Symbol 
ID5736476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5914984 
End bp5916570 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content53% 
IMG OID641281793 
Producthistidine ammonia-lyase 
Protein accessionYP_001547388 
Protein GI159901141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAATGTT TAGTGCTCAA TGGCGAGCAG TTAACAGTTG ATGGTTTGGT GGCTGCCGCT 
CGTAATCCGG CAATTAAGGT CGAATTAGCG CCCGAAGCAA TTGAACGAAT GCACTATTCT
CGCGCTGCCG TCGAGCGATT TGTGGCCGAA GGTCGCGTGG TCTATGGCAT TACCACGGGC
TTTGGTCATT TTCAAAATCG TACAATCGAT CGCGACCATG TGCGCGAGTT GCAACGCAAT
ATTATTATGA GCCACGCCAC TGGCACAGGC ACGCCGCTGC GCCGCGACCA AGTACGCGCC
ATGTTGATCG TGCGAGTCAA TACCTTGGCT AAAGGCTTTT CAGGGATTCG CCCGCTGGTT
GCACAAGCCT TGCTTGATCT GCTCAACGCC GATATTTTGC CAATTATTCC TTGTCAAGGC
TCGCTTGGAG CTAGCGGCGA TTTGGCTCCC CTCGCCCATG CCTGTTTGAT TTTGCTGGGC
TTGGGCGAGG CGGTTGCTCC AGGTCAATCG CCAGTCCATG GCCAACGCAT GAGTGGAGCC
GAAGTTTTAG CCCACTTGCA GCAAGAACCT TTGGTTTTAG AGGCTAAAGA AGGCTTAGCA
TTAACTAATG GCACGGCATT ATTGAGTGGC TTAGCCGCCT TGGCAATCTA CGATGCCGAG
CAACTTTGCC GCAGTGCCGA GACTATCGCC GCCTTGTCGA TGGAAGCTTT GGCGGCTTTG
CCAGCAGCCT TCGATCAGCG GTTGCATGCA ATTCGTCCGC ATCCACGTCA GCTTGATAGT
GCGCGGAGCA TTCGTCAATT GTTGCAAGGC AGTAGCTTTG TTTACCCCAG CCAAGCCGCT
GATCCGACTA TTTATGGGCC GCATAAAGTC CAAGATGCCT ACTCGTTGCG CTGTGTGCCT
CAAGTCCATG GGGCAATTCG CGATGCAGCC TGTTATGGGC GTTGGGCTAC CGAGATTGAA
CTCAACAGCG CTACCGATAA CCCCTTGATT GTTCCTGTTG ATCCTGCGCA ACCCCATGGC
GAATATGAGG CGATTTCGGG CGGTAACTTT CATGGTGAGC CTCTCGCATT AGCCATGGAT
TTTCTGAAAG TGGCGTTGAG CGAATTGGGC AACATCAGCG AGCGCCGCAC TGCTCGCTTG
GTTGATGCAG GTTTGAATGG CAATTTACTC GCCCCGTTTT TAACCGAGCA AGGCGGCCTG
CACTCAGGCA TGATGTTGAT TCAATATACG GCTGTGGCTT TGGCGAGCGA AAATAAAGTG
CTGGTACACC CCGCTGCTGC TGATACGATT CCTACCTCGG GTAATCAAGA AGATCATGTC
AGTATGGGGC CGACTGCTGC CCGTCAGGCT GCCGAGATGC TCGATAATGT GGTGGGTATT
TTGGCCTGTG AAGCCTTATG CGCGGCCCAA GCGATCGATT TACGTTGGCG CAAACACGAG
CATTTACAGC TGGGCCAAGG AACTGCGCCC GCCCATCAAG TAATTCGCCA GGTTGTGCCA
TTTCTAGCTG AAGATACCGT GATGTACCCG CATATCGAAG GCCTGAAACA GGTGATTCAG
GCTGGTAAAT TGGCCTTAGC CGAATGA
 
Protein sequence
MECLVLNGEQ LTVDGLVAAA RNPAIKVELA PEAIERMHYS RAAVERFVAE GRVVYGITTG 
FGHFQNRTID RDHVRELQRN IIMSHATGTG TPLRRDQVRA MLIVRVNTLA KGFSGIRPLV
AQALLDLLNA DILPIIPCQG SLGASGDLAP LAHACLILLG LGEAVAPGQS PVHGQRMSGA
EVLAHLQQEP LVLEAKEGLA LTNGTALLSG LAALAIYDAE QLCRSAETIA ALSMEALAAL
PAAFDQRLHA IRPHPRQLDS ARSIRQLLQG SSFVYPSQAA DPTIYGPHKV QDAYSLRCVP
QVHGAIRDAA CYGRWATEIE LNSATDNPLI VPVDPAQPHG EYEAISGGNF HGEPLALAMD
FLKVALSELG NISERRTARL VDAGLNGNLL APFLTEQGGL HSGMMLIQYT AVALASENKV
LVHPAAADTI PTSGNQEDHV SMGPTAARQA AEMLDNVVGI LACEALCAAQ AIDLRWRKHE
HLQLGQGTAP AHQVIRQVVP FLAEDTVMYP HIEGLKQVIQ AGKLALAE