Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_3574 |
Symbol | hutH |
ID | 3722088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007494 |
Strand | - |
Start bp | 668567 |
End bp | 670138 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640073237 |
Product | histidine ammonia-lyase |
Protein accession | YP_355075 |
Protein GI | 77465572 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2986] Histidine ammonia-lyase |
TIGRFAM ID | [TIGR01225] histidine ammonia-lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.183418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGCCA TGAGCCCCCC GAAGCCGGCC GTCGAGCTGG ATCGCCACAT CGATCTGGAC CAGGCCCATG CCGTGGCGAG CGGCGGCGCG CGGATTGTCC TTGCCCCTCC GGCGCGCGAC CGGTGCCGTG CGTCCGAAGC GCGGCTCGGC GCTGTCATCC GCGAGGCGCG CCATGTCTAC GGACTGACAA CCGGCTTCGG TCCCCTTGCG AACCGCCTGA TCTCAGGTGA GAATGTCCGA ACGCTGCAGG CCAATCTTGT CCATCATCTG GCCAGCGGCG TGGGACCGGT GCTTGACTGG ACGACGGCGC GCGCCATGGT TCTGGCGCGT CTGGTGTCGA TCGCTCAGGG AGCCTCCGGT GCCAGCGAGG GGACCATCGC TCGCCTGATC GACCTGCTCA ATTCCGAGCT CGCTCCGGCC GTTCCCAGCC GCGGCACGGT GGGCGCGTCG GGTGACCTGA CACCGCTTGC GCATATGGTG CTCTGCCTCC AGGGCCGGGG AGACTTCCTG GACCGGGACG GGACGCGGCT TGACGGCGCA GAAGGGCTCC GGCGCGGACG GCTGCAACCG CTCGATCTCT CCCATCGCGA TGCACTGGCG CTGGTCAACG GGACCTCCGC CATGACCGGG ATCGCGCTGG TGAATGCTCA CGCCTGCCGC CATCTCGGCA ACTGGGCGGT GGCGTTGACG GCCCTGCTTG CGGAATGTCT GAGAGGCCGG ACCGAGGCAT GGGCCGCGGC ACTGTCCGAC CTGCGGCCGC ATCCCGGACA GAAGGACGCC GCAGCGAGGC TGCGCGCCCG CGTGGACGGC AGCGCGCGGG TGGTCCGGCA CGTCATTGCC GAGCGGAGGC TCGACGCCGG CGATATCGGG ACGGAGCCGG AGGCGGGGCA GGATGCCTAC AGCCTGCGCT GCGCTCCGCA GGTTCTCGGG GCGGGCTTCG ACACGCTCGC ATGGCATGAC CGGGTGCTGA CGATCGAGCT GAACGCGGTG ACCGACAATC CGGTGTTTCC GCCCGATGGC AGCGTGCCCG CCCTGCACGG GGGCAATTTC ATGGGCCAGC ATGTGGCGCT GACGTCCGAT GCGCTCGCCA CGGCCGTCAC CGTTCTGGCG GGCCTTGCGG AGCGCCAGAT TGCACGTCTG ACAGATGAAA GGCTGAACCG TGGGCTGCCC CCCTTCCTCC ACCGGGGCCC CGCCGGGTTG AATTCCGGCT TCATGGGCGC ACAGGTGACG GCGACCGCGC TCCTGGCCGA GATGCGAGCC ACGGGACCTG CCTCGATCCA TTCGATCTCC ACGAACGCCG CCAATCAGGA TGTGGTCTCG CTTGGGACCA TCGCCGCGCG CCTCTGCCGC GAGAAGATCG ACCGTTGGGC GGAGATCCTT GCGATCCTCG CTCTCTGTCT TGCACAAGCT GCGGAGCTGC GCTGCGGCAG CGGCCTAGAC GGGGTGTCTC CCGCGGGGAA GAAGCTGGTG CAGGCCCTGC GCGAGCAGTT CCCGCCGCTT GAGACGGACC GGCCCCTGGG ACAGGAAATT GCCGCGCTTG CTACGCACCT CTTGCAGCAA TCTCCCGTCT GA
|
Protein sequence | MLAMSPPKPA VELDRHIDLD QAHAVASGGA RIVLAPPARD RCRASEARLG AVIREARHVY GLTTGFGPLA NRLISGENVR TLQANLVHHL ASGVGPVLDW TTARAMVLAR LVSIAQGASG ASEGTIARLI DLLNSELAPA VPSRGTVGAS GDLTPLAHMV LCLQGRGDFL DRDGTRLDGA EGLRRGRLQP LDLSHRDALA LVNGTSAMTG IALVNAHACR HLGNWAVALT ALLAECLRGR TEAWAAALSD LRPHPGQKDA AARLRARVDG SARVVRHVIA ERRLDAGDIG TEPEAGQDAY SLRCAPQVLG AGFDTLAWHD RVLTIELNAV TDNPVFPPDG SVPALHGGNF MGQHVALTSD ALATAVTVLA GLAERQIARL TDERLNRGLP PFLHRGPAGL NSGFMGAQVT ATALLAEMRA TGPASIHSIS TNAANQDVVS LGTIAARLCR EKIDRWAEIL AILALCLAQA AELRCGSGLD GVSPAGKKLV QALREQFPPL ETDRPLGQEI AALATHLLQQ SPV
|
| |