Gene RSP_3574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_3574 
SymbolhutH 
ID3722088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007494 
Strand
Start bp668567 
End bp670138 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content69% 
IMG OID640073237 
Producthistidine ammonia-lyase 
Protein accessionYP_355075 
Protein GI77465572 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCCA TGAGCCCCCC GAAGCCGGCC GTCGAGCTGG ATCGCCACAT CGATCTGGAC 
CAGGCCCATG CCGTGGCGAG CGGCGGCGCG CGGATTGTCC TTGCCCCTCC GGCGCGCGAC
CGGTGCCGTG CGTCCGAAGC GCGGCTCGGC GCTGTCATCC GCGAGGCGCG CCATGTCTAC
GGACTGACAA CCGGCTTCGG TCCCCTTGCG AACCGCCTGA TCTCAGGTGA GAATGTCCGA
ACGCTGCAGG CCAATCTTGT CCATCATCTG GCCAGCGGCG TGGGACCGGT GCTTGACTGG
ACGACGGCGC GCGCCATGGT TCTGGCGCGT CTGGTGTCGA TCGCTCAGGG AGCCTCCGGT
GCCAGCGAGG GGACCATCGC TCGCCTGATC GACCTGCTCA ATTCCGAGCT CGCTCCGGCC
GTTCCCAGCC GCGGCACGGT GGGCGCGTCG GGTGACCTGA CACCGCTTGC GCATATGGTG
CTCTGCCTCC AGGGCCGGGG AGACTTCCTG GACCGGGACG GGACGCGGCT TGACGGCGCA
GAAGGGCTCC GGCGCGGACG GCTGCAACCG CTCGATCTCT CCCATCGCGA TGCACTGGCG
CTGGTCAACG GGACCTCCGC CATGACCGGG ATCGCGCTGG TGAATGCTCA CGCCTGCCGC
CATCTCGGCA ACTGGGCGGT GGCGTTGACG GCCCTGCTTG CGGAATGTCT GAGAGGCCGG
ACCGAGGCAT GGGCCGCGGC ACTGTCCGAC CTGCGGCCGC ATCCCGGACA GAAGGACGCC
GCAGCGAGGC TGCGCGCCCG CGTGGACGGC AGCGCGCGGG TGGTCCGGCA CGTCATTGCC
GAGCGGAGGC TCGACGCCGG CGATATCGGG ACGGAGCCGG AGGCGGGGCA GGATGCCTAC
AGCCTGCGCT GCGCTCCGCA GGTTCTCGGG GCGGGCTTCG ACACGCTCGC ATGGCATGAC
CGGGTGCTGA CGATCGAGCT GAACGCGGTG ACCGACAATC CGGTGTTTCC GCCCGATGGC
AGCGTGCCCG CCCTGCACGG GGGCAATTTC ATGGGCCAGC ATGTGGCGCT GACGTCCGAT
GCGCTCGCCA CGGCCGTCAC CGTTCTGGCG GGCCTTGCGG AGCGCCAGAT TGCACGTCTG
ACAGATGAAA GGCTGAACCG TGGGCTGCCC CCCTTCCTCC ACCGGGGCCC CGCCGGGTTG
AATTCCGGCT TCATGGGCGC ACAGGTGACG GCGACCGCGC TCCTGGCCGA GATGCGAGCC
ACGGGACCTG CCTCGATCCA TTCGATCTCC ACGAACGCCG CCAATCAGGA TGTGGTCTCG
CTTGGGACCA TCGCCGCGCG CCTCTGCCGC GAGAAGATCG ACCGTTGGGC GGAGATCCTT
GCGATCCTCG CTCTCTGTCT TGCACAAGCT GCGGAGCTGC GCTGCGGCAG CGGCCTAGAC
GGGGTGTCTC CCGCGGGGAA GAAGCTGGTG CAGGCCCTGC GCGAGCAGTT CCCGCCGCTT
GAGACGGACC GGCCCCTGGG ACAGGAAATT GCCGCGCTTG CTACGCACCT CTTGCAGCAA
TCTCCCGTCT GA
 
Protein sequence
MLAMSPPKPA VELDRHIDLD QAHAVASGGA RIVLAPPARD RCRASEARLG AVIREARHVY 
GLTTGFGPLA NRLISGENVR TLQANLVHHL ASGVGPVLDW TTARAMVLAR LVSIAQGASG
ASEGTIARLI DLLNSELAPA VPSRGTVGAS GDLTPLAHMV LCLQGRGDFL DRDGTRLDGA
EGLRRGRLQP LDLSHRDALA LVNGTSAMTG IALVNAHACR HLGNWAVALT ALLAECLRGR
TEAWAAALSD LRPHPGQKDA AARLRARVDG SARVVRHVIA ERRLDAGDIG TEPEAGQDAY
SLRCAPQVLG AGFDTLAWHD RVLTIELNAV TDNPVFPPDG SVPALHGGNF MGQHVALTSD
ALATAVTVLA GLAERQIARL TDERLNRGLP PFLHRGPAGL NSGFMGAQVT ATALLAEMRA
TGPASIHSIS TNAANQDVVS LGTIAARLCR EKIDRWAEIL AILALCLAQA AELRCGSGLD
GVSPAGKKLV QALREQFPPL ETDRPLGQEI AALATHLLQQ SPV