Gene Rsph17029_3256 details


Gene Information

Locus tag: Rsph17029_3256
Symbol:
ID: 4899183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 323112
End bp: 324683
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 69%
IMG OID: 640113854
Product: histidine ammonia-lyase
Protein accession: YP_001045124
Protein GI: 126464011
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2986] Histidine ammonia-lyase
TIGRFAM ID: [TIGR01225] histidine ammonia-lyase
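
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(323112, 324683)
print(length)                     # 1572
print(protein_length_aa(length))  # 523
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.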


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.06186
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGCCA TGAGCCCCCC GAAGCCGGCC GTCGAGCTGG ATCGCCACAT CGATCTGGAC 
GAGGCCCATT CCGTGGCGAG CGGCGGCGCG CGGATTGTCC TTGCCCCTCC AGCGCGCGAC
CGGTGCCGTG CGTCCGAAGC GCGGCTCGGC GCTGTCATCC GCGAGGCGCG CCATGTCTAC
GGACTGACAA CCGGCTTCGG TCCCCTTGCG AACCGACTGG TCTCAGGTGA GAATGTCCGA
ACGCTGCAGG CCAATCTTGT CCATCATCTG GCCAGCGGCG TGGGGCCGGT GCTTGACTGG
ACGACGGCCC GCGCCATGGT TCTGGCGCGT CTGGTGGCGA TCGCTCAGGG AGCCTCCGGT
GCCAGCGAGG GGACCATCGC TCGCCTGATC GACCTGCTCA ATTCCGAGCT CGCTCCGGCC
GTGCCAATGC GCGGCACGGT GGGCGCGTCG GGTGACCTGA CACCGCTTGC GCATATGGTG
CTCTGCCTCC AGGGCCGGGG AGACTTCCTG GACCGGGACG GGACGCGGCT TGACGGCGCA
GAAGGGCTCC GGCGCGGACG GCTGCAACCG CTCGATCTCT CCCATCGCGA TGCACTGGCG
CTGGTCAACG GGACCTCCGC CATGACCGGG ATCGCGCTGG TGAATGCTCA CGCCTGCCGC
CATCTCGGCA ACTGGGCGGT GGCGTTGACG GCCCTGCTTG CGGAATGTCT GGGAGGCCGG
ACCGAGGCAT GGGCCGCGGC ACTGTCCGAC CTGCGGCCGC ATCCCGGACA GAAGGACGCC
GCAGCGAGGC TGCGCGCCCG CGTGGACGGC AGCGCGCGGG TGGTCCGGCA CGTCATTGCC
GAGCGGAGGC TCGGCGCCAG CGATATCGGG ACGGAGCCGG AGGCGGGGCA GGATGCCTAC
AGTCTGCGCT GCGCTCCGCA GGTTCTCGGG GCGGGCTTCG ACACGCTCGC ATGGCATGAC
CGGGTGCTGA CGATCGAGCT GAACGCGGTG ACCGACAATC CGGTGTTTCC GCCCGATGGC
AGCGTGCCCG CCCTGCACGG GGGCAATTTC ATGGGCCAGC ATGTGGCGCT GACGTCCGAT
GCGCTCGCCA CGGCCGTCAC CGTTCTGGCC GGCCTTGCGG AGCGCCAGAT TGCACGTCTG
ACAGATGAAA GGCTGAACCG TGGGCTGCCC CCCTTCCTCC ACCGGGGCCC CGCCGGTTTG
AATTCCGGGT TCATGGGCGC ACAGGTGACG GCGACCGCGC TCCTGGCCGA GATGCGAGCC
ACGGGACCTG CCTCGATCCA TTCGATCTCC ACGAACGCCG CCAATCAGGA TGTGGTCTCG
CTTGGGACCA TCGCCGCGCG CCTCTGCCGC GAGAAGATCG ACCGTTGGGC GGAGATCCTT
GCGATCCTCG CTCTCTGTCT TGCACAAGCT GCGGAGCTGC GCTGCGGCAG CGGCCTCGAC
GGGGTATCTC CCGCGGGGAA GAAGCTGGTG CAGGCCCTGC GCGAGCAGTT CCCGCCGCTT
GAGACGGATC GGCCCCTGGG ACAGGAAATT GCCGCGCTTG CTACGCACCT CTTGCAGCAA
TCTCCCGTCT GA
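
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The sketch below is one way to do it, assuming the sequence is pasted as shown here, in blocks of ten bases separated by spaces and line breaks. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First ten-base block of the gene sequence above: 6 of 10 bases are G or C.
print(gc_percent("ATGCTCGCCA"))  # 60.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field of the record.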
 
Protein sequence
MLAMSPPKPA VELDRHIDLD EAHSVASGGA RIVLAPPARD RCRASEARLG AVIREARHVY 
GLTTGFGPLA NRLVSGENVR TLQANLVHHL ASGVGPVLDW TTARAMVLAR LVAIAQGASG
ASEGTIARLI DLLNSELAPA VPMRGTVGAS GDLTPLAHMV LCLQGRGDFL DRDGTRLDGA
EGLRRGRLQP LDLSHRDALA LVNGTSAMTG IALVNAHACR HLGNWAVALT ALLAECLGGR
TEAWAAALSD LRPHPGQKDA AARLRARVDG SARVVRHVIA ERRLGASDIG TEPEAGQDAY
SLRCAPQVLG AGFDTLAWHD RVLTIELNAV TDNPVFPPDG SVPALHGGNF MGQHVALTSD
ALATAVTVLA GLAERQIARL TDERLNRGLP PFLHRGPAGL NSGFMGAQVT ATALLAEMRA
TGPASIHSIS TNAANQDVVS LGTIAARLCR EKIDRWAEIL AILALCLAQA AELRCGSGLD
GVSPAGKKLV QALREQFPPL ETDRPLGQEI AALATHLLQQ SPV
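
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs from it only in the permitted start codons. The sketch below builds the codon table from a compact string in T, C, A, G order. It is a minimal illustration with hypothetical names, and it does not handle alternative start codons, which is harmless here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCTCGCCA TGAGCCCCCC GAAGCCGGCC"))  # MLAMSPPKPA
print(translate("TCTCCCGTCT GA"))                     # SPV
```

The two outputs match the first ten and last three residues of the protein sequence listed above.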