Gene Rru_A1301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1301 
Symbol 
ID3833609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1534495 
End bp1536039 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content68% 
IMG OID637825391 
Producthistidine ammonia-lyase 
Protein accessionYP_426389 
Protein GI83592637 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2986] Histidine ammonia-lyase 
TIGRFAM ID[TIGR01225] histidine ammonia-lyase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAGC CGCTTTGCCT CACCCCCGGC GGACTCAGCC TGGATGTATT GCGCCGTATC 
CATCGCGAGG CGCCGCCGTT GACGCTCGAT CCGCGCTCTT ACGCGGCGAT GGCCGCCTCG
CAGGCGGTGG TGGCGGCGAT CGCCGGTGGC GAAAGCGCCG TTTATGGCAT CAACACCGGC
TTTGGCAAGC TCGCCCACAA GCGCATCGCC CCGGCCGATC TTGAAGCCCT GCAGACCAAT
CTGATCTTGT CGCACGCCAC CGGCATGGGG GCGCCGATCG CCGATGCCAC GGTGCGGCTG
ATCCTGGCGA TCAAAGCCGC CAGTCTGGCG GTTGGCGCCT CGGGCATCCG CGCCGAGATC
GTCGACGCCC TGCTCGCCCT GGCCAATGCC GATGTGCTAC CGGTGATCCC GTCAAAGGGC
TCGGTCGGCG CCTCGGGCGA TCTGGCGCCG CTCGCCCATC TGTGCTGCGC CCTGCTTGGC
ATCGGCTCGG TTCGCCATAA GGGCGCGGTG CTGCCGGCCG GCGAAGGTCT GGCGATCGCC
GGCCTGTCCC CGATCACCCT GCGGGCCAAG GAAGGTCTAG CGCTGATCAA CGGCACCCAG
GTGTCAACCG CCCTGGCCTT GGCCGGCTTG TTCGAGATCG AGCGCGCCTT CGCCGCCGCC
ATCCTGGCCG GGGCGCTGTC GGTCGAGGCG GTGATGGGCA GCCACCGCCC CTTCGACCCG
CGGATCAGCG CCCTGCGCGG CCAGTTCGGC CAGATTGATG TCGCCGCCCT TTTCCGCCTG
CTGCTCGATG GCAGCCCGCT GAACGCCGCC CATCAGGGAC CGTCGTGCGA GCGGGTTCAA
GACCCCTATT CCCTGCGCTG TCAGCCCCAG GTGATGGGGG CGGTGCTTGA TCAGATGCGC
TTCGCCGCGC GCACCCTGAC CATCGAAGCC AATGGCGTGA CCGATAATCC GCTGGTGCTG
GTCGATACGG GCGAGGTGCT GTCGGGGGGC AACTTCCATG CCGAGCCGGT GGCGATGGCC
GCCGATCAGT TGGCGATCGC CGCCTCGGAG ATCGGCGCCT TGTCGGAACG GCGCATCGCC
ATGCTGATCG ACAGCACGAT CAGCGGCCTG CCGCCCTTCC TGGTCGCCGA ACCGGGGTTG
AATTCGGGCT TCATGATCGC CCATGTGACG GCCGCCGCCC TGGCCTCCGA GAACAAATCC
CTGGCCCATC CCGCCAGCGT CGACAGCTTG CCAACCTCGG CCAATCAGGA AGACCACGTC
AGCATGGCGA CCTTCGCCGC CCGCCGCCTG GGCGACATCG CCGCCAATGT CACGGGCATC
GTCGGCATCG AACTGCTCGC CGCCGCCCAG GGTCTGGAAT TCCACCGCCC CTTGCGCTCG
AGCCAAACCC TGGAAACGGC CATGGCGATG ATCCGCGAGC GGGTGCCCTC CTATCGCGTC
GACCGCTATT TCGCCCCCGA CCTGGAAGCC ATCGCCCACC TGATCGGCGA AGGCCGCTTC
GACGCCCTGG TCCCGGTCGA TCTGTCGACC CTGGGATCGG TCTGA
 
Protein sequence
MTEPLCLTPG GLSLDVLRRI HREAPPLTLD PRSYAAMAAS QAVVAAIAGG ESAVYGINTG 
FGKLAHKRIA PADLEALQTN LILSHATGMG APIADATVRL ILAIKAASLA VGASGIRAEI
VDALLALANA DVLPVIPSKG SVGASGDLAP LAHLCCALLG IGSVRHKGAV LPAGEGLAIA
GLSPITLRAK EGLALINGTQ VSTALALAGL FEIERAFAAA ILAGALSVEA VMGSHRPFDP
RISALRGQFG QIDVAALFRL LLDGSPLNAA HQGPSCERVQ DPYSLRCQPQ VMGAVLDQMR
FAARTLTIEA NGVTDNPLVL VDTGEVLSGG NFHAEPVAMA ADQLAIAASE IGALSERRIA
MLIDSTISGL PPFLVAEPGL NSGFMIAHVT AAALASENKS LAHPASVDSL PTSANQEDHV
SMATFAARRL GDIAANVTGI VGIELLAAAQ GLEFHRPLRS SQTLETAMAM IRERVPSYRV
DRYFAPDLEA IAHLIGEGRF DALVPVDLST LGSV