Gene Rru_A1304 details


Gene Information

Locus tag: Rru_A1304
Symbol:
ID: 3833612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1538781
End bp: 1540451
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 68%
IMG OID: 637825394
Product: urocanate hydratase
Protein accession: YP_426392
Protein GI: 83592640
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
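
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(1538781, 1540451)
print(gene_len)                  # 1671
print(protein_length(gene_len))  # 556
```

Both results match the Gene Length and Protein Length fields of this record.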


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.235615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGATC GTCTCGACAA TCAGCGCGTG GTGCGCGCCC CCCGTGGCCC CGAGATCACT 
TGCAAAAGCT GGCTGTCGGA AGCCCCTTTG CGCATGTTGA TGAACAACCT CGACCCCGAG
GTGGCTGAAA AGCCGGAAGA ATTGGTGGTT TATGGCGGCA TCGGCCGGGC GGCGCGCAAT
TGGCAATGCT ATGACCAGAT CGTCGCCGCG CTCAAGCGCC TGGAAGACGA TGAAACCCTG
TTGGTCCAAT CGGGCAAGGC GGTGGGCGTG TTCAAAACCC ACCGCGACGC CCCGCGCGTG
CTGATCGCCA ATTCCAATCT GGTGCCCCAT TGGGCGACCT GGGAGACCTT TCACGCCCTC
GACCGCGCCG GGCTGATGAT GTACGGCCAG ATGACCGCCG GGTCGTGGAT CTATATCGGC
AGCCAGGGCA TCGTTCAGGG CACCTATGAG ACCTTCGTCG AAGCCGGGCG GCGCCATTAC
GGTGGGTCTT TGGCCGGACG CTGGATCCTG ACCGGCGGCT TGGGCGGTAT GGGCGGCGCC
CAGCCGCTGG CCGCCACCAT GGCCGGGGCG TCGATGCTGG CGGTGGAGTG CCAGCCCAGT
CGCATCGAGG CGCGGCTGCG CACGGGCTAT CTTGATCGCC AGACCGCCGA TCTTGATCAA
GCCCTGGCCT GGATCGCCGA GGCCGGCGCT CCGGGGGCCA AGCCGGTTTC CGTCGGCCTG
CTCGGCAATG CCGCCGAGGT GTTTCCCGAG CTGGTCAAGC GTGGCGTCCG CCCCGATCTG
GTGACCGACC AGACCTCGGC CCACGACCCG CTGAACGGCT ATCTGCCCGC CGGGTGGAGC
CTGGAGCGTT GGGAGCGGGG GCGCGAGCGC GCGCCGGCCG AGGTGATCGC CGCGGCCAAG
GCGTCGATGG CGACCCAGGT GCGCGCCATG CTGGCCTTCC ACGCCCAGGG CATTCCCACC
GTCGATTACG GCAACAACAT CCGCCAAAGG GCGCTTGAGG AAGGGGTGAG CGACGCCTTC
GCCTTCCCCG GCTTCGTGCC GGCCTATATC CGGCCGCTGT TTTGCCGGGG CATCGGTCCC
TTCCGCTGGG CGGCGCTGTC GGGCGATCCC GAGGATATCT ATCGCACCGA CGCCAAGGTG
AAGGAGCTGA TCCCCGATGA TCCCCATCTG CACACCTGGC TCGACATGGC GCGCGAGCGT
ATCCACTTCC AGGGACTGCC CTCGCGCATC TGCTGGGTCG GCCTGGGCGA TCGCCATCGC
CTGGGGCTGG CCTTCAACGC CATGGTGGCG AGCGGTGAAT TAAAGGCGCC GGTGGTCATC
GGCCGCGACC ACCTTGACAG CGGCTCTGTC GCCAGCCCCA ACCGCGAGAC CGAGGCGATG
CGCGACGGCT CCGACGCCGT GTCGGACTGG CCGTTGCTCA ACGCCCTGCT CAATACCGCC
GGCGGCGCCA CTTGGGTCAG CCTGCACCAC GGCGGCGGCG TCGGCATGGG CTTTTCCCAG
CATGCGGGCA TGGTCATCGT CTGCGACGGC AGCGAGGACG CGGCGCGGCG CATCGGCCGC
GTGCTGTGGA ACGACCCGGC CACCGGCGTC ATGCGCCACG CCGACGCCGG CTACGACGAC
GCCATCGCCT GCGCCCGGGA AAAAGGTCTG GACCTGCCGT TCTTGGGGTA A
 
Protein sequence
MSDRLDNQRV VRAPRGPEIT CKSWLSEAPL RMLMNNLDPE VAEKPEELVV YGGIGRAARN 
WQCYDQIVAA LKRLEDDETL LVQSGKAVGV FKTHRDAPRV LIANSNLVPH WATWETFHAL
DRAGLMMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRRHY GGSLAGRWIL TGGLGGMGGA
QPLAATMAGA SMLAVECQPS RIEARLRTGY LDRQTADLDQ ALAWIAEAGA PGAKPVSVGL
LGNAAEVFPE LVKRGVRPDL VTDQTSAHDP LNGYLPAGWS LERWERGRER APAEVIAAAK
ASMATQVRAM LAFHAQGIPT VDYGNNIRQR ALEEGVSDAF AFPGFVPAYI RPLFCRGIGP
FRWAALSGDP EDIYRTDAKV KELIPDDPHL HTWLDMARER IHFQGLPSRI CWVGLGDRHR
LGLAFNAMVA SGELKAPVVI GRDHLDSGSV ASPNRETEAM RDGSDAVSDW PLLNALLNTA
GGATWVSLHH GGGVGMGFSQ HAGMVIVCDG SEDAARRIGR VLWNDPATGV MRHADAGYDD
AIACAREKGL DLPFLG
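
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in the 10-base blocks shown here; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 assigns codons to amino acids the same way the standard code does and differs only in which start codons are permitted, so a standard codon table is used.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumes the CDS starts with ATG and contains only A, C, G, T;
    an alternative start codon would need to be read as M separately.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGTCCGATC GTCTCGACAA TCAGCGCGTG GTGCGCGCCC CCCGTGGCCC CGAGATCACT"
print(translate(first_line))  # MSDRLDNQRVVRAPRGPEIT
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.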