Gene Rru_A1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A1302 
Symbol 
ID3833610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp1536036 
End bp1537403 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content68% 
IMG OID637825392 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_426390 
Protein GI83592638 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCAACC TGTTTTTCCA GTCGCTTTTG CTGCCCGAGG GCTGGGCGGA AAACGTCGCC 
ATGACGGTCG ATGAAAACGG CATGATCGCG ACGCTAAGCC CGGGGTCGCC ACCGCCGGCC
AGCGGCCCGT CGTTCCGCGG CCCGGCCTTC GCCGGCATCC CCAACCTTCA TTCCCACGCC
CATCAGCGCG CCCTGGCCGG ATCGGGTGAA CGCTCGGGCG GCGATGGCGA GGACAGCTTC
TGGAGCTGGC GCAAGGCGAT GTACGCCGCC CTGGCCCGCC TGACCCCCGA AGCTTTTGAA
GATGTGGCCA CCCAGCTTTA TGTGGAGATG GTCAAAGCCG GCTACACCGC CGTCGCCGAA
TTCCACTATC TGCACCACGA CCGCGACGGC CGCCCCTTCG CCGATCCGGC CGAGATGAGC
CATCGTCTGG TCGCCGCCGC CCGCACGGCG GGGATCGCGC TGACCCTGCT TCCCGTTCTC
TACAGCGCCT CGGGCTTTGA TGGCGCCCCG CCCACGGAAG GCCAGAAACG CTTTCACACC
ACCGGATCGT CCTTTGGCGC CCTGGTCGAG CGCCTGAAGC GCGACTATGG CCGCGACGGC
GCCATCATGC TTGGCATCGC GCCGCATTCC CTGCGCGCCG TTCCCGCGCC GCTGCTGGCC
GAGGTGATCG GCGCCCACCC GGAAGGCCCG ATCCACCTGC ATATCGCCGA ACAGACGATC
GAGGTTACTG ATTGCCTTGC CCATACCGGC CAGCGCCCGG TGGAGTGGCT GCTTGACCAT
GTCGATCTTG ACCCGCGCTG GTGCCTGATC CATGCCACCC ATGTCACCGA CCAGGAACTG
GCCGGTATCG CCGCCAGCCG CGCCGTCGTC GGCCTTTGCC CAACGACCGA GGCCAATCTT
GGCGACGGCC TGTTCCCGGC CGATCGGTTC CTGGGGCTTG GCGGGCGGTT CGGCATCGGC
TCGGACAGCC ATATCTCGGT CAATCCGGTC GAGGAATTGC GCTGGCTGGA ATACGGCCAG
CGGTTGACCA CCCGCCGGCG CACCGTGCTG GCCGGCGGCA TCGACCGTTC GACCGGCCGC
GCCCTGATCG AACAGGCCCA GATCTCGGGG GCGACGGCCT GCGCGATCAA GGCCGGGCGG
CTGGCGGTCG GCCAGCGCGC CGATATCGTC GTGCTGGATG GCGAGGCGCC CGTGCTGTGC
GGGCGCTCGG GCGATGGCGC CCTTGATGCC TGGATTTTTT CGGGCAATGC CCCGACCGTC
CATTCGGTGG TGGTTGGCGG CGCCCTTGTC GTTGAAAATG GCCGCCATCG GGCCGAAGAG
GCCGTGGCCC GGCGTTTCGC ATCCACCCTT GGGAGACTTC TCGCATGA
 
Protein sequence
MRNLFFQSLL LPEGWAENVA MTVDENGMIA TLSPGSPPPA SGPSFRGPAF AGIPNLHSHA 
HQRALAGSGE RSGGDGEDSF WSWRKAMYAA LARLTPEAFE DVATQLYVEM VKAGYTAVAE
FHYLHHDRDG RPFADPAEMS HRLVAAARTA GIALTLLPVL YSASGFDGAP PTEGQKRFHT
TGSSFGALVE RLKRDYGRDG AIMLGIAPHS LRAVPAPLLA EVIGAHPEGP IHLHIAEQTI
EVTDCLAHTG QRPVEWLLDH VDLDPRWCLI HATHVTDQEL AGIAASRAVV GLCPTTEANL
GDGLFPADRF LGLGGRFGIG SDSHISVNPV EELRWLEYGQ RLTTRRRTVL AGGIDRSTGR
ALIEQAQISG ATACAIKAGR LAVGQRADIV VLDGEAPVLC GRSGDGALDA WIFSGNAPTV
HSVVVGGALV VENGRHRAEE AVARRFASTL GRLLA