Gene RPB_1671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1671 
Symbol 
ID3908658 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1902793 
End bp1903989 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content68% 
IMG OID637883565 
Productamidohydrolase 
Protein accessionYP_485290 
Protein GI86748794 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.446042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCACG ACGCCGCCGC GCCCACGGGA CCGACCAAGC TCGTCATTCG CAACATCGGC 
CTCTTGATCA GCGGTGACCT CGACAAGCCG ATCCTCGATG CCGACACCAT CGTTGCGGAG
AACGGCAAGA TCTCCGCGAT CGGTCGGCTG AAGGACGTCG ACACCGAAGG CGCGACCACC
ACCGTCGATG CGGGTGGCGC TGCGGTCACC CCGGGCCTGA TCGACAGCCA CGTCCATCCG
GTGGCGGGCG ACTGGACGCC GCGGCAGAGC CAACTCAACT GGATCGACTC CTCGCTGCAC
GGCGGCGTCA CCACCATGAT CTCGGCCGGC GAGGTGCACT ATCCCGGCCG GCCGCGCGAC
GTCATCGGCA TCAAGGCGCT GGCGATCACC GCGCAGCGCA GCTTCTCCGC CTTCCGCGCC
AGCGGCGTGA AGGTCCATGC CGGCGCTCCG GTGATCGAGC ACGAGATGGA AGAGAACGAC
TTCAAGGAAC TCGCCGCGGC CGGCGTCAAG CTGCTCGGCG AGATTGGGCT CGGCGGCGTC
AAGGACGGAC CGACCGCGAA GAAGATGGTG GCGTGGGCGC GCAAATACGG CATCCAGTCC
ACCATCCACA CCGGCGGCCC GTCCATCGCG GGCTCCGGGC TGATCGACAA GGACGTGGTG
CTGGAAGCCG GCACCGACGT GATCGGCCAC ATCAACGGCG GCCACACCGC GCTGCCCGAC
GGGCAGATCC GCTGCATCTG CGAGGGCTGC AAGGCCGGGC TCGAGCTGGT CCATAACGGC
AACGAGCGCT CGGCGCTGTA CACGCTGCGG ATCGCGCGCG AGATGGGCGA TCTCCATCGC
GTCATTCTCG GCACCGACGG CCCGGCCGGC TCCGGTGTCC AGCCGCTCGG CATCCTGCGG
ATGATTTCGC TGTTGTCGTC GCTCGGCGAT CTTCCCGCCG AACAGGCGTT CTGCCTCGCC
ACCGGCAACA CCGCGCGGAT GCGCGATCTC GACTGCGGCC TGATCGAGGT CGGCCGCGTC
GCCGATTTCG TGATCATGGA CGCGGCCCAG CACTCGGCCA GTTCGTCGCT GCTGGAAAGC
GTCCGCCTCG GCGATCTGCC GGGCATCGGC ATGACCATCA TCGACGGCAT CGTGCGCAGC
GAACGCTCCC GCAACACCCC GCCGGCGACG CGACTGCCGA GCATCGTCAA GGCCTGA
 
Protein sequence
MAHDAAAPTG PTKLVIRNIG LLISGDLDKP ILDADTIVAE NGKISAIGRL KDVDTEGATT 
TVDAGGAAVT PGLIDSHVHP VAGDWTPRQS QLNWIDSSLH GGVTTMISAG EVHYPGRPRD
VIGIKALAIT AQRSFSAFRA SGVKVHAGAP VIEHEMEEND FKELAAAGVK LLGEIGLGGV
KDGPTAKKMV AWARKYGIQS TIHTGGPSIA GSGLIDKDVV LEAGTDVIGH INGGHTALPD
GQIRCICEGC KAGLELVHNG NERSALYTLR IAREMGDLHR VILGTDGPAG SGVQPLGILR
MISLLSSLGD LPAEQAFCLA TGNTARMRDL DCGLIEVGRV ADFVIMDAAQ HSASSSLLES
VRLGDLPGIG MTIIDGIVRS ERSRNTPPAT RLPSIVKA