Gene RPB_2084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2084 
Symbol 
ID3908497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2369361 
End bp2370596 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content66% 
IMG OID637883976 
Productamidohydrolase 
Protein accessionYP_485701 
Protein GI86749205 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.680329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.210595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTCG ATCTCATTCT CCGCAACGCC GCCGTCGTCG ATCGTCCCGA GACGGCGCTC 
GACATCGGCA TCAAGGCCGG ACGCTTCGCC GCGATCGAGG CGGGCCTGCC GGCGACCGGC
GCGCCGGAGC ACGATTGCGG CGGCCGCCTC GTCGTGCCGG GCTTCGTCGA AACCCATATC
CATCTCGACA AATCCTGCCT GCTCGGCCGC TGCAATTGCG AGAAGGGTAC GCTCGACGAG
GCCATCGCCG AAGTGGTGAA AGCCAAGCGC GGCTTCACCG AGGAGGACGT GTACCAGCGC
GCGTCGCAGA CGCTGGAAAA GGCGATCAAG AACGGCACCA ATCAGATGCG CACCCACGTT
GAAGTGGACC CGCGGGTCGG GCTCACCAGC TTTCGCGCGC TGAAGCAATT GAAGCTGGAC
TATGCCTGGG CGATCGATCT GCAGCTCTGC GTGTTTCCGC AAGAGGGCCT GCTCGACGAT
CCCGGCTGCG ACGCGGTGAT GCTGCAGGCG CTGCGCGAGG GCGCCGACGT GGTCGGCGGC
GCGCCCTATA TGGACAAGGA TTCCCACGGC CAGATCGGCC GCATCTTCGA GATGGCCAAA
CAGTTCGGCG TCGATATCGA CTTCCATCTC GATTTCGGCC TCGACCCGGC GCATCTCGAT
CTCGACGAGG TCTGCCGGAT GGCGGACGCC ACCGGCTGGG GCGGCCGCGT CGCGATCGGG
CATGTGACGA AGCTGTCGGC GATGCCCAAG GCCGCATTCG ACGCCGCGGC CAAGCGCCTT
GCGGACGCCG GCGTCGCCGT CACCGTGCTG CCGGCGACCG ATCTGTTTCT CACCGGCCGC
GACGCCGAGT TCAACGTGCC GCGCGGCGTG ACGCCGGCGC ACTGGCTGCG TCATCACGGC
GTCAATTGCA GCATCTCGAC CAACAACGTG CTCAACCCGT TCACGCCGTT CGGCGACTGC
TCGCTGATCC GGATGATCAA TCTCTACGCC AACATCACCC AGGTCGCGGC GACCGCCGAT
CTCACGGGCT GCCTCGACAT GGTGACGTCG GGCTCGGCGA AGCTGATCAA CCGCGCGGAT
TACGGCATCG CCGTCGGCCA TCCGGCCGAT CTCGTGGTGC TGGATTGCGC CACCAAGGCG
CAGGCGGTCT GCGAGATCGC GCAGCCTTTG TTCGGGCTGA AGCGCGGCCT GCGCACATTC
GACAAGCCCG CCGCAATCCT GCACAAGCCG AACTGA
 
Protein sequence
MTFDLILRNA AVVDRPETAL DIGIKAGRFA AIEAGLPATG APEHDCGGRL VVPGFVETHI 
HLDKSCLLGR CNCEKGTLDE AIAEVVKAKR GFTEEDVYQR ASQTLEKAIK NGTNQMRTHV
EVDPRVGLTS FRALKQLKLD YAWAIDLQLC VFPQEGLLDD PGCDAVMLQA LREGADVVGG
APYMDKDSHG QIGRIFEMAK QFGVDIDFHL DFGLDPAHLD LDEVCRMADA TGWGGRVAIG
HVTKLSAMPK AAFDAAAKRL ADAGVAVTVL PATDLFLTGR DAEFNVPRGV TPAHWLRHHG
VNCSISTNNV LNPFTPFGDC SLIRMINLYA NITQVAATAD LTGCLDMVTS GSAKLINRAD
YGIAVGHPAD LVVLDCATKA QAVCEIAQPL FGLKRGLRTF DKPAAILHKP N