Gene RPD_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1844 
Symbol 
ID4022326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2063932 
End bp2065179 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content67% 
IMG OID637962038 
Productallantoate amidohydrolase 
Protein accessionYP_568981 
Protein GI91976322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGC CTGCCTCGAA CCTGCAGATC GATTCGTCGC GGCTGTGGGA GACCATCCAC 
ACCACTGCGC AGTTCGGCGC GACGCCGAAA GGCGGCGTGC GGCGGCTGAC GCTCGGGCCG
GAGGACAAGC AGGTTCGCGA CTGGTTTCGC ACCGCCTGCG AAAAGGCCGG CCTCGAGGTG
CAGGTCGACG GGCTCGGCAA CATGTTCGCG CTGCGCAAGG GCCGCGACAT GTCGAAGCCG
CCGATCGGAC TCGGCTCGCA TCTCGACACC CAACCGACCG GCGGCAAGTA CGACGGCGTA
CTTGGGACAC TCGCCGCGCT GGAAGTGATC CGCACACTGA ACGACGCCGG GATCGAGACC
GACCTGCCGC TCTGCGTCAC CAACTGGACC AATGAGGAAG GCTCGCGCTA CGCGCCGGCG
ATGATGGGAT CGGCCGCGTT CGTCGGCGAC TTCACCGTCG ACGACATCCT CGGCCGCAAG
GATGCCGCGG GCATCAGCGT CGCGCAGGCG CTCGACGACA TCGGCTATCG CGGCGACAAG
CCGGTCGGCG CGCAGCCATT CAGCGGCTTC ATCGAACTCC ACATCGAACA AGGCCCGATC
CTGGAAGCCG AGGGCAAGAC CATCGGCGTG GTCGAGCACG GCCAGGGCGT GCTGTGGTAC
GACGGCAAGA TCACAGGCTT CGAGAGTCAC GCCGGCTCGA CGCCGATGAA GCACCGACGC
GACGCGCTGG CGACGCTGTC GGAAATCGTG CTGGCGATCG AAACCATCGC TACCGAACTC
GGCAACGCGG TCGGCACCGT CGGCGAAGCG ATGATCGCCG CGCCTTCGCG CAACGTCATC
CCCGGCGAGG TCACCTTCAC CATCGACACT CGCAGCGCCG ATCCGGGCAT CCTCGACCAG
CTCGACGCGC GAATCCGCGC GGCTGCGGCC GGGATCGCCG CGAAACGCAA GGTCGAGGTC
GCACTCGACC TGGTCTGGCG CAAGGAACCG ACGCATTTCG ATCCGACGCT GGTCGGCGCG
GTCGAGAACG CGGCGAACGC GCTCGGCTAC CAAAACCGCC GCATCACCTC CGGCGCCGGC
CACGACGCCT GCAACCTCAA CGCCAAGCTG CCGGCCGCGA TGATCTTCGT GCCGTGCAAG
GACGGCGTCA GCCACAACGA GCTGGAAGAC GCAACGCAGA GCGACTGCGC CGCCGGCGCC
AACGTGCTGC TGCACACGGT GCTGTCGCTC GCCGGCGTCG CGAAGTAA
 
Protein sequence
MTKPASNLQI DSSRLWETIH TTAQFGATPK GGVRRLTLGP EDKQVRDWFR TACEKAGLEV 
QVDGLGNMFA LRKGRDMSKP PIGLGSHLDT QPTGGKYDGV LGTLAALEVI RTLNDAGIET
DLPLCVTNWT NEEGSRYAPA MMGSAAFVGD FTVDDILGRK DAAGISVAQA LDDIGYRGDK
PVGAQPFSGF IELHIEQGPI LEAEGKTIGV VEHGQGVLWY DGKITGFESH AGSTPMKHRR
DALATLSEIV LAIETIATEL GNAVGTVGEA MIAAPSRNVI PGEVTFTIDT RSADPGILDQ
LDARIRAAAA GIAAKRKVEV ALDLVWRKEP THFDPTLVGA VENAANALGY QNRRITSGAG
HDACNLNAKL PAAMIFVPCK DGVSHNELED ATQSDCAAGA NVLLHTVLSL AGVAK