Gene RPB_3623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3623 
Symbol 
ID3911425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4158364 
End bp4159611 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content67% 
IMG OID637885525 
Productallantoate amidohydrolase 
Protein accessionYP_487229 
Protein GI86750733 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.90869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGC CTGCCTCGAA CCTGCAGATC GATTCGTCGC GGCTGTGGGA CACCATCCAC 
ACCACCGCGC AATTCGGCGC CACGCCGAAA GGCGGCGTGA GACGGCTGAC GCTCGGGCCG
GAGGACAAGC AGGTCCGCGA CTGGTTCCGA GCCGCCTGCG AAAACGCCGG CCTCGACGTC
CAGGTCGACG GGCTCGGCAA CATGTTCGCG TTGCGCCGCG GCCGCGACAT GTCGAAGCCG
CCGCTCGGGC TCGGCTCGCA TCTCGACACC CAGCCGACCG GCGGCAAGTT CGACGGCATC
CTCGGCACAC TCGCCGCACT CGAAGTGATC CGCACCCTGA ACGACGCCGG CATCGAAACC
GAGCTGCCGC TCTGCGTCAC GAACTGGACC AACGAAGAAG GCTCGCGCTA CGCGCCGGCG
ATGATGGGAT CGGCCGCCTT CGTCGGCGAT TTCACGGTCG ACGACATTCT CGGGCGCAAG
GACGCCGCCG GCATCAGCGT CGCCGAGGCG CTCGACAGCA TCGGCTATCG CGGTGATCTG
CCGGTCGGCG CGCAGCCGTT CAGCGGCTTC CTCGAACTGC ACATCGAACA GGGGCCGATC
CTCGAAGCCG AAGGCAAGAC CATCGGCGTC GTCGAAAACG GTCAGGGCGT GCTGTGGTAC
GACGGCAGGA TCACCGGCTT CGAAAGCCAT GCCGGATCGA CGCCGATGAA TCTTCGCCGC
GACGCGCTGG CGACGCTGTC GGAAATCGTG CTGGCGATCG AAGCCATCGC GGTCGAACTC
GGCAATGCAG TCGGCACCGT CGGCGAAGCC GTGATCGCCT CACCCTCGCG CAACGTCATC
CCCGGCGAGA TCGCTTTCAC CATCGACGCG CGTAGCGCCG ACGCGGCGAT CCTCGCGCAA
CTCGACGAGC GCATCCGCGC CGCGGCGGCC GGGATCGCGG CGAAGCGCAA GGTCGAGGTC
ACACTCGATC TGGTCTGGCG CAAGGAGCCG ACGCATTTCG ACAAGACGCT GGTCGGCGCG
GTCGAGAGCG CAGCGAACGC GCTCGGCTAT GCCAATCGCC GCATCACCTC GGGCGCCGGC
CACGACGCCT GCAATCTCAA CGCCAAGGTG CCGACGGCGA TGATTTTCGT GCCGTGCAAG
GACGGCGTCA GCCACAACGA GCTCGAGGAC GCTACGCAGA CCGACTGCGC CTCCGGCGCC
AATGTGCTGC TGCACACGGT GCTGTCGCTC GCGGGCGTCG CGAAGTAA
 
Protein sequence
MTKPASNLQI DSSRLWDTIH TTAQFGATPK GGVRRLTLGP EDKQVRDWFR AACENAGLDV 
QVDGLGNMFA LRRGRDMSKP PLGLGSHLDT QPTGGKFDGI LGTLAALEVI RTLNDAGIET
ELPLCVTNWT NEEGSRYAPA MMGSAAFVGD FTVDDILGRK DAAGISVAEA LDSIGYRGDL
PVGAQPFSGF LELHIEQGPI LEAEGKTIGV VENGQGVLWY DGRITGFESH AGSTPMNLRR
DALATLSEIV LAIEAIAVEL GNAVGTVGEA VIASPSRNVI PGEIAFTIDA RSADAAILAQ
LDERIRAAAA GIAAKRKVEV TLDLVWRKEP THFDKTLVGA VESAANALGY ANRRITSGAG
HDACNLNAKV PTAMIFVPCK DGVSHNELED ATQTDCASGA NVLLHTVLSL AGVAK