Gene RPD_0936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0936 
Symbol 
ID4021411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1051991 
End bp1053388 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content66% 
IMG OID637961127 
Productargininosuccinate lyase 
Protein accessionYP_568075 
Protein GI91975416 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA AGATGTGGGG CGGCCGGTTC ACGGATCGCC CCGACGCGAT CATGGAAGAG 
ATCAACGTCT CGATCGACGT CGATCGGCAT CTTTACGCCC AGGACATCGC GGCGTCCAAG
GCTCATGCCG CCATGCTCGC CGCGCAGGGC ATCATCACCG CAAACGATGC GAAAAATATC
GGCAAGGGTC TAGACACGAT TCTGTCAGAG ATCGGCGCGG GCAAATTCAC GTTCAAGCGT
GCGCTCGAGG ACATTCACAT GAATGTCGAG AGCCGGCTCG CCGAGCTGAT CGGGCCGGCC
GCGGGAAGGT TGCATACCGC ACGCTCGCGC AACGACCAGG TCGCGACCGA TTTCCGGCTG
TATGTCCGCG ACGTCCTCGA CGAGACCGAC GCGGCGCTGG CGAGCTTCCA GCGCGCCTTG
GTGGAACGGG CGCTCGAGCA CGCCGAGACC GTGATGCCGG GCTTCACCCA TCTGCAGACC
GCGCAGCCGG TGACTTTCGG CCATCATCTG ATGGCTTACG TCGAAATGGC GGCGCGCGAT
CGCGGCCGAT TCCAGGACGC CCGCAAGCGG CTCAATGAAA GCCCGCTCGG CGCCGCGGCG
CTGGCTGGCA CCTCGTTCCC GATCGACCGC CACGCCACCG CGAAGAAGCT CGGCTTCGAT
CGTCCGATGG CGAATTCGCT CGACGCGGTG TCGGATCGCG ACTTCGTGCT GGAGACGCTG
TCGGCGGCTT CGATCTGCGC GGTGCACCTG TCGCGCTTCG CCGAGGAAAT CGTGATCTGG
ACCTCGCCGC TGGTCGGGCT GGTGCGGTTG AGCGACAAGT TCACCACCGG CTCCTCGATC
ATGCCGCAGA AGCGCAACCC GGATGCCGCC GAGCTGGTCC GCGCCAAGAC CGGGCGGGTG
ATCGGCGCGC TGAACGGCCT GTTGATCGTG ATGAAGGGCC TGCCGCTCGC CTATCAAAAG
GACATGCAGG AGGACAAGCA GGGCGCGATG GAGGGCTTCG CCGCGCTGTC GCTGGCGATC
CGGGCGATGA CCGGCATGGT CCGCGACATC GTCCCCGAGC AGGACCGGAT GCGAGCCGCG
GCCGGCGAAG GCTACGCCAC CGCCACCGAC CTGGCCGACT GGCTGGTGCG GACGCTGAAG
ATGCCGTTCC GCGACGCCCA TCACGTCACC GGAAAGATCG TCGGCCTCGC CGCCAAGGCC
GGCGTCGCGC TGCACGAGCT GCCGCTGAAG GAGATGCAGG CGGTCGAGCC GAAGATCAGC
CGCGACGCGC TGGCGGTGCT GAGCGTCGAA TCCTCGGTGA AAAGCCGGAC CTCCTATGGC
GGCACCGCGC CGAAGAACGT CCGGGCCCAG GCCAAGGCCT GGCTCAAACG ACTGGAAAAA
GAGCAAAAGT TGGGCTGA
 
Protein sequence
MSNKMWGGRF TDRPDAIMEE INVSIDVDRH LYAQDIAASK AHAAMLAAQG IITANDAKNI 
GKGLDTILSE IGAGKFTFKR ALEDIHMNVE SRLAELIGPA AGRLHTARSR NDQVATDFRL
YVRDVLDETD AALASFQRAL VERALEHAET VMPGFTHLQT AQPVTFGHHL MAYVEMAARD
RGRFQDARKR LNESPLGAAA LAGTSFPIDR HATAKKLGFD RPMANSLDAV SDRDFVLETL
SAASICAVHL SRFAEEIVIW TSPLVGLVRL SDKFTTGSSI MPQKRNPDAA ELVRAKTGRV
IGALNGLLIV MKGLPLAYQK DMQEDKQGAM EGFAALSLAI RAMTGMVRDI VPEQDRMRAA
AGEGYATATD LADWLVRTLK MPFRDAHHVT GKIVGLAAKA GVALHELPLK EMQAVEPKIS
RDALAVLSVE SSVKSRTSYG GTAPKNVRAQ AKAWLKRLEK EQKLG