Gene RPD_0602 details


Gene Information

Locus tag: RPD_0602
Symbol:
ID: 4021071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 679926
End bp: 683234
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 67%
IMG OID: 637960790
Product: carbamoyl-phosphate synthase L chain, ATP-binding
Protein accession: YP_567741
Protein GI: 91975082
COG category: [C] Energy production and conversion; [I] Lipid transport and metabolism
COG ID: [COG1038] Pyruvate carboxylase; [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit; [COG4799] Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta)
TIGRFAM ID:
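
The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information table.
start_bp = 679926
end_bp = 683234
gene_length_bp = 3309
protein_length_aa = 1102

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                     # span matches Gene Length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks hold for this record: 683234 - 679926 + 1 = 3309 bp, and 3309 / 3 - 1 = 1102 aa.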


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTTC GCAAATTGCT GATCGCCAAT CGGGGCGAGA TCGCGATCCG CATCGCGCGC 
GCCGCCGCCG ACGCCGGGCT CGCCACCGTG GCGATTTATC CCGCCGACGA TTCGGCGTCG
CTGCATCTGC GGATCGCCGA CGAGGCGCGG GAGATTCCCG GGCGCGGCGC GCGGGCCTAT
CTCGACATCG AAGGCGTGAT CGCCGCCGCC AAGGCGGCGC ATTGCGACGC CCTGCATCCG
GGCTACGGCT TTCTCAGCGA GAATGCCGAT CTGGCGCGGC GCTGCGCCGA AGAAGGCATC
ACCTTCGTCG GGCCATCGCC GGAGGCGTTG CGATTGTTCG GCGACAAGGT CGCGGCCAAG
GAGTTGGCGA AGCGCTGCGG CGTCCCGATC ATTGCCGGCA CCACGGGGCC TTCGACACTG
CAGGACGTCA AGGCGTTCTT CGCATCGCTC GGCGACGCCG CCGCGGTGAT GATCAAGGCG
ATGGCCGGCG GCGGCGGTCG CGGTATGCGC ATCGTCGAAC GCGCGGACGA TCTGGAAGAG
GCCTATGCGC GTTGCCAGTC CGAAGCCAAG GCGGCATTCG GCAGCGACGG GGTCTACGCC
GAGCGGCTGA TCCGCAAGGC TCGGCATATC GAAGTGCAGA TCATCGGCGA CCGGCATGGC
GGCATCAGCC AGCTCTGGGA GCGCGAATGC ACGATCCAGC GCCGCAATCA GAAGCTGATC
GAGATCGCGC CGAGCCCGTC GCTCAGCGAC AGCCTGCGCG GGCGGATCGT CGCGGCCGCG
AAGGCGCTCG CGGCGGCGGC GAACTACGAC AGCCTCGGCA CCTTCGAATT TCTGGTCGAT
GGCGAAGCAG GCGAAAGCGA CGGCGCGTTC GCCTTCATCG AAGCCAATCC GCGGCTGCAG
GTCGAGCACA CCGTGACCGA GGAAGTGCTG TCGATCGATC TGGTGCGATC TCAGCTTGCT
GTGGCGGGCG GCGCGACGCT GGCCTCGCTC GGCCTCGATC AGGCTGCCGT GCCGCGCCCG
CGCGGCTTTG CGATGCAGCT CCGCATCAAC ATGGAAACCA TGGACGAAAG CGGCGCTACC
AAACCGACCG GCGGCACGCT GGCGATCTTC GAGCCGCCGT CGGGCCCGGG CGTGCGCGTC
GATACGTTCG GTTATGCCGG CTACAAAACC AGCGCGGCGT TCGACTCTTT GCTCGCCAAG
GTGATCGTTC ACACCCCGTC GCAATGGCCG GATGTCGTCG CCAAGGCCGC GCGCGCCTTG
CGCGAATTCC GGATCGATGG CGTCGGAACT AATATCCCGT TCATCCAGGC GATCCTGGCC
CATGCCGACT TCAAGGCCAA CCGGGTCAGT ACCGGCTTCA TCGACCGCAA CGTCGCCGAG
CTGGTCGGCG CTGCGCAAGG CTTCGCGGGT CCGCTGATCG CCACGCCCGG CGCGACAGCG
CACCAAGGCG CTGCATCGGC GAAGATCGAC TCGGCGCCGG ACGGCGCGGT CGCGATCACC
GCACCGCTGC AGGGGACGGT GGTTGCGATC ACGGTCGCGG AAGGTGATGT GGTTCGGCCG
GGGCAGCAGC TCGCGGTGCT CGAATCGATG AAGATGGAGC ATCTCGTCAT CGCCGAGCAG
GGCGGCCGCA TCCGGCGCAT CGTCACCGCC GACGGCGTGA CGCTGATGCA GGGCGAGGCG
ATCCTCTATC TCGAGCCGCA GGACATCGAA GGCGATCAGC TCGCCGAGGA AGACGAGGTC
GATCTCGACG AGATTCGTCC GGATCTTGCC GAGATGCTGG CGCGACAGGG CAACACCAGC
GACAACAGCC GCCCCGACGC GGTCGAGCGC CGCCGCAAGA CCAATCAGCG CACCGCGCGC
GAAAACATCG CCCAGCTCGT CGATGACGGC TCGTTCATGG AGTATGGCAG CCTGGCGATC
GCGGCGCAGC GCCGCCGCCG GTCGCTCGAC GACCTGATCA AGAACACGCC GGCCGACGGC
CTGATCGCCG GCGTCGCCAC CGTCAATGCC GCGCAGTTCG GTGAACACGG CGCCCGCTGC
ATGGTGATCG CCTACGACTA CACGGTGCTG GCCGGCACCC AGGGCCACAT GAATCACAAG
AAGATCGACC GGATGCTGAC GCTGGTCGAG CAGTGGCGGA TTCCGCTGGT GTTCTACGCC
GAGGGCGGCG GCGGCCGTCC CGGCGACACC GACCGGCTCG GTCTCACCGG TCTCGACGGC
CCGTCCTTCG TGCAGTTCGC CAGGCTTTCG GGCCTGGTGC CGGTGGTCGG CGTCGTCTCC
GGCTATTGCT TCGCCGGCAA TGCCGCGATG CTCGGCTGCT GCGACGTGAT CATCGCCACG
CAGAACGCCT CGATCGGGAT GGGCGGGCCG GCGATGATCG AGGGCGGCGG GCTCGGCGTG
TATCACCCCG CCGAAGTCGG TCCCGTCTCG TTCCAGTCGC CGAACGGCGT GGTCGACATC
CTGGTCGAGG ACGAGGAGGA GGCGACGCGG GTCGCGCAGA AATATCTGTC GTACTTCCAG
GGCGCGGTGT CCGAATGGCG CGCCAGCGAT CAGCGGCTGC TGCGGCGTGC GATCCCGGAG
AACCGGCTGC GGGTCTACGA TATCCGTCAC GTCATCGATC TGATCGCCGA CGAAGGATCG
GTGCTCGAGC TGCGCCGCGA TTTCGGCGTC GGGATGATCA CCGCGTTCAT TCGCGTCGAA
GGCAAGCCGT TCGGCTTGAT CGCCAACAAT CCGAAGCATC TCGGCGGCGC GATCGACGCC
GACGCCGGCG ACAAGGCGGC GCGCTTTCTG CAATTGTGCG ACGCCTTCGA CATTCCGATC
GTGTCGCTGT GCGATACGCC CGGTTTCATG GTCGGGCCGG AGGCGGAGAA GACCGCGATC
GTCCGCCATG TCGCGCGGAT GTTCGTCACC GGCGCCAGCC TGACCGTGCC GCTGTTCGGC
ATCGTGCTGC GCAAGGGCTA CGGCCTCGGC GCGCAATCGA TGCTCGGCGG CGGTTTCCAC
GCCTCGTTCT TCACCGCGGC GTGGCCGACC GGCGAGTTCG GCGGCATGGG TCTGGAAGGC
TATGTCCGCC TCGGCTTCCG CAAGGAGATG GAGGCGATCG CCGATCCGGT CGAGCGCGAG
ACGTACTACA AGAACAAGGT CGCCGAGATG TATGCCAACG GCAAGGCGGT CTCGATCGCG
TCGGTGCTCG AAATCGACAA TGTCATCGAT CCGGCGGAGA CGCGGCGCTG GATCATGGCC
GGCCTGCGCT CGGTGCCGAC GCCGCCTCAA CGCGACGGGC GCAAGCGGCC CTGCATCGAC
GCCTGGTAG
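
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The sketch below shows one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling, and the example uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustration only: the first two 10-base blocks of the gene sequence.
fragment = clean_sequence("ATGCCGTTTC GCAAATTGCT")
fragment_length = len(fragment)
fragment_gc = gc_fraction(fragment)
```

Pasting the full sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.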
 
Protein sequence
MPFRKLLIAN RGEIAIRIAR AAADAGLATV AIYPADDSAS LHLRIADEAR EIPGRGARAY 
LDIEGVIAAA KAAHCDALHP GYGFLSENAD LARRCAEEGI TFVGPSPEAL RLFGDKVAAK
ELAKRCGVPI IAGTTGPSTL QDVKAFFASL GDAAAVMIKA MAGGGGRGMR IVERADDLEE
AYARCQSEAK AAFGSDGVYA ERLIRKARHI EVQIIGDRHG GISQLWEREC TIQRRNQKLI
EIAPSPSLSD SLRGRIVAAA KALAAAANYD SLGTFEFLVD GEAGESDGAF AFIEANPRLQ
VEHTVTEEVL SIDLVRSQLA VAGGATLASL GLDQAAVPRP RGFAMQLRIN METMDESGAT
KPTGGTLAIF EPPSGPGVRV DTFGYAGYKT SAAFDSLLAK VIVHTPSQWP DVVAKAARAL
REFRIDGVGT NIPFIQAILA HADFKANRVS TGFIDRNVAE LVGAAQGFAG PLIATPGATA
HQGAASAKID SAPDGAVAIT APLQGTVVAI TVAEGDVVRP GQQLAVLESM KMEHLVIAEQ
GGRIRRIVTA DGVTLMQGEA ILYLEPQDIE GDQLAEEDEV DLDEIRPDLA EMLARQGNTS
DNSRPDAVER RRKTNQRTAR ENIAQLVDDG SFMEYGSLAI AAQRRRRSLD DLIKNTPADG
LIAGVATVNA AQFGEHGARC MVIAYDYTVL AGTQGHMNHK KIDRMLTLVE QWRIPLVFYA
EGGGGRPGDT DRLGLTGLDG PSFVQFARLS GLVPVVGVVS GYCFAGNAAM LGCCDVIIAT
QNASIGMGGP AMIEGGGLGV YHPAEVGPVS FQSPNGVVDI LVEDEEEATR VAQKYLSYFQ
GAVSEWRASD QRLLRRAIPE NRLRVYDIRH VIDLIADEGS VLELRRDFGV GMITAFIRVE
GKPFGLIANN PKHLGGAIDA DAGDKAARFL QLCDAFDIPI VSLCDTPGFM VGPEAEKTAI
VRHVARMFVT GASLTVPLFG IVLRKGYGLG AQSMLGGGFH ASFFTAAWPT GEFGGMGLEG
YVRLGFRKEM EAIADPVERE TYYKNKVAEM YANGKAVSIA SVLEIDNVID PAETRRWIMA
GLRSVPTPPQ RDGRKRPCID AW
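
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal Python illustration of that mapping, not the database's own method. It builds the codon table from the NCBI-style amino acid string for table 11 (bases ordered T, C, A, G), and the function name `translate` is hypothetical. It stops at the first stop codon and does not handle the alternative start codons of table 11, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: first base varies slowest, bases in T, C, A, G order.
BASES = "TCAG"
# Amino acids for translation table 11 in that codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are ignored.
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence and its last nine bases.
print(translate("ATGCCGTTTC GCAAATTGCT"))  # MPFRKL
print(translate("GCCTGGTAG"))              # AW
```

The two outputs match the start (MPFRKL) and end (AW) of the protein sequence listed above.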