Gene RPC_3200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3200 
Symbol 
ID3971998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3541578 
End bp3543572 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content69% 
IMG OID637926310 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_533061 
Protein GI90424691 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0237038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.361734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCG CTCCCGCTTT GCAGTTGACG CCGTTCTCCA GCATTCTGAT CGCCAACCGC 
GGTGAGATCG CGCTGCGCAT CATGCGCACC GCGCGAAAAC TCGGCTATGC GGTGGTCGCG
GTGTATTCCG ACGCCGACGC CGACGCGTTG CACGTCCGTG CCGCCGATGC GGCGGTCCGT
ATCGGCGGCG CGTTGCCGCA GCAATCCTAT CTGAACATCG AAGCGATCAT CGCCGCGGCG
AAGCAGAGCG GCGCCGATGC GGTGCATCCC GGCTACGGCT TCCTCGCCGA AAACCAGGAC
TTCGCCGCGG CCTGCCGCGA CGCCGGCCTG CTGTTCATCG GGCCCTCGGC GGCGTCGATC
GCGGCGATGG CCAACAAGGC GGGGGCCAAG GCGATCATGC GCAAGGCCGG CGTGCCGTGC
GTGCCGGGCT ATCAGGGCGC CGATCAGAGC GACGAAGCGA TGGCGCGTGA GGCGGCTTCG
ATCGGCTTTC CGGTGATGAT CAAGGCGGTC GCCGGCGGCG GCGGTCGCGG CATGCGGCTG
GTCGAAGATG CCGCGGCGTT TGCCGAAGCG TTGCGCGGCG CGCGGTCGGA GGCGCAGGGC
GGGTTCGGCG ATCCCACGGT GATCCTGGAG CGCGCGATCG TGAATCCGCG GCACATCGAG
ATTCAGGTGT TCGGCGATCG CTACGGCCAC GCCGTCCATC TCGGCGAGCG CGACTGCTCG
GTGCAGCGCC GGCACCAGAA GCTGATCGAG GAGGCGCCGT CTCCCGCGGT GTCGCAGGCG
TTGCGCGAGG AGATGGGCAC GGTCGCCGTC AACGCGGCGA AGGCGATCGG CTACGAGGGC
GCCGGCACGC TGGAATTCCT GCTCGATGGC GATGGGCGGT TCTATTTTAT GGAGATGAAT
ACGCGGCTGC AGGTCGAGCA TCCGGTCACC GAGGCGATCA CCGGGCTCGA TCTGGTGGAG
CTGCAATTGC GGATCGCCGC CGGCGAGCCG CTGCCGTTGC AGCAGGACGA TGTCCAGTTC
CGCGGCCACG CCATCGAGGT GCGGCTGTGT TCGGAAGACC CCGCGCAGGA TTTCATGCCG
CAATCCGGCA GGCTCGCGCT GTGGCAGATG CCGGGCGAAT TGCGGGTCGA GCACGCGCTG
CAGTCGGGCT CGGCGATTTC GCCATATTAC GATTCGATGA TCGCCAAGAT CGTCGCCCAC
GGCGACAGCC GCGACGAAGC GCGGCGAAAA CTGCTGCACG GCCTCGCGCG GACCGTCGCG
TTCGGCGTCT CCACGAATTG CAGCTTCCTC GCCGCCTGTC TGCGGCATCC TTCGTTTGCG
GCCGGCGCGG CGACCACCGG GTTCATCGCC CAGCATCGCG ATGCGTTGTT GGCGCCCGAG
CCGGACCACG ATGTGCCGGC CGCCGCTGTG GCGGCGCTGT TGCTGTACGT CACCGATCGC
CACGCGCCGC GCTGGCGCAA GGGCCGCTCG TTGGCGGCGA CGTTCCCGTG CCCGCTGCGG
ATCGAGATCG ACGGCATCAC CCTCGAGCCC GAGGTGACGC GCGAGCGCGA CGGCTGCACC
ATCGTGCGCA TCGGCGGCAA CGCCGTGCGC TTCGCGCTCG ACGAGCTGAG CGGCGACACC
TTGCGCTTCC GTGCCGATGG GATGTCGGAA TCCATCGTCT ATCACCGCGA CGGCGACGCG
CTGTTTGTGC TGCGGCAGGG CGTCACGTCT GCGGTCCGCG ACCTGACCCG CGCCGCGCCG
GCGCGCGCCG CCGCGGCCGG CGGCGACGGG CGGCTGCGCG CGGCGATGAA CGGCCGCGTC
GTCGCGGTGC TGGTCAAGCC CGGCGACCAG GTCGAGGCCG GGCAGCCGGT GCTGACGCTG
GAAGCGATGA AGATGGAGCA CGTGCACGCC GCTCCGATTT CGGGCATCGT TTCGGCGATC
GATGTCGAGG AAGGCGAGCA GGTCACAACC GGCCGCATCG TCGCCGAGAT CGAGGCGATG
TCGCAACAAG CGTAA
 
Protein sequence
MSVAPALQLT PFSSILIANR GEIALRIMRT ARKLGYAVVA VYSDADADAL HVRAADAAVR 
IGGALPQQSY LNIEAIIAAA KQSGADAVHP GYGFLAENQD FAAACRDAGL LFIGPSAASI
AAMANKAGAK AIMRKAGVPC VPGYQGADQS DEAMAREAAS IGFPVMIKAV AGGGGRGMRL
VEDAAAFAEA LRGARSEAQG GFGDPTVILE RAIVNPRHIE IQVFGDRYGH AVHLGERDCS
VQRRHQKLIE EAPSPAVSQA LREEMGTVAV NAAKAIGYEG AGTLEFLLDG DGRFYFMEMN
TRLQVEHPVT EAITGLDLVE LQLRIAAGEP LPLQQDDVQF RGHAIEVRLC SEDPAQDFMP
QSGRLALWQM PGELRVEHAL QSGSAISPYY DSMIAKIVAH GDSRDEARRK LLHGLARTVA
FGVSTNCSFL AACLRHPSFA AGAATTGFIA QHRDALLAPE PDHDVPAAAV AALLLYVTDR
HAPRWRKGRS LAATFPCPLR IEIDGITLEP EVTRERDGCT IVRIGGNAVR FALDELSGDT
LRFRADGMSE SIVYHRDGDA LFVLRQGVTS AVRDLTRAAP ARAAAAGGDG RLRAAMNGRV
VAVLVKPGDQ VEAGQPVLTL EAMKMEHVHA APISGIVSAI DVEEGEQVTT GRIVAEIEAM
SQQA