Gene Information |
Locus tag | RPC_3835 |
Symbol | carB |
ID | 3969448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4264510 |
End bp | 4267974 |
Gene Length | 3465 bp |
Protein Length | 1154 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926946 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_533688 |
Protein GI | 90425318 |
COG category | [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
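The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4264510
END_BP = 4267974
GENE_LENGTH_BP = 3465
PROTEIN_LENGTH_AA = 1154


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid-encoding codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks should agree with the record if the assumptions hold.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length are consistent with the coordinates.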
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.863015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.842487 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAC GTACTGATAT ATCGACGATT CTCATCATCG GCGCCGGCCC GATCGTGATC GGCCAGGCCT GCGAATTCGA CTATTCCGGC ACCCAGGCGG TGAAGACGCT GAAGGACGAG GGCTACCGCA TCGTCCTGGT CAATTCCAAT CCGGCCACCA TCATGACCGA CCCGGAATTG GCGGACGCGA CCTACATCGA GCCGATCACC CCGGAGATCG TCGCCAAGAT CATCGAGAAG GAACGCCACG TCATTCCCGG CGGCTTCGCG CTGTTGCCGA CGATGGGCGG GCAGACCGCG CTGAACTGCG CGCTGAGCCT GCGCAAGCAG GGCACGCTGG AGAAATTCGA CGTCGAGATG ATCGGCGCCA CCGCCGACGC CATCGACAAG GCCGAGGACC GCCAGCTGTT CCGCGAGGCG ATGACCAAGA TCGGGCTGGA GACGCCGAAG TCGCGGCTCG CCAACGCCAC CGCTTTGAAG AAGGCCTATC GCGAGAAATA TCTCGAAGAC CGCGCCAAAC TGTCCGGCGA GGAGCTGGTC GCCTTCGAAC GGCAATGGGT GCTCGGCGAA AGCGATCGCC GCAAGCGCTA CCAGGAACAG GCGCTCGGTT CGGCCCTGAT GGCGTTGTCC GAGATCGGCC TGCCGGCGAT CATCCGGCCG TCGTTCACCA TGGGCGGCAC CGGCGGCGGC ATCGCCTACA ACAAGGAAGA ATTCCTCGAC ATCATCGAGC GCGGGCTCGA CGCCTCGCCG ACCAACGAAG TCTTGATCGA AGAATCCGTG CTCGGCTGGA AAGAGTACGA GATGGAGGTG GTGCGCGACA AGAACGACAA CTGCATCATC GTCTGCTCGA TCGAGAACCT CGACCCGATG GGCGTGCACA CCGGCGATTC GATCACGGTG GCCCCCGCGC TGACGCTGAC GGATAAAGAG TATCAGATCA TGCGCGACGC CTCATTGGCG GTGCTGCGCG AGATCGGCGT CGAGACCGGC GGCTCCAACG TGCAGTTCGG CGTCAATCCG GCCGACGGCC GCATGGTCGT GATCGAAATG AATCCGCGGG TGTCGCGCTC CTCGGCGCTG GCCTCGAAGG CCACCGGCTT TCCGATCGCC AAGGTCGCCG CCAAGCTGGC GGTCGGCTAC ACGCTCGATG AGATCGCCAA CGACATCACC GGCGGCGCCA CGCCGGCGTC GTTCGAGCCG ACCATCGACT ACGTGGTCAC CAAGATTCCG CGCTTTGCGT TCGAGAAATT CCCCGGCGCC AGCCACACCT TGACCACCTC GATGAAGTCG GTCGGCGAAG TGATGGCGAT CGGTCGCACC TTCCAGGAAA GCTTGCAGAA GGCGCTGCGC GGGCTGGAAA CCGGCCTCAC CGGGCTCGAC GAGATCGAGA TCGAAGGATT GGGCCGCAGC GACGACAAGA ACGCGATCCG CGCCGCGCTC GGCACGCCGA CCCCCGACCG GCTGCTGCAG GTCGCGCAGG CGATGCGGCT CGGCTGGACC GACGAGGAGA TCTTCACTTC CTGCAAGATC GATCCCTGGT TCCTGTCGGA ATTGCGCGGC ATCGTCGAGA TGGAAGCCAA GGTCAAGGCC AGCGGCCTGC CCGGCAACGC CTTCGGCATG CGCACGCTGA AGGCGATGGG CTTCTCCGAC GCAAGGCTCG CGGTGCTGGC CAAGACCACC GAAGCCGACG TCAAAGCCCA GCGCCACGCG CTCGGCGTCC GTCCGGTGTT CAAGCGGATC GACACCTGCG CGGCGGAATT CGCCTCGCCC ACCGCCTATA TGTATTCGAC CTACGAGTCG 
CCGTTCGCCG GCCCCCCCGC CGACGAAAGC GCGCCGTCCG ACAAGAAGAA GGTGATCATC CTCGGCGGCG GTCCGAACCG CATCGGCCAA GGCATCGAGT TCGATTACTG CTGCTGCCAC GCCTGCTTCG CGCTGCACGA CGCCGGCTAT GAATCGATCA TGATCAACTG CAACCCGGAA ACCGTGTCGA CCGACTACGA CACCGCGGAC CGGCTGTATT TCGAGCCGTT GACCGCCGAG GACGTGCTGG AGATCATCGA CACCGAGAAG CGCAACGGCA CGCTGCACGG CGTCATCGTG CAGTTCGGCG GCCAGACCCC GCTGAAGCTG GCGCGGGCGT TGGAAGCCGC CGACGTGCCG ATCCTCGGCA CTTCGCCGGA CGCCATCGAT CTGGCCGAAG ACCGCGACCG CTTCAAGCGC GTGCTCGACA AGTTGAAGCT GAAGCAGCCG AAGAACGGCA TCGCCTATTC GGTCGAGCAG GCGCGCCTGG TCGCCGCCGA ACTCGGCCTG CCGCTGGTGG TGCGCCCGTC CTATGTGCTC GGCGGCCGCG CCATGCAGAT CATCCGCGAG GAATCCCAGC TCGGCGATTA CCTGCTCGGC ACCCTGCCCG AACTGGTGCC CGCCGACGTC AAAGCCCGCT ATCCGAACGA TAAGACCGGG CAGATCAACA CCGTGCTCGG CACCAATCCC TTGTTGTTCG ACCGCTATCT GTCCGACGCC ATCGAGATCG ACGTCGACTG CCTGTGCGAC GGCAAGGATA CTTTCATCGT CGGCATCATG GAGCACATCG AGGAGGCCGG CATTCACTCC GGCGACTCGG CCTGCTCGCT GCCGCCGCAT TCGCTCGACG CCCCGATGAT CGCCGAACTG GAGCGGCAGA CCCGCGACAT GGCGCTCGGC CTCGACGTCG TCGGGCTGAT GAACGTGCAA TTCGCCATCA AGGACGGCGA CATCTACGTG CTCGAGGTCA ACCCGCGGGC GTCGCGCACC GTGCCGTTCG TCGCCAAAGT GGTGGGCATT CCGGTCGCCA AGATCGCCGC GCGGCTGATG GCCGGCGAAA AGCTGGTCGA TTTCAAGCTC GCCAAGCGCA AGCTCGACCA TGTCGGCGTC AAGGAATCGG TGTTCCCGTT CGCCCGCTTC CCCGGCGTCG ACACCGTGCT CGGCCCGGAG ATGCGCTCCA CCGGCGAGGT CATGGGGATC GACCGCTCGT TCGAAATCGC CTTCGCCAAG AGCCAGCTCG GCGGCGGCAC CCGCGTGCCG CGCAAGGGCA CCGTGTTCGT CTCGGTGCGC GAGGGCGACA AGACCCGGAT TCTCGACGCC GTGAAGCTGT TGTGCTCGCT CGGCTTCAAG GTGCTGGCGA CCTCCGGCAC CCAGCGCTTC CTCGCCGACC ACGGCGTCCC CGCGGAAAAG GTCAACAAGG TGTTGGAGGG CCGGCCGCAT ATCGTCGACG CCATCACCAA TGGCGAGATC CAGCTGGTGT TCAACACCAC CGAAGGCCCG CAGGCGTTGG CCGACAGCCG GTCGCTGCGT CGCGCTGCCC TCTTGCATAA AGTTCCGTAT TACACCACTC TTTCGGGCGC GGTCGCCGCC GCGCAGGGCA TCCGGGCCTA CCTTGGTGGG GACCTTGAGG TCCGTACCCT GCAGAGTTAC TTTTCCGACA CCTGA
|
Protein sequence | MPKRTDISTI LIIGAGPIVI GQACEFDYSG TQAVKTLKDE GYRIVLVNSN PATIMTDPEL ADATYIEPIT PEIVAKIIEK ERHVIPGGFA LLPTMGGQTA LNCALSLRKQ GTLEKFDVEM IGATADAIDK AEDRQLFREA MTKIGLETPK SRLANATALK KAYREKYLED RAKLSGEELV AFERQWVLGE SDRRKRYQEQ ALGSALMALS EIGLPAIIRP SFTMGGTGGG IAYNKEEFLD IIERGLDASP TNEVLIEESV LGWKEYEMEV VRDKNDNCII VCSIENLDPM GVHTGDSITV APALTLTDKE YQIMRDASLA VLREIGVETG GSNVQFGVNP ADGRMVVIEM NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDEIANDIT GGATPASFEP TIDYVVTKIP RFAFEKFPGA SHTLTTSMKS VGEVMAIGRT FQESLQKALR GLETGLTGLD EIEIEGLGRS DDKNAIRAAL GTPTPDRLLQ VAQAMRLGWT DEEIFTSCKI DPWFLSELRG IVEMEAKVKA SGLPGNAFGM RTLKAMGFSD ARLAVLAKTT EADVKAQRHA LGVRPVFKRI DTCAAEFASP TAYMYSTYES PFAGPPADES APSDKKKVII LGGGPNRIGQ GIEFDYCCCH ACFALHDAGY ESIMINCNPE TVSTDYDTAD RLYFEPLTAE DVLEIIDTEK RNGTLHGVIV QFGGQTPLKL ARALEAADVP ILGTSPDAID LAEDRDRFKR VLDKLKLKQP KNGIAYSVEQ ARLVAAELGL PLVVRPSYVL GGRAMQIIRE ESQLGDYLLG TLPELVPADV KARYPNDKTG QINTVLGTNP LLFDRYLSDA IEIDVDCLCD GKDTFIVGIM EHIEEAGIHS GDSACSLPPH SLDAPMIAEL ERQTRDMALG LDVVGLMNVQ FAIKDGDIYV LEVNPRASRT VPFVAKVVGI PVAKIAARLM AGEKLVDFKL AKRKLDHVGV KESVFPFARF PGVDTVLGPE MRSTGEVMGI DRSFEIAFAK SQLGGGTRVP RKGTVFVSVR EGDKTRILDA VKLLCSLGFK VLATSGTQRF LADHGVPAEK VNKVLEGRPH IVDAITNGEI QLVFNTTEGP QALADSRSLR RAALLHKVPY YTTLSGAVAA AQGIRAYLGG DLEVRTLQSY FSDT
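The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon-to-amino-acid mapping from the conventional TCAG ordering (translation table 11 assigns the same amino acids to codons as the standard code; it differs in permitted start codons, which this sketch ignores). The helper `translate` and the constant names are hypothetical. Only the first 30 and last 18 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Fragments copied from the gene sequence in the record.
FIRST_30_NT = "ATGCCAAAAC GTACTGATAT ATCGACGATT"
LAST_18_NT = "TACTTTTCCGACACCTGA"

print(translate(FIRST_30_NT))  # MPKRTDISTI
print(translate(LAST_18_NT))   # YFSDT*
```

The first fragment reproduces the opening residues of the protein sequence, and the second reproduces its final residues followed by the stop codon.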