Gene RPD_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1457 
SymbolcarB 
ID4021935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1626491 
End bp1629820 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content64% 
IMG OID637961650 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_568595 
Protein GI91975936 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAAA GAACCGACAT ATCCACGATT CTCATCATCG GGGCTGGTCC GATTGTGATC 
GGCCAGGCTT GTGAATTCGA CTATTCCGGC ACCCAGGCGG TCAAAGCGCT GAAGCAAGAG
GGCTACCGGA TCGTCCTGGT CAATTCCAAC CCGGCCACGA TCATGACCGA TCCGGAATTG
GCCGATGCGA CCTATATCGA GCCGATCACG CCCGAGATCG TCGCCAAGAT CATCGAGAAG
GAGCGCTACG CCATCCCCGG AGGCTTTGCG CTGCTGCCGA CGATGGGCGG CCAGACTGCG
CTGAATTGTG CACTCAGCCT GCGCAAGCTC GGCACGCTGG AGACATTCGA CGTCGAAATG
ATCGGCGCCA CCGCCGACGC CATCGACAAG GCGGAGGACC GCGAGCGGTT CCGCGAGGCC
ATGACCAAGA TCGGCCTCGA AACGCCCAGC TCCCGCCAGG TCAAGAACCT GCCCGACGCG
CTGCGCGCGC TCGACGAGAT CGGTTTCCCG GCGCTGATTC GGCCGTCCTT CACGATGGGC
GGCACCGGCG GCGGCATCGC CTACACCAAG GCGGAATTCA TCGAGATCGT GGAGAGCGGC
ATCGACGCCT CGCCCACCAG CGAGGTTCTG ATCGAGGAAT CCATCCTCGG TTGGAAAGAA
TACGAGATGG AGGTTGTCCG CGACAAAAAG GACAACTGCA TCATCGTCTG TTCGATCGAA
AACCTCGATC CGATGGGCGT CCACACCGGC GACTCGATCA CGGTCGCGCC GGCGCTGACG
CTGACCGACA AGGAATACCA GATCATGCGC GACGCCTCGC TGGCGGTGCT GCGCGAGATC
GGCGTCGAAA CCGGCGGCTC GAACGTGCAG TTCGCGGTCA ACCCGGATGA CGGCCGCCTG
GTCGTGATCG AAATGAATCC GCGGGTTTCG CGCTCCTCGG CGCTGGCCTC CAAGGCCACC
GGCTTCCCGA TCGCCAAGGT CGCCGCCAAG CTCGCGGTCG GCTACACCCT CGACGAGATC
GCCAACGACA TCACCGGCGG CGCGACGCCG GCGTCGTTCG AGCCGACCAT CGACTATGTC
GTCACCAAGA TCCCGCGCTT TGCGTTCGAG AAGTTCCCCG GCGCCTCGCA CAATCTGACC
ACCTCGATGA AGTCGGTCGG CGAGGTGATG GCGATCGGCC GCACCTTCCA GGAAAGCCTG
CAGAAGGCGT TGCGCGGCCT CGAAAGCGGG CTCACCGGCC TCGACGAGAT CGAGATCGAC
GGGCTCGGCC GCGGCGACGA CAAGAACGCG ATCCGCGCCG CGCTCGGCAC CCCGACGCCG
AGCCGGTTGC TGCAAGTCGC GCAGGCGATG CGGCTCGGCT GGACCGACGA GGAGATCTTC
AACTCCTGCA AGATCGATCC GTGGTTCCTG GCGCAGCTGC GCGGCATCGT CGAAATGGAA
AACAAGGTCC GCAGCAGCGG CCTGCCGGAT CACGCATTCG GGATGCGCAC GCTGAAGGCG
ATGGGGTTCT CCGACGCCCG CCTCGCAGTG CTCGCCAACA CCACCGAAGC CGAGGTCAAG
GCGAAACGCC GCGCGCTCGA CGTGCGGCCG GTGTTCAAGC GGATCGACAC TTGCGCTGCG
GAATTCGCCT CGCCGACCGC CTATATGTAC TCGACTTACG AGACTCCGTT CGCGGGACCG
CCCTCTGACG AGAGCCGGCC CTCGGCGAAG AACAAGGTGA TCATTCTCGG CGGCGGTCCG
AACCGGATCG GCCAGGGCAT CGAGTTCGAC TATTGCTGCT GTCATGCCTG CTTCGCACTG
CACGACGCCG GCTATGAATC GATCATGGTC AACTGCAACC CAGAAACCGT GTCGACCGAT
TACGACACCG CGGATCGGCT GTATTTCGAA CCGCTGACCG CCGAGGACGT GCTCGAGATC
ATCGATACCG AACGCAGCAA CGGCACGCTG CACGGCGTGA TCGTGCAGTT CGGCGGCCAG
ACGCCGCTGA AGCTCGCGCG CGCGCTGGAA GCGGCCGATG TGCCGATCCT GGGCACCTCG
CCCGACGCGA TCGACCTCGC CGAGGACCGC GACCGCTTCA AGCGGATTCT CGACAAGCTG
CGGCTGAAGC AGCCGAAGAA CGGCATCGCC TATTCGGTCG AGCAGGCGCG CCTGGTCGCC
GCCGAACTCG GCCTGCCGCT GGTGGTGCGC CCGTCTTACG TGTTGGGCGG CCGCGCGATG
CAGATCATCC GCGAGGACAA TCAGCTCAGC GACTATCTGC TCGGCACCCT GCCGGAGCTG
GTGCCGGGCG ACGTCAAGGC GCGCTATCCG AACGACAAGA CCGGCCAGAT CAACACGGTG
CTCGGCACCA ATCCGCTGCT GTTCGACCGC TATCTGTCGG ACGCGACCGA AATCGATGTC
GATTGCCTGT CCGACGGCAA GGATACTTTC ATCGTCGGAA TCATGGAGCA TATCGAGGAA
GCCGGCATTC ACTCCGGCGA CTCGGCCTGC TCGCTGCCGC CGCATTCGCT CGATGCGGCG
ATGATCGCCG AACTGGAACG CCAGACCCGC GAGCTTGCAC TCGGCCTTGA TGTGGTCGGC
CTGATGAACG TGCAATACGC CATCAAGGAC GGCGAGATCT ACGTGCTCGA GGTCAATCCG
CGCGCGTCGC GCACCGTGCC GTTCGTCGCC AAGGTGATCG GTATGCCGGT GGCGAAGCTC
GCCGCGCGGA TCATGGCCGG CGAGAAGATC GCCGACCTTG GCCTGAAGCG CCGCAAGCTC
GATCATGTCG GCGTCAAGGA ATCGGTGTTT CCGTTCGCGC GCTTCCCGGG CGTCGATACC
GTGCTCGGCC CGGAGATGCG CTCGACCGGC GAAGTCATGG GGCTCGACCG CTCGTTCGAG
ATCGCCTTCG CCAAGAGCCA GCTCGGCGGC GGCACGCGGG TGCCGCGCAA GGGAACGGTG
TTCGTTTCAG TTCGCGAAAG CGACAAGACC CGCATCGTCG ATGCAGTGAA GCTGCTACAC
GAGGCGGGCT TCAAGGTGAT CGCGACCTCG GGCACCCAAC GCTATCTCAG CGACCACGGC
GTCCCGGCCG AGAAGGTCAA CAAGGTTCTG GAAGGTCGTC CGCACATCGT CGACGCCATC
ATGAACGGCG AAGTGCAACT GGTCTTCAAT ACGACCGAAG GACCTCAGGC CCTGGCGGAC
AGCCGCTCGT TGCGACGTGC TGCCCTCTTG CATAAAGTAC CATATTACAC CACTCTTTCG
GGCGCTGTTG CCGCCGCGAA GGGAATCCGG GCCTATCTTG GCGGGGACCT TGAGGTTCGG
ACCTTGCAGA GCTACTTTTC CGAAACCTGA
 
Protein sequence
MPKRTDISTI LIIGAGPIVI GQACEFDYSG TQAVKALKQE GYRIVLVNSN PATIMTDPEL 
ADATYIEPIT PEIVAKIIEK ERYAIPGGFA LLPTMGGQTA LNCALSLRKL GTLETFDVEM
IGATADAIDK AEDRERFREA MTKIGLETPS SRQVKNLPDA LRALDEIGFP ALIRPSFTMG
GTGGGIAYTK AEFIEIVESG IDASPTSEVL IEESILGWKE YEMEVVRDKK DNCIIVCSIE
NLDPMGVHTG DSITVAPALT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAVNPDDGRL
VVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGYTLDEI ANDITGGATP ASFEPTIDYV
VTKIPRFAFE KFPGASHNLT TSMKSVGEVM AIGRTFQESL QKALRGLESG LTGLDEIEID
GLGRGDDKNA IRAALGTPTP SRLLQVAQAM RLGWTDEEIF NSCKIDPWFL AQLRGIVEME
NKVRSSGLPD HAFGMRTLKA MGFSDARLAV LANTTEAEVK AKRRALDVRP VFKRIDTCAA
EFASPTAYMY STYETPFAGP PSDESRPSAK NKVIILGGGP NRIGQGIEFD YCCCHACFAL
HDAGYESIMV NCNPETVSTD YDTADRLYFE PLTAEDVLEI IDTERSNGTL HGVIVQFGGQ
TPLKLARALE AADVPILGTS PDAIDLAEDR DRFKRILDKL RLKQPKNGIA YSVEQARLVA
AELGLPLVVR PSYVLGGRAM QIIREDNQLS DYLLGTLPEL VPGDVKARYP NDKTGQINTV
LGTNPLLFDR YLSDATEIDV DCLSDGKDTF IVGIMEHIEE AGIHSGDSAC SLPPHSLDAA
MIAELERQTR ELALGLDVVG LMNVQYAIKD GEIYVLEVNP RASRTVPFVA KVIGMPVAKL
AARIMAGEKI ADLGLKRRKL DHVGVKESVF PFARFPGVDT VLGPEMRSTG EVMGLDRSFE
IAFAKSQLGG GTRVPRKGTV FVSVRESDKT RIVDAVKLLH EAGFKVIATS GTQRYLSDHG
VPAEKVNKVL EGRPHIVDAI MNGEVQLVFN TTEGPQALAD SRSLRRAALL HKVPYYTTLS
GAVAAAKGIR AYLGGDLEVR TLQSYFSET