Gene RPB_1516 details


Gene Information

Locus tag: RPB_1516
Symbol: carB
ID: 3908429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1707864
End bp: 1711193
Gene Length: 3330 bp
Protein Length: 1109 aa
Translation table: 11
GC content: 65%
IMG OID: 637883411
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_485137
Protein GI: 86748641
COG category: [E] Amino acid transport and metabolism
              [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
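
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the unspliced CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1707864
end_bp = 1711193

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS where every codon encodes one
# residue except the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3330
print(protein_length)  # 1109
```

Both results match the Gene Length (3330 bp) and Protein Length (1109 aa) fields in the record.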


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAAC GAACAGACAT CTCCACCATC CTCATCATCG GCGCCGGTCC CATCGTGATC 
GGCCAAGCCT GTGAATTCGA CTATTCCGGC ACCCAGGCGG TCAAGGCGCT GAAGCAGGAA
GGCTACCGGG TCGTCCTGGT CAATTCCAAT CCGGCCACGA TCATGACCGA TCCGGAACTG
GCGGACGCGA CCTATATCGA GCCGATCACA CCCGAGATCG TCGCCAAGAT CGTCGAGAAG
GAACGCTACG TCATCCCCGG CGGCTTCGCG TTGCTGCCGA CCATGGGCGG CCAGACCGCG
CTGAATTGCG CCCTGTCGCT TCGCAAGCTC GGCACACTGG AGAAATTCGA CGTCGAGATG
ATCGGCGCCA CCGCCGACGC CATCGACAAG GCCGAGGACC GCGAGCGCTT CCGCGAGGCG
ATGACCAAGA TCGGCCTCGA GACGCCGCGC TCGCGCCAGG TCAAAAATCT TCCCGACGCG
CTGCGCGCGC TCGACGAGAT CGGCTTCCCG GCGCTGATCC GGCCGTCCTT CACGATGGGC
GGCACCGGCG GCGGCATCGC CTACACCAAG GCGGAATTCA TCGAGATCGT CGAGAGCGGC
ATCGATGCGT CGCCGACCAG CGAAGTGCTG ATCGAGGAAT CCATCCTCGG CTGGAAAGAA
TACGAGATGG AGGTTGTCCG CGACAAGAAG GACAACTGCA TCATCGTCTG CTCGATCGAA
AATCTCGATC CGATGGGCGT GCACACCGGC GATTCGATCA CGGTCGCGCC GGCGCTGACG
CTCACCGACA AGGAATACCA GATCATGCGC GACGCCTCGC TGGCGGTGCT GCGCGAGATC
GGCGTCGAGA CCGGCGGCTC GAACGTGCAG TTCGCGGTCA ATCCGGACGA CGGCCGGCTG
GTCGTGATCG AAATGAATCC GCGGGTGTCG CGCTCCTCGG CGCTGGCCTC GAAGGCGACC
GGCTTCCCGA TCGCCAAGGT CGCGGCCAAG CTCGCGGTCG GCTACACGCT CGACGAAATC
GCCAACGACA TCACCGGCGG CGCCACGCCG GCGGCATTCG AGCCGACCAT CGACTACGTG
GTCACCAAGA TTCCGCGCTT CGCCTTCGAG AAATTCCCCG GCGCCTCGCA CAATCTGACC
ACCTCGATGA AGTCGGTCGG CGAGGTGATG GCGATCGGCC GCACCTTCCA GGAAAGCCTG
CAGAAGGCGC TGCGCGGCCT CGAAAGCGGC CTCACCGGAC TCGACGAAAT CGAGATCGAC
GGCCTCGGCC GCGGCGACGA CAAGAACGCC ATTCGCGCGG CCCTCGGCAC CCCGACGCCG
AGCCGGCTGC TGCAGGTCGC CCAGGCGATG CGGCTCGGCT GGACCGACGA CGAGATCTTC
AACTCCTGCA AGATCGATCC GTGGTTCCTG GCGCAATTGC GCGGCATCGT CGAGATGGAG
AACAAGGTCC GCGCCAGCGG CCTGCCGGAC CACGCGTTCG GGATGCGGAC ACTGAAGGCG
ATGGGCTTTT CCGACGCCCG CCTGTCGGTG CTGGCCGGCA CTACCGAAGC CGAGGTGAAA
GCCAAGCGCC GCGCGCTCGG CGTGCGGCCC GTGTTCAAGC GGATCGATAC CTGTGCGGCG
GAGTTCGCCT CGCCGACCGC CTACATGTAT TCGACCTACG AATCGCCGTT CGCCGGTCCG
GCGTCGGACG AAAGCCACCC CTCGGCCAAG GACAAGGTGA TCATTCTCGG CGGCGGCCCG
AACCGGATCG GCCAGGGTAT CGAGTTCGAT TATTGCTGCT GTCACGCCTG CTTCGCGCTG
CACGACGCCG GCTATGAATC AATCATGGTC AACTGCAACC CGGAAACCGT GTCGACCGAC
TACGACACCG CGGACCGGCT GTATTTCGAG CCGCTGACTG CCGAGGACGT GCTGGAGATC
ATCGACACCG AACGCAGCAA CGGTACGCTG AAAGGCGTCA TCGTGCAGTT CGGCGGCCAG
ACCCCGCTCA AGCTGGCGCG GGCGCTGGAA GCTGCCGACG TGCCGATCCT CGGCACCTCG
CCGGACGCGA TCGACCTCGC CGAGGACCGC GACCGCTTCA AGCGCATCCT CGACAAACTT
CGTCTCAAGC AGCCGAAGAA CGGCATCGCC TATTCGGTCG AGCAGGCGCG CCTCGTCGCC
GCCGAACTCC GCCTGCCGCT GGTGGTGCGC CCGTCCTACG TGCTGGGCGG CCGCGCGATG
CAGATCATCC GCGAGGACAA TCAGCTCAGC GACTATCTGC TCGGCACCTT GCCGGAGCTG
GTGCCGGCCG ACGTCAAGGC GCGTTATCCG AACGACAAGA CCGGCCAGAT CAACACCGTG
CTCGGTACCA ACCCGCTGCT GTTCGACCGC TATCTGTCGG ACGCCATCGA GATCGATGTC
GATTGCCTGT GCGACGGCAA GGATACTTTC ATCGTCGGAA TCATGGAGCA CATCGAGGAA
GCCGGCATCC ACTCCGGCGA CTCGGCCTGC TCGCTGCCGC CGCATTCGCT CGATGCGCCG
ATGATCGCCG AGCTCGAGCG ACAGACCCGC GAGCTCGCGC TCGGGCTCGA CGTGGTCGGG
CTGATGAACG TGCAGTTCGC CATCAAGGAC GGCGAGATCT ACGTGCTCGA AGTCAATCCG
CGCGCCTCGC GCACGGTGCC GTTCGTCGCC AAGGTGATCG GCATGCCGGT GGCCAAGCTC
GCCGCCCGGA TCATGGCCGG CGAGAAGATC GCCGACCTCG GTCTGAAGAA GCGCAAGCTC
GACCATGTCG GCGTCAAGGA GTCGGTATTT CCGTTCGCGC GCTTCCCCGG CGTCGACACC
GTGCTCGGCC CGGAGATGCG CTCGACCGGC GAGGTTATGG GGCTCGATCG CTCGTTCGAG
ATCGCCTTCG CCAAGAGCCA GCTCGGCGGC GGCACGCGGG TGCCGCGCAA GGGCACGGTT
TTCGTGTCGG TCCGCGAAAG CGACAAGACC CGGATCGCCG TCGCGGTGAA GCTGCTGCAC
GAGGTCGGTT TCAAGGTGAT CGCCACCTCG GGCACCCAGC GCTATCTCAG CGACCACGGC
ATCCCGGCGG AGAAGATCAA CAAGGTGCTG GAAGGCCGAC CGCACATCGT CGACGCCATC
ATGAATGGCG AGGTGCAACT GGTGTTCAAC ACCACCGAGG GGCCGCAGGC GCTGGCCGAC
AGCCGGTCGT TGCGACGCGC TGCCCTCTTG CATAAGGTTC CGTATTACAC CACTCTTTCC
GGGGCCGTAG CCGCCGCCAA AGGAATCCGG GCCTATCTTG GCGGAGACCT TGAGGTTCGG
ACCCTGCAGA GCTACTTTTC CGAATCCTGA
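
The gene sequence is laid out in blocks of 10 bases, 60 per line, so it needs the whitespace stripped before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input. Running it on the full sequence should give a value comparable to the GC content field, though that has not been verified here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGCCCAAAC GAACAGACAT CTCCACCATC "
    "CTCATCATCG GCGCCGGTCC CATCGTGATC"
)

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # GC fraction of this 60-base sample only
```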
 
Protein sequence
MPKRTDISTI LIIGAGPIVI GQACEFDYSG TQAVKALKQE GYRVVLVNSN PATIMTDPEL 
ADATYIEPIT PEIVAKIVEK ERYVIPGGFA LLPTMGGQTA LNCALSLRKL GTLEKFDVEM
IGATADAIDK AEDRERFREA MTKIGLETPR SRQVKNLPDA LRALDEIGFP ALIRPSFTMG
GTGGGIAYTK AEFIEIVESG IDASPTSEVL IEESILGWKE YEMEVVRDKK DNCIIVCSIE
NLDPMGVHTG DSITVAPALT LTDKEYQIMR DASLAVLREI GVETGGSNVQ FAVNPDDGRL
VVIEMNPRVS RSSALASKAT GFPIAKVAAK LAVGYTLDEI ANDITGGATP AAFEPTIDYV
VTKIPRFAFE KFPGASHNLT TSMKSVGEVM AIGRTFQESL QKALRGLESG LTGLDEIEID
GLGRGDDKNA IRAALGTPTP SRLLQVAQAM RLGWTDDEIF NSCKIDPWFL AQLRGIVEME
NKVRASGLPD HAFGMRTLKA MGFSDARLSV LAGTTEAEVK AKRRALGVRP VFKRIDTCAA
EFASPTAYMY STYESPFAGP ASDESHPSAK DKVIILGGGP NRIGQGIEFD YCCCHACFAL
HDAGYESIMV NCNPETVSTD YDTADRLYFE PLTAEDVLEI IDTERSNGTL KGVIVQFGGQ
TPLKLARALE AADVPILGTS PDAIDLAEDR DRFKRILDKL RLKQPKNGIA YSVEQARLVA
AELRLPLVVR PSYVLGGRAM QIIREDNQLS DYLLGTLPEL VPADVKARYP NDKTGQINTV
LGTNPLLFDR YLSDAIEIDV DCLCDGKDTF IVGIMEHIEE AGIHSGDSAC SLPPHSLDAP
MIAELERQTR ELALGLDVVG LMNVQFAIKD GEIYVLEVNP RASRTVPFVA KVIGMPVAKL
AARIMAGEKI ADLGLKKRKL DHVGVKESVF PFARFPGVDT VLGPEMRSTG EVMGLDRSFE
IAFAKSQLGG GTRVPRKGTV FVSVRESDKT RIAVAVKLLH EVGFKVIATS GTQRYLSDHG
IPAEKINKVL EGRPHIVDAI MNGEVQLVFN TTEGPQALAD SRSLRRAALL HKVPYYTTLS
GAVAAAKGIR AYLGGDLEVR TLQSYFSES
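
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below builds a codon table from the conventional TCAG ordering. The amino acid assignments of translation table 11 are the same as those of the standard code, and the two tables differ only in permitted start codons, which this example does not model. The function name `translate` is hypothetical, and only the first 60 bases are used as input.

```python
# Codon table in TCAG order. The amino acid assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_60 = (
    "ATGCCCAAAC GAACAGACAT CTCCACCATC "
    "CTCATCATCG GCGCCGGTCC CATCGTGATC"
)

print(translate(first_60))  # MPKRTDISTILIIGAGPIVI
```

The output matches the first two blocks of the protein sequence above (MPKRTDISTI LIIGAGPIVI).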