Gene Rsph17025_0361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0361 
SymbolcarB 
ID5082197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp355681 
End bp359010 
Gene Length3330 bp 
Protein Length1109 aa 
Translation table11 
GC content67% 
IMG OID640481913 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001166572 
Protein GI146276413 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.175688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.763709 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAA GAACCGATAT CAGCTCGATC ATGATCATCG GGGCCGGGCC CATCGTCATC 
GGGCAGGCCT GCGAGTTCGA CTATTCCGGC GCCCAGGCCT GCAAGGCCCT GCGCGAAGAG
GGCTATCGGG TCATCCTGGT GAACTCGAAC CCGGCGACGA TCATGACCGA TCCGGGTCTC
GCCGACGCCA CCTATATCGA GCCGATCACG CCGGAAGTCG TCGCCAAGAT CATCGAGAAG
GAGCGCCCGG ACGCGCTTCT GCCCACGATG GGCGGTCAGA CCGGCCTGAA CACGGCGCTC
GCGCTCGCCG ACATGGGCGT GCTCGAGACG TTCGGCGTGC AGCTGATCGG CGCCAACCGC
GAGGCCATCG AGATGGCCGA GGACCGCAAG CTCTTCCGCG AGGCGATGGA CCGGATCGGG
CTGGAAAACC CCAAGGCCAC CATCATCGCC GCCCCGAAGC TGCCCAACGG GCGCTATGAC
ATCAACGCCG GCGTGGCCGA GGCGATGGCC GCCATCGAAT ATGTGGGCCT GCCGGCGATC
ATCCGCCCCG CCTTCACGCT CGGCGGCACC GGCGGCGGCG TGGCCTACAA CCGCGACGAT
TACGAGGCGA TCTGCCGCTC GGGCCTTGAC GCGTCGCCGG TGGCGCAGAT CCTCGTCGAT
GAAAGCCTGC TCGGCTGGAA GGAATACGAG ATGGAGGTGG TCCGCGACCG TGCGGACAAC
GCCATCATCG TCTGCTCGAT CGAGAACGTG GACCCGATGG GCGTGCATAC GGGCGACTCG
ATCACCGTGG CACCCGCGCT GACGCTGACC GACAAGGAAT ACCAGATCAT GCGCAACGGC
TCGATCGCCG TGCTGCGCGA GATCGGGGTC GAGACGGGCG GGTCGAACGT GCAATGGGCG
ATCAACCCGG TGGACGGCCG CATGGTGGTG ATCGAGATGA ACCCGCGCGT GTCGCGCTCC
TCGGCGCTGG CGTCCAAGGC CACGGGCTTC CCCATCGCCA AGATCGCGGC GAAGCTCGCC
GTGGGCTACA CGCTCGATGA ACTCGACAAC GACATCACCA AGGTCACGCC CGCCTCGTTC
GAGCCCTCGA TCGACTATGT CGTGACCAAG ATCCCGCGCT TCGCCTTCGA GAAGTTTCCC
GGGTCGAAGC CCGAACTCAC CACCGCGATG AAGTCGGTGG GCGAGGTCAT GGCCATCGGC
CGGACCTTCC ACGAATCGAT GCAGAAGGCG CTGGCCTCGC TGGAAACCGG CCTTTCGGGC
TTCGACGAGA TCGAGATCCC CGGCGCGCCG GACAAGGCTG CGATCATCAA GGCGATCTCG
GCCCAGACCC CCGACCGGCT GCGCCTGATC GCGCAGGCGA TGCGCCACGG GCTGTCGGAT
GACGAGATCC AGGCCGCCAC CGCCTTCGAC CCCTGGTTCC TCGCCCGCAT CCGCGAAATC
ATCGACGCCG AGGCGAAGAT CCGCGCGAAC GGGCTGCCGC TGGCCGAGGA GCCGCTGCGC
AAACTGAAGA TGATGGGCTT TACCGACGCC CGGCTGGCAA AGCTGACGGG CCGCGAGGAA
GGCCAGGTGC GCCGCGCCCG CCGCAACCTC GGCGTGAAGG CGGTCTTCAA GCGCATCGAC
ACCTGCGCGG CCGAGTTCGA GGCCCAGACC CCCTACATGT ATTCCACCTA CGAGGCCCCC
GCGATGGGCG ACGTGGAATG CGAGGCGCGC CCCTCGGCGG CGAAGAAGGT GGTGATCCTC
GGCGGGGGGC CGAACCGGAT CGGGCAGGGC ATCGAGTTCG ACTACTGCTG CTGCCACGCC
TGCTTCGCGC TGACCGCGGC CGGTTACGAA ACCATCATGA TCAACTGCAA CCCCGAGACG
GTCTCGACCG ACTACGACAC CTCGGACCGG CTCTATTTCG AGCCGCTGAC GCTGGAACAT
GTGCTGGAAA TCCTGCGGGT GGAGCAGGAG AACGGCACGC TGCACGGCGT GATCGTGCAG
TTCGGCGGGC AGACGCCGCT GAAACTCGCT CAGGCGCTGG CGCATGAGGG CATCCCGATC
CTCGGCACCA CGCCCGATGC GATCGACCTT GCCGAGGACC GCGAGCGCTT CCAGAAGCTG
CTGAACGATC TGGGCCTGAA GCAGCCGATC AACGGCATGG CGCGGTCGCG CGACGAGGCC
TTCGCCATCG CCGGCCGCAT CGGCTATCCG CTGGTGATCC GGCCCTCCTA TGTGCTGGGC
GGCCGCGCCA TGGAGATCGT GCGCGACGAC GCCCAGCTTG AACGCTACAT CCGCGAGGCG
GTGCAGGTCT CGGGCACCTC GCCCGTGCTG CTCGACAGCT ATCTCGCCGG CGCCATCGAG
GTCGATGTGG ACGCGCTCTG CGATGGCGAG AACGTCCATG TGGCCGGTAT CATGGAGCAT
ATCGAGGAAG CGGGCGTCCA TTCCGGCGAT TCGGCCTGCT GCCTGCCGCC GCATTCGCTC
TCGGCCGAGA CCATCGAGGA ACTGAAGCGC CAGACCGTCG AGATGGCCCG CGCGCTGCAT
GTGGTGGGCC TGATGAACGT GCAGTTCGCG ATCAAGGACG GGGTGATCTT CGTTCTGGAA
GTGAACCCGC GCGCCTCGCG CACCGTGCCC TTCGTGGCCA AGGCCACCGA CAGCGCCATC
GCCTCCATCG CGGCACGGCT GATGGCGGGC GAGCCGCTGA GCGCCTTCCC GCTGCGCGAG
CCCTACCCGG CGGGCGTCGG CCCCGACACC GACCTGCCGC TGGCCGATCC GCTGACGCTG
GCCGATCCGA TCACGCCCTG GTTCTCGGTG AAGGAATCGG TGCTGCCCTT CGCCCGCTTC
CCCGGCGTGG ACCCGCTCCT CGGCCCCGAG ATGCGCTCGA CCGGCGAGGT GATGGGCTGG
GACCGCAACT TCGCGCTCGC GTTCCTCAAG GCGCAGATGG GCGCCGGCAC ACATCTGCCC
GAGGGCGGCC GCGTCTTCCT GTCGGTCAAG GATCAGGACA AGACCGATGC GCTGGCCAAG
GCCGCCGCCG GGCTGACCGC GATGGGCTTC GAGATCGTGG CGACGAAAGG CACCGCCGCC
TGGCTGAGCG CGCAGGGCAT CGAGGCGACC TCGGTCAACA AGGTCTATGA GGGCCGCCCG
AACATCGTGG ACCGGCTGAA GAACGGCGAC ATCACGCTGG TGATGAACAC GACCGAGGGG
ACGCAGGCGA TCAGCGACAG CCGCGACATC CGCCGCGTCG CGCTGATGGA CAAGATCCCC
TACTTCACCA CCGCCGCCGC AAGCATCGCC GCCGTCGAGG CGATGCAGGC CCGCGGCGAA
GGCTACGGGG TGCGCACCCT TCAGGGCTGA
 
Protein sequence
MPKRTDISSI MIIGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPGL 
ADATYIEPIT PEVVAKIIEK ERPDALLPTM GGQTGLNTAL ALADMGVLET FGVQLIGANR
EAIEMAEDRK LFREAMDRIG LENPKATIIA APKLPNGRYD INAGVAEAMA AIEYVGLPAI
IRPAFTLGGT GGGVAYNRDD YEAICRSGLD ASPVAQILVD ESLLGWKEYE MEVVRDRADN
AIIVCSIENV DPMGVHTGDS ITVAPALTLT DKEYQIMRNG SIAVLREIGV ETGGSNVQWA
INPVDGRMVV IEMNPRVSRS SALASKATGF PIAKIAAKLA VGYTLDELDN DITKVTPASF
EPSIDYVVTK IPRFAFEKFP GSKPELTTAM KSVGEVMAIG RTFHESMQKA LASLETGLSG
FDEIEIPGAP DKAAIIKAIS AQTPDRLRLI AQAMRHGLSD DEIQAATAFD PWFLARIREI
IDAEAKIRAN GLPLAEEPLR KLKMMGFTDA RLAKLTGREE GQVRRARRNL GVKAVFKRID
TCAAEFEAQT PYMYSTYEAP AMGDVECEAR PSAAKKVVIL GGGPNRIGQG IEFDYCCCHA
CFALTAAGYE TIMINCNPET VSTDYDTSDR LYFEPLTLEH VLEILRVEQE NGTLHGVIVQ
FGGQTPLKLA QALAHEGIPI LGTTPDAIDL AEDRERFQKL LNDLGLKQPI NGMARSRDEA
FAIAGRIGYP LVIRPSYVLG GRAMEIVRDD AQLERYIREA VQVSGTSPVL LDSYLAGAIE
VDVDALCDGE NVHVAGIMEH IEEAGVHSGD SACCLPPHSL SAETIEELKR QTVEMARALH
VVGLMNVQFA IKDGVIFVLE VNPRASRTVP FVAKATDSAI ASIAARLMAG EPLSAFPLRE
PYPAGVGPDT DLPLADPLTL ADPITPWFSV KESVLPFARF PGVDPLLGPE MRSTGEVMGW
DRNFALAFLK AQMGAGTHLP EGGRVFLSVK DQDKTDALAK AAAGLTAMGF EIVATKGTAA
WLSAQGIEAT SVNKVYEGRP NIVDRLKNGD ITLVMNTTEG TQAISDSRDI RRVALMDKIP
YFTTAAASIA AVEAMQARGE GYGVRTLQG