Gene Rsph17029_2475 details


Gene Information

Locus tag: Rsph17029_2475
Symbol: carB
ID: 4897495
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2607418
End bp: 2610747
Gene Length: 3330 bp
Protein Length: 1109 aa
Translation table: 11
GC content: 68%
IMG OID: 640113073
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_001044349
Protein GI: 126463235
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
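
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2607418, 2610747)
print(length)                  # 3330
print(protein_length(length))  # 1109
```

Under these assumptions the values agree with the listed gene length (3330 bp) and protein length (1109 aa).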


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0559306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.209365
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAAA GAACCGATAT CAGCTCGATC ATGATCATCG GGGCGGGTCC CATCATCATC 
GGTCAGGCCT GCGAGTTCGA CTATTCCGGC GCTCAGGCCT GCAAGGCGCT GCGCGAAGAG
GGCTACCGGG TCATCCTCGT GAACTCGAAC CCGGCCACGA TCATGACCGA CCCGGGTCTG
GCGGACGCCA CCTACATCGA GCCGATCACC CCCGAGGTCG TGGCCAAGAT CATCGAGAAG
GAGCGCCCCG ACGCGCTTCT GCCCACGATG GGCGGGCAGA CCGGCCTCAA CACCGCGCTC
GCGCTGGCCG ACATGGGCGT CCTCGAGAAA TTCGGCGTCC AGCTCATCGG CGCGAACCGC
GAGGCCATCG AGATGGCCGA GGACCGCAAG CTGTTCCGCG AGGCGATGGA CCGGATCGGG
CTCGAGAACC CCAAGGCCAC CATCATCGCC GCCCCGAAGC TGGAAAACGG CCGCTACGAC
ATCAATGCGG GCGTGGCCGA GGCGATGGCC GCCATCGAAT ATGTGGGCCT GCCCGCGATC
ATCCGCCCCG CCTTCACGCT GGGCGGCACC GGCGGCGGCG TGGCCTACAA CCGCGACGAT
TACGAGGCCA TCTGCCGCTC GGGGCTCGAT GCCTCGCCGG TGGCGCAGAT CCTCGTCGAC
GAAAGCCTGC TCGGCTGGAA GGAATATGAG ATGGAGGTGG TCCGCGACCG CGCGGACAAT
GCCATCATCG TCTGTTCCAT CGAGAACGTG GACCCGATGG GCGTCCATAC CGGCGACTCG
ATCACCGTGG CGCCGGCGCT GACGCTGACC GACAAGGAAT ATCAGATCAT GCGCAACGGC
TCGATTGCCG TGCTGCGCGA GATCGGCGTC GAGACCGGCG GGTCGAACGT GCAATGGGCG
ATCAACCCCG CGGACGGCCG GATGGTCGTG ATCGAGATGA ACCCGCGCGT CTCGCGCTCG
TCCGCGCTGG CCTCCAAGGC CACTGGCTTC CCCATCGCCA AGATCGCGGC GAAGCTCGCC
GTGGGCTACA CGCTCGACGA GCTCGACAAC GACATCACCA AGGTCACGCC CGCCTCGTTC
GAGCCGTCCA TCGACTATGT CGTGACCAAG ATCCCGCGCT TCGCCTTCGA GAAGTTCCCC
GGCTCGAAGC CAGAACTGAC CACCGCGATG AAGTCGGTGG GCGAGGTCAT GGCCATCGGC
CGCACCTTCC ACGAATCGAT GCAGAAGGCG CTGGCCTCGC TCGAGACCGG CCTCTCGGGC
TTCGACGAGA TCGAGATCCC CGGCGCCCCC GACAAGGCCG CGGTCATCAA GGCCATCTCG
GCCCAGACGC CCGACCGGCT GCGGCTGATC GCGCAGGCGA TGCGGCACGG GCTGACCGAG
GACGAGATCC AGGCCGCGAC GGCCTTCGAT CCGTGGTTCC TCGCCCGCAT CCGCGAGATC
GTCGAGGCCG AGGCCGAGAT CCGCGCCAAG GGCCTGCCCG TCACCGAGGC CGCGCTGCGC
AGGCTGAAGA TGATGGGCTT CACCGACGCG CGTCTGGCCA AGCTCACCGG CCGCGACGAG
GGTCAGGTGC GCCGCGCGCG CCGGAACCTC GGGGTGAAGG CGGTCTTCAA GCGCATCGAC
ACCTGCGCGG CCGAGTTCGA GGCCCAGACC CCCTACATGT ATTCCACCTA CGAGGCCCCC
GCGATGGGCG ACGTGGAATG CGAGGCCCGG CCCTCGGGCG CGAAGAAGGT GGTGATCCTC
GGCGGCGGCC CGAACCGGAT CGGTCAGGGC ATCGAGTTCG ACTATTGCTG CTGCCATGCC
TGCTTCGCAC TGACCGCGGC AGGCTATGAA ACCATCATGA TCAACTGCAA CCCCGAGACC
GTGTCGACCG ACTACGACAC CTCGGACCGG CTCTATTTCG AGCCGCTGAC GCTCGAACAT
GTGCTGGAAA TCCTGCGCGT CGAGCAGGAG AACGGCACTC TTCACGGCGT GATCGTGCAG
TTCGGCGGCC AGACGCCGCT CAAGCTCGCG CAGGCGCTGG CGGCCGAGGG GATCCCGATC
CTCGGCACCA CGCCCGACGC CATCGACCTC GCCGAAGACC GCGAGCGGTT CCAGCAGCTC
CTTCACAAGC TGGACCTGAA GCAGCCGCAC AACGGCATGG CGCGGAGCCG CGACGAGGCC
TTCCGCATCG CGGGCGAGAT CGGCTACCCG CTGGTGATCC GGCCCTCCTA TGTGCTCGGC
GGCCGCGCGA TGGAGATCGT GCGCGACGAC GCCCAGCTCG AACGCTACAT CCGCGAGGCG
GTGCAGGTCT CGGGCACCTC GCCCGTGCTG CTCGACAGCT ATCTCTCGGG CGCCATCGAG
GTGGATGTGG ATGCGCTCTG CGACGGCGAG AACGTGCATG TCGCGGGGAT CATGGAACAT
ATCGAGGAGG CGGGGGTCCA TTCGGGCGAC TCCGCCTGCT GCCTGCCGCC CCATTCGCTC
TCGGCCGAGA CCATCGCCGA ACTGAAGCGC CAGACGGTCG AGATGGCCCG CGCGCTGCAT
GTGGTGGGGC TGATGAACGT GCAGTTCGCG ATCAAGGAAG GGGTGATCTT CGTCCTCGAG
GTGAACCCGC GCGCCTCGCG GACGGTGCCC TTCGTGGCCA AGGCCACCGA CAGCGCCATT
GCGTCCATCG CGGCGCGGCT GATGGCGGGC GAGCCGCTCT CGGCCTTCCC GGTGCGCGCG
CCCTATCCGG CGGGCGTGGG CCCCGACACC GACCTGCCGC TGGCCGATCC GCTGACGCTC
GCCGATCCGA TCACGCCCTG GTTCTCGGTC AAGGAATCGG TGCTGCCCTT CGCCCGCTTC
CCCGGCGTGG ACCCGCTCCT CGGCCCCGAG ATGCGCTCGA CGGGCGAGGT GATGGGCTGG
GACCGCAGCT TCGCGCTGGC CTTCCTCAAG GCGCAGATGG GCGCCGGCAC GCATCTGCCC
GAGAGCGGGC GCGTGTTCCT GTCGGTCAAG GATGCCGACA AGACCGCGGC GCTGGCCAAG
GCCGCGGCCG GGCTCACCGC GATGGGCTTC GAGATCGTGG CGACGAAGGG CACCGCCGCC
TGGCTCACCG GGCAGGGGAT CGCCTCGACC TCGGTCAACA AGGTCTACGA GGGTCGGCCG
AACATCGTCG ACCGGCTGAA GAACGGCGAC ATCACGCTGG TGATGAACAC GACCGAGGGC
GCGCAGGCGA TCTCGGACAG CCGCGACATC CGCCGCGTGG CGCTGATGGA CAAGATCCCC
TACTTCACCA CCGCCGCCGC CTCCATCGCC GCCGTCGAGG CGATGCAGGC CCGCGGCGAG
GGCTACGGGG TGCGCACCCT CCAAGGCTGA
 
Protein sequence
MPKRTDISSI MIIGAGPIII GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPGL 
ADATYIEPIT PEVVAKIIEK ERPDALLPTM GGQTGLNTAL ALADMGVLEK FGVQLIGANR
EAIEMAEDRK LFREAMDRIG LENPKATIIA APKLENGRYD INAGVAEAMA AIEYVGLPAI
IRPAFTLGGT GGGVAYNRDD YEAICRSGLD ASPVAQILVD ESLLGWKEYE MEVVRDRADN
AIIVCSIENV DPMGVHTGDS ITVAPALTLT DKEYQIMRNG SIAVLREIGV ETGGSNVQWA
INPADGRMVV IEMNPRVSRS SALASKATGF PIAKIAAKLA VGYTLDELDN DITKVTPASF
EPSIDYVVTK IPRFAFEKFP GSKPELTTAM KSVGEVMAIG RTFHESMQKA LASLETGLSG
FDEIEIPGAP DKAAVIKAIS AQTPDRLRLI AQAMRHGLTE DEIQAATAFD PWFLARIREI
VEAEAEIRAK GLPVTEAALR RLKMMGFTDA RLAKLTGRDE GQVRRARRNL GVKAVFKRID
TCAAEFEAQT PYMYSTYEAP AMGDVECEAR PSGAKKVVIL GGGPNRIGQG IEFDYCCCHA
CFALTAAGYE TIMINCNPET VSTDYDTSDR LYFEPLTLEH VLEILRVEQE NGTLHGVIVQ
FGGQTPLKLA QALAAEGIPI LGTTPDAIDL AEDRERFQQL LHKLDLKQPH NGMARSRDEA
FRIAGEIGYP LVIRPSYVLG GRAMEIVRDD AQLERYIREA VQVSGTSPVL LDSYLSGAIE
VDVDALCDGE NVHVAGIMEH IEEAGVHSGD SACCLPPHSL SAETIAELKR QTVEMARALH
VVGLMNVQFA IKEGVIFVLE VNPRASRTVP FVAKATDSAI ASIAARLMAG EPLSAFPVRA
PYPAGVGPDT DLPLADPLTL ADPITPWFSV KESVLPFARF PGVDPLLGPE MRSTGEVMGW
DRSFALAFLK AQMGAGTHLP ESGRVFLSVK DADKTAALAK AAAGLTAMGF EIVATKGTAA
WLTGQGIAST SVNKVYEGRP NIVDRLKNGD ITLVMNTTEG AQAISDSRDI RRVALMDKIP
YFTTAAASIA AVEAMQARGE GYGVRTLQG
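
The sequence blocks above are printed in groups of 10 characters, 60 per row, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of that clean-up, together with a simple GC-fraction calculation of the kind that could be compared against the GC content listed in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCCGAAAA GAACCGATAT CAGCTCGATC ATGATCATCG GGGCGGGTCC CATCATCATC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"GC fraction of first row: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence block would give the length and GC fraction for the whole gene rather than for a single row.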