Gene P9303_13841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_13841 
SymbolcarB 
ID4778188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1179952 
End bp1183260 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content50% 
IMG OID640086893 
Productcarbamoyl phosphate synthase large subunit 
Protein accessionYP_001017396 
Protein GI124023089 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.19323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGTA GGTCTGATCT GCGTCGCATC CTGCTGCTGG GGTCAGGTCC GATCGTGATT 
GGCCAGGCCT GTGAATTTGA TTATTCCGGC ACTCAAGCAT GCAAGGCCTT GCGGGATGAG
GGTTTTGAGG TGGTGTTGGT GAATTCCAAT CCGGCCTCAA TCATGACGGA TCCGGAGATG
GCTGATCGCA CCTATATCGA GCCGCTCACA CCCGAGGTAG TGGCTCGTGT CATTGAGTTA
GAGCGCCCTG ATGCGTTGCT GCCAACCATG GGCGGTCAGA CGGCTCTTAA TTTGGCGGTG
GCATTGGCTG AAAATCACAC ACTCGAACGT TTTGGCGTTG AGTTGATCGG TGCAGATCTA
GCTGCAATCC GAAAGGCCGA AGATCGCCAG CTGTTTAAGC AGGCGATGGA GCGTATAGGA
GTCGACGTTT GTCCTTCAGG GATTGCTTCC AGCCTCAAGG AGGCAGAAGA GGTTGGTGAA
GTGATTAATA GTTTTCCGCG CATCATCCGA CCGGCCTTCA CTCTTGGTGG CAGTGGTGGT
GGCATTGCTT ACAACCCAGA AGAATTTGCA GCGATCTGCA AAAGCGGTCT TGATGCCAGC
CCGGTTTCTC AGATCTTGAT CGAAAAATCT TTGTTGGGTT GGAAGGAGTT CGAATTAGAG
GTGATGCGTG ATTTGGCGGA TAACGTTGTG ATCGTCTGTA GCATTGAAAA TCTCGATGCA
ATGGGTGTTC ATACAGGAGA TTCAATCACT GTTGCTCCTG CGCAAACTCT CACCGATCGT
GAGTATCAGA GGCTTAGAGA TCAATCCATT GCCATTATTC GTGAGATAGG TGTTGCTACG
GGGGGTAGCA ATATTCAGTT CGCGATCAAT CCAGATGATG GCGAGGTGGT TGTTATTGAA
ATGAATCCGC GCGTTAGTCG ATCATCAGCG CTGGCAAGTA AGGCCACGGG TTTTCCAATT
GCCAAGATTG CTGCTCGTCT TGCTGTTGGT TATACCCTTG ACGAAATTAT TAACGACATC
ACGGGTAAGA CACCAGCCTG TTTTGAACCC ACAATTGATT ATGTCGTCAC GAAGATACCA
CGATTTGCAT TTGAGAAGTT TAAGGGTAGC CCTGCTGTTC TCTCTACGGC AATGAAGTCT
GTGGGAGAGG CAATGGCGAT CGGCCGCTGT TTCGAAGAGT CTTTTCAGAA AGCGATGCGT
TCTCTGGAAA TTGGGAGGGC TGGTTGGGGT TGTGATCGGC AGGAGCCAGA ATTGACTCCT
ACAGAAATTG AGCGTTTGTT GCGCACACCT TCTCCTGAGC GGATCATGGC AGTTAGAACA
GCGATGTTGG CCGGACGCAG TGACCAAGAT ATTTACGCTT TGAGCAAAAT TGATATTTGG
TTTTTAGCCA AACTACGCCA TTTGATAAAC ATTGAGACAA CGCATATGCA TGGACGTAAA
TTAGATGAAC TGGATCAGGA ATGTCTTCTC TCATTAAAGC AGCTTGGCTA TTCCGATCGT
CAGATTGCTT GGGCGACAGG TTCTGAGGAG CTCGCTGTAC GTGCTCGTCG TGAACTGTTA
AATATCAAAC CTGTTTTTAA GACAGTCGAT ACTTGCGCTG CCGAATTTTC ATCTACTACT
CCATATCATT ACTCAACTTA TGAGCGCCCC CTGTACCGTC TTGATTCTGA TGACCAATTG
TATAAACTTG AGCCGGAAAG CGAGGTCGTT GCAGATGAGC GATCAAAAGT GATGATTCTT
GGCGGCGGTC CTAATCGCAT CGGCCAAGGA ATTGAGTTTG ACTATTGCTG TTGCCATGCT
TCATTTGCTG CTCAGGACAA GGGATTTGTC ACTGTGATGG TTAACAGTAA TCCCGAAACT
GTCTCTACCG ATTATGATAC TAGTGACAGA CTTTATTTTG AACCTCTTAC TCTTGAGGAT
GTACTCAATG TGATCGAGGC TGAACGGCCT AATGGTGTCA TCGTTCAGTT TGGTGGTCAG
ACCCCACTCA AGTTGGCGAT TCCCTTGCTT CGTTGGCTAG AGAGTTCGAT CGGAAAGTCA
ACAGGCACAC GTATATGGGG TACATCACCA GAATCAATTG ATCAGGCTGA AGATCGTGAA
CAGTTCGAGG CGATCCTGCG GCAGCTTCAA ATTCGCCAGC CTCGTAATGG CTTAGCTCGC
AGTGAAGAAG ATGCCCGAGC CGTAGCACTT CGCATTGGTT ATCCCGTTGT TGTTCGTCCC
TCTTATGTGC TTGGAGGCCG AGCCATGGAG GTGGTGTTTG ATGAACAAGA GTTGAATCGC
TATATGGCTG AAGCGGTTCA GGTCGAACCC GATCATCCAG TGCTTATCGA TCAGTATCTG
GAAAATGCTG TTGAAGTTGA TGTTGATGCT CTCTGTGACT CTGATGGAGT TGTAGTGATT
GGCGGTTTGA TGGAACACAT TGAGCCTGCT GGAATTCATT CTGGTGATTC CGCTTGCTGC
CTTCCCTCCA TCTCTCTTGG TGAGCAAGCC CTTCGCACGA TTCGTCTCTG GAGTGAAGCG
TTGGCCCTTG CGCTCAAAGT TCAGGGGTTG ATCAATCTGC AGTTCGCTGT CCAGCGGGAT
GAAGCGGGTG ATGAGCAAGT GTTCATCATC GAGGCCAATC CTCGTGCTTC TCGTACCGTT
CCTTTTGTGG CTAAAGCCAC AGGAGTACCC CTGGCCAGGA TCGCAACAAG GATCATGGCA
GGAGAGACGT TGGCGGCTGT TGGACTTACG CATGAGCCTC AACCGCCTCT TCAGTCTGTC
AAGGAAGCCG TATTGCCATT CCGTCGTTTC CCTGGGGCCG ATTCGGTGCT TGGCCCGGAG
ATGCGGTCCA CTGGAGAAGT GATGGGTTCA GCCTCCAGTT TTGGCATGGC GTTTGCCAAG
TCTGAAATTG CAGCTGGTGA TCCCCTGCCA ATTCGTGGAA CTGTCTTTCT CTCTACTCAT
GACCGGGATA AGCCAGCGCT ACTACTTGTC GCTGAAAGGC TGATTGAACT TGGCTTTGAT
CTCACGGCCA CCTCCGGAAC TGCGCAAGCC CTTAATCAGG CTGGATTGAA CGTTCAGCCA
GTACTCAAGG TTCATGAGGG ACGTCCCAAT ATCGAAGATT TGATTCGCTC TGGTCAGATT
CAGTTGGTGA TCAACACACC TATTGGCCGT CAGGCCGCTC ATGACGACAA ATATTTGCGA
CGAGCAGCTC TTGATTACTC CGTGCCTACC CTTACGACGA TTGCTGGAGC ACGAGCTGCT
GTCGAGGGAA TCACGGCACT CCAGCAACAG TCACTATCCG TAGCGGCCCT TCAAGACATC
CATGTTTGA
 
Protein sequence
MPRRSDLRRI LLLGSGPIVI GQACEFDYSG TQACKALRDE GFEVVLVNSN PASIMTDPEM 
ADRTYIEPLT PEVVARVIEL ERPDALLPTM GGQTALNLAV ALAENHTLER FGVELIGADL
AAIRKAEDRQ LFKQAMERIG VDVCPSGIAS SLKEAEEVGE VINSFPRIIR PAFTLGGSGG
GIAYNPEEFA AICKSGLDAS PVSQILIEKS LLGWKEFELE VMRDLADNVV IVCSIENLDA
MGVHTGDSIT VAPAQTLTDR EYQRLRDQSI AIIREIGVAT GGSNIQFAIN PDDGEVVVIE
MNPRVSRSSA LASKATGFPI AKIAARLAVG YTLDEIINDI TGKTPACFEP TIDYVVTKIP
RFAFEKFKGS PAVLSTAMKS VGEAMAIGRC FEESFQKAMR SLEIGRAGWG CDRQEPELTP
TEIERLLRTP SPERIMAVRT AMLAGRSDQD IYALSKIDIW FLAKLRHLIN IETTHMHGRK
LDELDQECLL SLKQLGYSDR QIAWATGSEE LAVRARRELL NIKPVFKTVD TCAAEFSSTT
PYHYSTYERP LYRLDSDDQL YKLEPESEVV ADERSKVMIL GGGPNRIGQG IEFDYCCCHA
SFAAQDKGFV TVMVNSNPET VSTDYDTSDR LYFEPLTLED VLNVIEAERP NGVIVQFGGQ
TPLKLAIPLL RWLESSIGKS TGTRIWGTSP ESIDQAEDRE QFEAILRQLQ IRQPRNGLAR
SEEDARAVAL RIGYPVVVRP SYVLGGRAME VVFDEQELNR YMAEAVQVEP DHPVLIDQYL
ENAVEVDVDA LCDSDGVVVI GGLMEHIEPA GIHSGDSACC LPSISLGEQA LRTIRLWSEA
LALALKVQGL INLQFAVQRD EAGDEQVFII EANPRASRTV PFVAKATGVP LARIATRIMA
GETLAAVGLT HEPQPPLQSV KEAVLPFRRF PGADSVLGPE MRSTGEVMGS ASSFGMAFAK
SEIAAGDPLP IRGTVFLSTH DRDKPALLLV AERLIELGFD LTATSGTAQA LNQAGLNVQP
VLKVHEGRPN IEDLIRSGQI QLVINTPIGR QAAHDDKYLR RAALDYSVPT LTTIAGARAA
VEGITALQQQ SLSVAALQDI HV