Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_13841 |
Symbol | carB |
ID | 4778188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1179952 |
End bp | 1183260 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640086893 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001017396 |
Protein GI | 124023089 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.19323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGTA GGTCTGATCT GCGTCGCATC CTGCTGCTGG GGTCAGGTCC GATCGTGATT GGCCAGGCCT GTGAATTTGA TTATTCCGGC ACTCAAGCAT GCAAGGCCTT GCGGGATGAG GGTTTTGAGG TGGTGTTGGT GAATTCCAAT CCGGCCTCAA TCATGACGGA TCCGGAGATG GCTGATCGCA CCTATATCGA GCCGCTCACA CCCGAGGTAG TGGCTCGTGT CATTGAGTTA GAGCGCCCTG ATGCGTTGCT GCCAACCATG GGCGGTCAGA CGGCTCTTAA TTTGGCGGTG GCATTGGCTG AAAATCACAC ACTCGAACGT TTTGGCGTTG AGTTGATCGG TGCAGATCTA GCTGCAATCC GAAAGGCCGA AGATCGCCAG CTGTTTAAGC AGGCGATGGA GCGTATAGGA GTCGACGTTT GTCCTTCAGG GATTGCTTCC AGCCTCAAGG AGGCAGAAGA GGTTGGTGAA GTGATTAATA GTTTTCCGCG CATCATCCGA CCGGCCTTCA CTCTTGGTGG CAGTGGTGGT GGCATTGCTT ACAACCCAGA AGAATTTGCA GCGATCTGCA AAAGCGGTCT TGATGCCAGC CCGGTTTCTC AGATCTTGAT CGAAAAATCT TTGTTGGGTT GGAAGGAGTT CGAATTAGAG GTGATGCGTG ATTTGGCGGA TAACGTTGTG ATCGTCTGTA GCATTGAAAA TCTCGATGCA ATGGGTGTTC ATACAGGAGA TTCAATCACT GTTGCTCCTG CGCAAACTCT CACCGATCGT GAGTATCAGA GGCTTAGAGA TCAATCCATT GCCATTATTC GTGAGATAGG TGTTGCTACG GGGGGTAGCA ATATTCAGTT CGCGATCAAT CCAGATGATG GCGAGGTGGT TGTTATTGAA ATGAATCCGC GCGTTAGTCG ATCATCAGCG CTGGCAAGTA AGGCCACGGG TTTTCCAATT GCCAAGATTG CTGCTCGTCT TGCTGTTGGT TATACCCTTG ACGAAATTAT TAACGACATC ACGGGTAAGA CACCAGCCTG TTTTGAACCC ACAATTGATT ATGTCGTCAC GAAGATACCA CGATTTGCAT TTGAGAAGTT TAAGGGTAGC CCTGCTGTTC TCTCTACGGC AATGAAGTCT GTGGGAGAGG CAATGGCGAT CGGCCGCTGT TTCGAAGAGT CTTTTCAGAA AGCGATGCGT TCTCTGGAAA TTGGGAGGGC TGGTTGGGGT TGTGATCGGC AGGAGCCAGA ATTGACTCCT ACAGAAATTG AGCGTTTGTT GCGCACACCT TCTCCTGAGC GGATCATGGC AGTTAGAACA GCGATGTTGG CCGGACGCAG TGACCAAGAT ATTTACGCTT TGAGCAAAAT TGATATTTGG TTTTTAGCCA AACTACGCCA TTTGATAAAC ATTGAGACAA CGCATATGCA TGGACGTAAA TTAGATGAAC TGGATCAGGA ATGTCTTCTC TCATTAAAGC AGCTTGGCTA TTCCGATCGT CAGATTGCTT GGGCGACAGG TTCTGAGGAG CTCGCTGTAC GTGCTCGTCG TGAACTGTTA AATATCAAAC CTGTTTTTAA GACAGTCGAT ACTTGCGCTG CCGAATTTTC ATCTACTACT CCATATCATT ACTCAACTTA TGAGCGCCCC CTGTACCGTC TTGATTCTGA TGACCAATTG TATAAACTTG AGCCGGAAAG CGAGGTCGTT GCAGATGAGC GATCAAAAGT GATGATTCTT GGCGGCGGTC CTAATCGCAT CGGCCAAGGA ATTGAGTTTG ACTATTGCTG TTGCCATGCT TCATTTGCTG CTCAGGACAA GGGATTTGTC ACTGTGATGG TTAACAGTAA TCCCGAAACT GTCTCTACCG ATTATGATAC TAGTGACAGA CTTTATTTTG AACCTCTTAC TCTTGAGGAT GTACTCAATG TGATCGAGGC TGAACGGCCT AATGGTGTCA TCGTTCAGTT TGGTGGTCAG ACCCCACTCA AGTTGGCGAT TCCCTTGCTT CGTTGGCTAG AGAGTTCGAT CGGAAAGTCA ACAGGCACAC GTATATGGGG TACATCACCA GAATCAATTG ATCAGGCTGA AGATCGTGAA CAGTTCGAGG CGATCCTGCG GCAGCTTCAA ATTCGCCAGC CTCGTAATGG CTTAGCTCGC AGTGAAGAAG ATGCCCGAGC CGTAGCACTT CGCATTGGTT ATCCCGTTGT TGTTCGTCCC TCTTATGTGC TTGGAGGCCG AGCCATGGAG GTGGTGTTTG ATGAACAAGA GTTGAATCGC TATATGGCTG AAGCGGTTCA GGTCGAACCC GATCATCCAG TGCTTATCGA TCAGTATCTG GAAAATGCTG TTGAAGTTGA TGTTGATGCT CTCTGTGACT CTGATGGAGT TGTAGTGATT GGCGGTTTGA TGGAACACAT TGAGCCTGCT GGAATTCATT CTGGTGATTC CGCTTGCTGC CTTCCCTCCA TCTCTCTTGG TGAGCAAGCC CTTCGCACGA TTCGTCTCTG GAGTGAAGCG TTGGCCCTTG CGCTCAAAGT TCAGGGGTTG ATCAATCTGC AGTTCGCTGT CCAGCGGGAT GAAGCGGGTG ATGAGCAAGT GTTCATCATC GAGGCCAATC CTCGTGCTTC TCGTACCGTT CCTTTTGTGG CTAAAGCCAC AGGAGTACCC CTGGCCAGGA TCGCAACAAG GATCATGGCA GGAGAGACGT TGGCGGCTGT TGGACTTACG CATGAGCCTC AACCGCCTCT TCAGTCTGTC AAGGAAGCCG TATTGCCATT CCGTCGTTTC CCTGGGGCCG ATTCGGTGCT TGGCCCGGAG ATGCGGTCCA CTGGAGAAGT GATGGGTTCA GCCTCCAGTT TTGGCATGGC GTTTGCCAAG TCTGAAATTG CAGCTGGTGA TCCCCTGCCA ATTCGTGGAA CTGTCTTTCT CTCTACTCAT GACCGGGATA AGCCAGCGCT ACTACTTGTC GCTGAAAGGC TGATTGAACT TGGCTTTGAT CTCACGGCCA CCTCCGGAAC TGCGCAAGCC CTTAATCAGG CTGGATTGAA CGTTCAGCCA GTACTCAAGG TTCATGAGGG ACGTCCCAAT ATCGAAGATT TGATTCGCTC TGGTCAGATT CAGTTGGTGA TCAACACACC TATTGGCCGT CAGGCCGCTC ATGACGACAA ATATTTGCGA CGAGCAGCTC TTGATTACTC CGTGCCTACC CTTACGACGA TTGCTGGAGC ACGAGCTGCT GTCGAGGGAA TCACGGCACT CCAGCAACAG TCACTATCCG TAGCGGCCCT TCAAGACATC CATGTTTGA
|
Protein sequence | MPRRSDLRRI LLLGSGPIVI GQACEFDYSG TQACKALRDE GFEVVLVNSN PASIMTDPEM ADRTYIEPLT PEVVARVIEL ERPDALLPTM GGQTALNLAV ALAENHTLER FGVELIGADL AAIRKAEDRQ LFKQAMERIG VDVCPSGIAS SLKEAEEVGE VINSFPRIIR PAFTLGGSGG GIAYNPEEFA AICKSGLDAS PVSQILIEKS LLGWKEFELE VMRDLADNVV IVCSIENLDA MGVHTGDSIT VAPAQTLTDR EYQRLRDQSI AIIREIGVAT GGSNIQFAIN PDDGEVVVIE MNPRVSRSSA LASKATGFPI AKIAARLAVG YTLDEIINDI TGKTPACFEP TIDYVVTKIP RFAFEKFKGS PAVLSTAMKS VGEAMAIGRC FEESFQKAMR SLEIGRAGWG CDRQEPELTP TEIERLLRTP SPERIMAVRT AMLAGRSDQD IYALSKIDIW FLAKLRHLIN IETTHMHGRK LDELDQECLL SLKQLGYSDR QIAWATGSEE LAVRARRELL NIKPVFKTVD TCAAEFSSTT PYHYSTYERP LYRLDSDDQL YKLEPESEVV ADERSKVMIL GGGPNRIGQG IEFDYCCCHA SFAAQDKGFV TVMVNSNPET VSTDYDTSDR LYFEPLTLED VLNVIEAERP NGVIVQFGGQ TPLKLAIPLL RWLESSIGKS TGTRIWGTSP ESIDQAEDRE QFEAILRQLQ IRQPRNGLAR SEEDARAVAL RIGYPVVVRP SYVLGGRAME VVFDEQELNR YMAEAVQVEP DHPVLIDQYL ENAVEVDVDA LCDSDGVVVI GGLMEHIEPA GIHSGDSACC LPSISLGEQA LRTIRLWSEA LALALKVQGL INLQFAVQRD EAGDEQVFII EANPRASRTV PFVAKATGVP LARIATRIMA GETLAAVGLT HEPQPPLQSV KEAVLPFRRF PGADSVLGPE MRSTGEVMGS ASSFGMAFAK SEIAAGDPLP IRGTVFLSTH DRDKPALLLV AERLIELGFD LTATSGTAQA LNQAGLNVQP VLKVHEGRPN IEDLIRSGQI QLVINTPIGR QAAHDDKYLR RAALDYSVPT LTTIAGARAA VEGITALQQQ SLSVAALQDI HV
|
| |