Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09421 |
Symbol | carB |
ID | 5731110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 838243 |
End bp | 841551 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641285308 |
Product | carbamoyl phosphate synthase large subunit |
Protein accession | YP_001550827 |
Protein GI | 159903483 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | [TIGR01369] carbamoyl-phosphate synthase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.48338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00020479 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCACGGC GTAAAGATAT ACGTCGCATT CTCATCCTAG GTTCTGGACC AATTGTGATA GGCCAGGCCT GTGAATTCGA TTATTCAGGA ACCCAAGCCT GTAAAGCCTT AAAAGAGGAA GGATATGAAG TCGTATTAGT TAATTCTAAT CCGGCTTCAA TAATGACAGA TCCTGAAATT GCCTATAGGA CCTATATAGA ACCAATAACA TCAGATGTCA TCGAAAAGGT TATTGAGGTG GAAAGGCCTC AGGCACTCCT TCCGACAATG GGAGGTCAAA CGGCTTTAAA CATATCTGTA GAACTTGCTG AGAAAGGAAT TCTTAAAAAA TTTGGCGTAG AACTTATCGG AGCAGATCTA GCCTCAATAA AAAGGGCAGA AGATCGCCAA TTATTTAAAG ACTCTATGAA GAATATTGGA GTAAATGTAT GTCCATCTGG TATTGCATCC AATATTGAGG AGTCTCTTTC TGTAGGAAAT ACAATATCGA CTTTCCCAAG AATTATCCGA CCTGCCTTTA CCCTTGGTGG AAGTGGCGGA GGCATTGCAT ATAACAAAGA AGAATTTATA TCTATTTGCA AGCAAGGATT GGAAGCTAGT CCAGTTTCTC AAATACTTAT TGAAAAGTCT TTATTGGGAT GGAAGGAGTT TGAATTGGAA GTAATGAGAG ATATTTCAGA CAATGTAGTT ATTGTTTGCA GTATTGAAAA TCTCGATCCT ATGGGTATTC ACACTGGTGA TTCTATAACA GTAGCACCAG CGCAAACATT AACTGACCGT GAATATCAAA GACTAAGGGA TTATTCAATA AAAATAATTA GAGAAATTGG AGTAGAAACT GGAGGGAGCA ATATTCAATT TGCAGTGAAT CCTATTGATG GTGAAGTAAT AGTTATTGAA ATGAACCCTA GGGTTAGTCG TTCATCAGCT TTGGCAAGTA AAGCTACTGG TTTCCCAATA GCAAAGATCG CAGCATTGTT AGCAGTTGGT TACAGACTCG ATGAAATTGT TAACGACATC ACAGGTAAAA CACCAGCTTG CTTTGAGCCA ACAATTGATT ATGTAGTCAC AAAAATTCCA CGATTTGCAT TTGAGAAATT CGGTGGAAGT CCTGCAGTTC TGAACACATC AATGAAGTCT GTAGGAGAAG CTATGGCAAT AGGCAGATCT TTTGAAGAAT CATTTCAGAA AGCATTAAGG TCCCTAGAAG TAGGTCTATC AGGATGGGGA TGTGATGGTA CTGATCAATT AATTAAGTCA GTTGATTTAG ATAAATTATT GAGGACACCT TCACCAGAAA GAATAATGGC AGTTAGAACT GCAATGTTGG AAGGTAGGTC TGATGAAAAT ATATACAGAA TTTCTAATAT TGACCCCTGG TTTTTATCTA AACTACGAAA CATTATTATA GCTGAACAGT CAATTCTAAC AGATAAGAAC ATAGAGGATG TTAATGAAGA GGAATTATTT TTGCTCAAAC AACTAGGATT TTCTGATAGA CAAATAGCAT GGGCTCTTAG AGTGAATGAA TTGCAAATCC GGAGTCTAAG AAAACGGTTT AATATATTGC CAAAATTTAA AACAGTGGAT ACCTGTGCAG CTGAATTTAG CTCCTCCACT CCCTACCATT ATTCAACTTA TGAAAGACAA GTTAAAAAAA TAGATTTAGA CGGAACAATT AATAAGATTG AGCTAGATAG AGAAATTGAG GCAAGCTATA CTAATAAAGT TCTTATTTTA GGTGGAGGGC CAAATAGAAT AGGGCAAGGG ATAGAATTTG ATTATTGCTG CTGTCACTCA TCATATCAAT TTCAGGTAGA TGGTTACACA ACTATAATGG TTAATAGTAA TCCTGAAACT GTTTCCACAG ACTATGATAC TAGCGATATT CTTTATTTTG AACCATTAAC TCTAGAGGAT ATTTTAAATA TTATTGAATA TGAGCAACCT AATGGAGTAA TCGTTCAATT TGGAGGTCAG ACACCACTAA AGTTATCAAT ACCAATCATG AACTGGCTTA ATTCTAGTGA TGGAATCAAA ACCCAAACAA AAATATTAGG AACATCTCCT ATATCTATAG ATAAGGCGGA AGATCGTGAG CAATTTGATA AAATTTTAAA TGACCTAAAG ATTAAACAAC CTCGAAATGG TATAGCAAGA TCTATTAGTG AAGCTATAGC TATAGCACAA GATATTAAAT ATCCAATAGT AGTTCGTCCA TCTTATGTAC TAGGGGGTCG AGCAATGGAA ATTGTCTATG AAGATGAGGA ATTAGTTAGA TATATGAATG AGGCTGTAAA GGTTGAGCCA GATCATCCTG TATTGATAGA TCAATATCTA CAGAATGCAA TTGAAGTTGA TGTTGATGCT CTTTGTGATC GTACAGGTAA TGTTGTTATT GGCGGTTTGA TGGAACATAT AGAACCAGCA GGTATTCACT CTGGAGATTC TGCCTGCTGC TTGCCATCTA TATCGTTATC AAAAGATTCA CAGCTAACTA TAAGAAAATG GACCGAGTCC CTATCTAAAG CACTAAATGT TGAAGGTTTA ATAAACCTAC AATTTGCTGT ACAGATAGAT ACTGAAGGTA TAGAGCAAGT TTTTATAATT GAAGCAAATC CAAGAGCATC TAGGACTGTT CCATTCGTCT CTAAGGCTAC AGGCTTGCCA TTGGCACGTA TAGCTACAAG TCTAATTGGA GGCAAAACAT TAAATCAACT TGGAGTGACT AAAGAACCAA TACCGCCTCT TCAAACAATA AAAGAGGCAG TCATGCCCTT TAAGCGCTTT CCAGGATCAG ACAGTGTTCT AGGACCAGAA ATGAGATCTA CAGGTGAGGT AATGGGCTCG GCTACTGATT TCGGTATGGC ATACGCCAAG TCAGAGCTTG CAGCGGGAGA AGCTTTGCCC ACGAGTGGGG TAGTGTTTTT ATCCACTCAT GACAGAGATA AACCTTCCCT CGTTCCTATT GCAAGATCAT TGATTGACTT AGGTTTTACC CTGACTGCAA CTCTCGGCAC TTCAAAATAT CTTATGGACG CTGGAGTATT GGTAGAACCT ATTCTAAAAG TTCATGAGGG ACGACCAAAT ATAGAAGACC TAATAAGATC TGCACAAATT CAATTAATAA TAAATACTCC TATAGGAAGG CAAGCAGCCC ACGATGATAA GTATTTAAGA AGAGCAGCTC TTGATTATTC AGTGCCTACA TTAACTACTG TTGCAGGAGC GAAGGCAGCT GTCGAAGCTA TAACAGCATT GCAAAATCAC AAAATAACAA TTAATGCATT ACAAGACATT CATCATTAA
|
Protein sequence | MPRRKDIRRI LILGSGPIVI GQACEFDYSG TQACKALKEE GYEVVLVNSN PASIMTDPEI AYRTYIEPIT SDVIEKVIEV ERPQALLPTM GGQTALNISV ELAEKGILKK FGVELIGADL ASIKRAEDRQ LFKDSMKNIG VNVCPSGIAS NIEESLSVGN TISTFPRIIR PAFTLGGSGG GIAYNKEEFI SICKQGLEAS PVSQILIEKS LLGWKEFELE VMRDISDNVV IVCSIENLDP MGIHTGDSIT VAPAQTLTDR EYQRLRDYSI KIIREIGVET GGSNIQFAVN PIDGEVIVIE MNPRVSRSSA LASKATGFPI AKIAALLAVG YRLDEIVNDI TGKTPACFEP TIDYVVTKIP RFAFEKFGGS PAVLNTSMKS VGEAMAIGRS FEESFQKALR SLEVGLSGWG CDGTDQLIKS VDLDKLLRTP SPERIMAVRT AMLEGRSDEN IYRISNIDPW FLSKLRNIII AEQSILTDKN IEDVNEEELF LLKQLGFSDR QIAWALRVNE LQIRSLRKRF NILPKFKTVD TCAAEFSSST PYHYSTYERQ VKKIDLDGTI NKIELDREIE ASYTNKVLIL GGGPNRIGQG IEFDYCCCHS SYQFQVDGYT TIMVNSNPET VSTDYDTSDI LYFEPLTLED ILNIIEYEQP NGVIVQFGGQ TPLKLSIPIM NWLNSSDGIK TQTKILGTSP ISIDKAEDRE QFDKILNDLK IKQPRNGIAR SISEAIAIAQ DIKYPIVVRP SYVLGGRAME IVYEDEELVR YMNEAVKVEP DHPVLIDQYL QNAIEVDVDA LCDRTGNVVI GGLMEHIEPA GIHSGDSACC LPSISLSKDS QLTIRKWTES LSKALNVEGL INLQFAVQID TEGIEQVFII EANPRASRTV PFVSKATGLP LARIATSLIG GKTLNQLGVT KEPIPPLQTI KEAVMPFKRF PGSDSVLGPE MRSTGEVMGS ATDFGMAYAK SELAAGEALP TSGVVFLSTH DRDKPSLVPI ARSLIDLGFT LTATLGTSKY LMDAGVLVEP ILKVHEGRPN IEDLIRSAQI QLIINTPIGR QAAHDDKYLR RAALDYSVPT LTTVAGAKAA VEAITALQNH KITINALQDI HH
|
| |