Gene A9601_10451 details

Gene Information

Locus tag: A9601_10451
Symbol: carB
ID: 4717756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 906401
End bp: 909697
Gene Length: 3297 bp
Protein Length: 1098 aa
Translation table: 11
GC content: 34%
IMG OID: 640078760
Product: carbamoyl-phosphate synthase, large subunit
Protein accession: YP_001009436
Protein GI: 123968578
COG category: [E] Amino acid transport and metabolism
    [F] Nucleotide transport and metabolism
    [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
    [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
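
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the gene length includes a terminal stop codon; the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 906401
end_bp = 909697
gene_length_bp = 3297
protein_length_aa = 1098

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted
# in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions, 3297 bp corresponds to 1099 codons, one of which is the stop codon, which agrees with the listed 1098 aa.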


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTCAAA GAGGTGATCT TAAAAAAATT CTTATTCTAG GTTCGGGCCC GATTGTTATA 
GGACAAGCAT GTGAATTTGA TTACTCTGGT ACTCAAGCTT GCAAAGCTTT AAGAAATGCT
GGTTATGAAG TTGTCTTGAT AAATTCAAAT CCAGCATCAA TAATGACTGA TCCTGAGATA
GCGAGCAAAA CATATATAGA ACCATTGACT CCTGAAATTG TTTCTCAAAT CATATTAAGA
GAAAAACCTG ATGCGATTCT TCCTACTATG GGAGGTCAAA CCGCATTGAA TCTTGCGGTT
AAATTATCAG AGTCAGATTT TTTAAAAGAA AATAATGTTG AATTAATTGG TGCTGATTTA
AGAGCTATTA ATAAAGCTGA AGATAGAAAA TTGTTTAAAG AATCGATGGA GAAAATTAAT
GTAAATGTTT GTCCGTCTGG GATTGCATCT AACTTGGATG AAGCTGTGGA GGTATCAAAA
AAAATTAGTT CCTATCCTCT TATAATAAGG CCTGCATTTA CATTAGGTGG TGTAGGAGGT
GGAATCGCTT TTAACCTTGA AGAATTTGTT GTGTTGTGTA AATCAGGTTT AGAGGAAAGT
CCAAGTAATC AAATATTGAT TGAAAAATCA CTTATTGGAT GGAAGGAGTT TGAACTAGAG
GTAATGAGAG ATACTGCTGA CAATGTAGTA ATAATTTGCA GTATTGAAAA TTTAGACCCA
ATGGGAGTCC ACACTGGAGA TTCTATTACT GTCGCTCCTG CACAGACTTT AACAGATAAG
GAGTATCAGA GATTGAGGGA TTTGTCATTA AAAATTATTC GAGAGGTAGG AGTCGAAACA
GGAGGGAGTA ATATTCAATT TGCACTAAAT CCATCTAATG GAGAAGTAAT TGTCATTGAA
ATGAATCCCC GTGTGAGTAG ATCCTCTGCT TTGGCAAGTA AAGCAACTGG ATTTCCCATT
GCTAAGATTG CAGCTTTATT ATCTGTTGGC TATACACTTG ATGAGATTAT TAATGACATT
ACAAAAAAAA CACCTGCATG TTTTGAGCCG TCAATTGATT ATGTAGTTAC TAAGATTCCA
AGATTTGCCT TTGAAAAGTT TAAAGGCTCT ACAAATACTT TAAGCACTGC CATGAAATCC
GTTGGTGAGT CAATGGCAAT AGGTCGTTCT TTTGAAGAAT CATTTCAGAA AGCATTAAGG
TCATTAGAAG TCGGTGTTTT TGGATGGGAA TGTGATTCTC TAGATGAATT TAAAAATGAA
AGTCACATTA AAAATAGTTT AAGAAACCCT ACATCTGAAA GAATTCTCAT GGTTAAGAAA
GCTATGCAGC TTGGAAAAAC TAATTCTTAT ATTCAAGAAG TTACAAATAT AGATTTATGG
TTTATCGAGA AATTACGTAA TATCTTTAAT TTTGAAAATG AATTTTTGAA AGAAAAGGAA
CTTTATACTC TAGATAGGGA TTTGATGTTA CATGCTAAAC AATTAGGCTT TTCAGATCAA
CAGATAGCAA AGTTAACTAA TTCTGATTTT TTTGAAGTGA GAAGATATAG AAAAAAACTT
AATATAATTC CAATTTATAA GAATGTTGAT ACTTGTTCAG CAGAATTCTC GTCATCAACT
CCCTATCATT ATTCAACTTA CGAAGAGGCA TTCATAAATT TTAATTCTCA AACTTTTGAT
AGCGAGATTT CAGAAAATAG TAAATCAAAA AAAATTATGA TTCTAGGAGG AGGTCCAAAC
AGAATTGGTC AGGGAATAGA ATTTGATTAC TGTTGTTGTC ATGCATCATA TCAAGCCTCC
ACAAATGGTT ATAAAACAAT AATGGTTAAT AGTAACCCTG AAACTGTATC AACAGATTAT
GATACTAGCG ATATTTTATA TTTTGAGCCT GTAACTTTGG AAGATGTGCT CAACATAATA
GAGGCTGAAA ATCCTTATGG TTTAATTGTT CAATTTGGAG GCCAAACCCC ACTGAAATTA
TCATTACCTT TATCTGAATG GCTTAAATCT AATGAAGGGC TCAGAACTGG ATCAAAAATT
CTTGGGACTT CTCCAATATC TATCGATTTA GCAGAAGATA GAGAGGAATT TACAAAAATA
CTTAAAGAAT TAGCTATTAG ACAACCTTTA AACGGTATTG CACATAATCA AAAAGAAGCG
GCGGTTGTAG CAAAAAATAT AGGGTTCCCC TTGGTTGTAA GACCTTCTTA TGTTTTAGGA
GGAAGGGCAA TGGAAATTGT TAAAGATGAG AATGAATTAT CTAGATACAT CTCTGAAGCA
GTTAAGGTAT CTCCTGATCA TCCAATACTT CTTGATCAAT ATTTGAATAA TGCTATTGAG
ATAGATGTTG ATGCTTTGTG CGATTCAGAG GGTTCGGTTG TTATTGCTGG TCTAATGGAA
CATGTTGAAC CGGCAGGAAT TCATTCTGGA GACTCAGCAT GTTGTTTGCC ATCCATTTCT
CTTTCAAAAT CCACCATAGA AAATGTAAAG AAATGGACTA AATTAATTGC ACAAAGATTA
AATGTTGTTG GTTTAATTAA TTTGCAATTT GCAGTAACGA ATACAAATAA CAAAGAAAAA
AAATTATTTA TTCTTGAAGC AAATCCAAGA GCATCAAGGA CAGTCCCATT TGTTTCAAAA
GCCATAGGTA AACCAGTTGC AAAATTAGCT ACCCAGTTAA TGCAAGGTTT TACATTAGAA
GATGTTAATT TCACACAAGA ATTTTACCCA AAATATCAGG CAGTAAAAGA AGCTGTTTTA
CCTTTCAAAA GATTTCCTGG ATCTGATACA TTACTTGGTC CTGAAATGAA ATCTACTGGA
GAAGTAATGG GCTTAGCTAA AGATTTTGGA ATTGCTTATG CCAAGTCGGA ATTGGCTGCA
GGAAATGGTG TGCCTTCAGA AGGAGTGGCT TTTTTGTCTA CAAATGATTT AGATAAAAAA
CTTCTTGAGG AAGTTGCTAG AGAATTGATG ATTTTAGGAT TTAAATTAAT CGCAACAAAA
GGGACAGCTG CATATTTGAT TAATCTAGGC ATTCAAGTTG AAGAAGTGCT AAAAGTTCAC
GAAGGTAGAC CAAATATTGA GGATCTAATT CGTTCTGGAC TTGTTCAATT AATAATTAAT
ACTCCAATCG GCTCACAGGC TCTACATGAC GACGCTTATT TAAGACGTGC TGCTTTAGAA
TATAATATTC CAACTTTTAC AACCATTCCT GGAGCAAAGG CAGCGATTAA AGCAATCAAA
GCATTGCAAT GTAATAATAT TGATTCTTAT TCTCTTCAAG AAATCCATAA TTTTTAA
 
Protein sequence
MPQRGDLKKI LILGSGPIVI GQACEFDYSG TQACKALRNA GYEVVLINSN PASIMTDPEI 
ASKTYIEPLT PEIVSQIILR EKPDAILPTM GGQTALNLAV KLSESDFLKE NNVELIGADL
RAINKAEDRK LFKESMEKIN VNVCPSGIAS NLDEAVEVSK KISSYPLIIR PAFTLGGVGG
GIAFNLEEFV VLCKSGLEES PSNQILIEKS LIGWKEFELE VMRDTADNVV IICSIENLDP
MGVHTGDSIT VAPAQTLTDK EYQRLRDLSL KIIREVGVET GGSNIQFALN PSNGEVIVIE
MNPRVSRSSA LASKATGFPI AKIAALLSVG YTLDEIINDI TKKTPACFEP SIDYVVTKIP
RFAFEKFKGS TNTLSTAMKS VGESMAIGRS FEESFQKALR SLEVGVFGWE CDSLDEFKNE
SHIKNSLRNP TSERILMVKK AMQLGKTNSY IQEVTNIDLW FIEKLRNIFN FENEFLKEKE
LYTLDRDLML HAKQLGFSDQ QIAKLTNSDF FEVRRYRKKL NIIPIYKNVD TCSAEFSSST
PYHYSTYEEA FINFNSQTFD SEISENSKSK KIMILGGGPN RIGQGIEFDY CCCHASYQAS
TNGYKTIMVN SNPETVSTDY DTSDILYFEP VTLEDVLNII EAENPYGLIV QFGGQTPLKL
SLPLSEWLKS NEGLRTGSKI LGTSPISIDL AEDREEFTKI LKELAIRQPL NGIAHNQKEA
AVVAKNIGFP LVVRPSYVLG GRAMEIVKDE NELSRYISEA VKVSPDHPIL LDQYLNNAIE
IDVDALCDSE GSVVIAGLME HVEPAGIHSG DSACCLPSIS LSKSTIENVK KWTKLIAQRL
NVVGLINLQF AVTNTNNKEK KLFILEANPR ASRTVPFVSK AIGKPVAKLA TQLMQGFTLE
DVNFTQEFYP KYQAVKEAVL PFKRFPGSDT LLGPEMKSTG EVMGLAKDFG IAYAKSELAA
GNGVPSEGVA FLSTNDLDKK LLEEVARELM ILGFKLIATK GTAAYLINLG IQVEEVLKVH
EGRPNIEDLI RSGLVQLIIN TPIGSQALHD DAYLRRAALE YNIPTFTTIP GAKAAIKAIK
ALQCNNIDSY SLQEIHNF
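
The sequences above are printed in space-separated blocks of 10 characters. As a rough sketch of how such a listing could be processed (the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of this record), the code below joins the blocks, computes the GC fraction, and translates the coding sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it assumes the input contains only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Translation table 11 shares these codon assignments with the
# standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Join a blocked listing into one uppercase string without whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
sample = clean_sequence("ATGCCTCAAA GAGGTGATCT TAAAAAAATT")
print(translate(sample))  # MPQRGDLKKI
print(gc_fraction(sample))
```

Pasting the full gene sequence into `clean_sequence` in place of the three sample blocks would allow the listed GC content and protein sequence to be compared against the computed values.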