Gene P9515_09001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_09001 
SymbolcarB 
ID4719308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp799375 
End bp802677 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content33% 
IMG OID640080578 
Productcarbamoyl-phosphate synthase, large subunit 
Protein accessionYP_001011214 
Protein GI123966133 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) 
TIGRFAM ID[TIGR01369] carbamoyl-phosphate synthase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCAAA GAGGTGATCT TAAACGAATT CTTATTCTTG GTTCAGGTCC AATTGTTATT 
GGACAAGCAT GTGAATTTGA TTATTCAGGT ACTCAGGCAT GTAAAGCTTT ACGAAAAGCT
GGCTATGAAA TTATTTTAAT TAATTCTAAT CCAGCATCCA TAATGACCGA TCCTGATATT
GCAAACAAAA CATATATTGA GCCTTTAACT CCTGAAATTG TTTCTCAAAT AATTTTAAAA
GAAAAACCTG ATGCAATACT TCCAACTATG GGTGGTCAGA CAGCCCTTAA TCTAGCTGTA
AAACTCTCTG AATCTAATTT CTTGAAGAAT AATAATGTTG AATTAATAGG AGCAGATTTA
AAAGCTATTA ATAAGGCTGA AGATAGAAAA CTATTTAAAG AATCGATGGA AAAAATTAAT
GTAAATGTTT GTCCTTCTGG TATTGCTTCT GATTTAGTTG AAGCGATGAA AGTTTCTAAG
CAAATAAATT CATATCCATT AATTATTAGG CCCGCTTTTA CTTTAGGAGG AGTAGGAGGA
GGTATAGCAT ATAACCTTGA AGAATTTAAT GATCTTTGTA AATCTGGTTT AGATGAAAGT
CCTAGTAATC AAATATTGAT TGAAAAATCT TTAATCGGTT GGAAGGAATT TGAATTAGAA
GTTATGAGAG ATACCTCTGA TAACGTAGTG ATAATTTGCA GTATTGAAAA TTTGGACCCT
ATGGGAGTTC ACACTGGCGA TTCGATTACA GTAGCTCCTG CTCAAACATT AACTGATAAA
GAATATCAGA GACTCAGAGA TTTATCATTA AAAATTATTA GAGAAGTAGG TGTTGAAACA
GGGGGGAGTA ATATCCAATT TGCTGTAAAT CCCAATAATG GTGATGTGAT AGTTATTGAG
ATGAATCCGA GGGTAAGTAG ATCTTCTGCA TTAGCTAGTA AAGCTACAGG TTTTCCAATA
GCTAAAATTG CAGCTTTATT ATCGGTGGGA TACACTCTGG ATGAAATTAT TAATGATATT
ACTAAAAAAA CCCCTGCATG TTTTGAACCG TCGATTGATT ATGTAGTAAC CAAAGTTCCA
AGATTTGCTT TTGAAAAATT TAAAGGTTCT TCAAATATTT TGAATACGGC AATGAAGTCT
GTTGGAGAAA CAATGGCAAT TGGTCGTTCA TTTGAAGAAT CATTTCAAAA AGCATTGAGA
TCTTTGGAAA TTGGAATTTT TGGTTGGGAA TGCGATTCTA TTGAAGATTT CAATAATGAA
AATGATTTAA AAAATAATTT AAGAAATCCC ACTGCTGAAA GAATTTTAAT AGTAAAAAAA
GCAATGGAAG CTGGTAAGAG TGATAGTTAT ATCAACGAAA TAACCAATAT AGATTTATGG
TTTCTTGAGA AGTTACGAAA TATTTTTAAT TTTCAGAACC AATTTTTGAA AGAGAAAAAT
CTTAATGAAA TTGATAGAGA TTTAATGTTA AGGGCTAAGC AAATTGGTTT TTCAGATCAA
CAGATTGCAA AATTAACTGA CTCTGATTTT TTTGAAGTAA GAAACTATCG AAAAAAATTA
AATATCCTAC CACTTTATAA AACAGTGGAT ACTTGTTCGG CTGAGTTTTC ATCAGAAACT
CCCTATCATT ACTCAACTTA TGAAGAATCT TTTTCTGAGA GTAATTTACA TTTTTCTGAC
AATGAATCAT ATTCAAATAA AGCCAATCCT GCAAAAAAAA TCATGATATT AGGAGGTGGA
CCTAATAGAA TTGGTCAAGG GATTGAATTT GATTACTGTT GTTGCCATGC CTCATATCAA
TCTTCAAGTA ATGGATATCA AACCATTATG GTTAATAGCA ATCCTGAAAC TGTTTCAACT
GATTATGATA CTAGCGATAT TTTATATTTT GAACCTGTTA CTTTTGAAGA TATTCTAAAT
ATAATTGAGG CTGAAAATCC ATATGGATTG ATAGTCCAAT TTGGAGGCCA AACTCCCTTG
AAATTATCGT TACCTCTATC AAATTGGTTG AAATCCAAAG AGGGTCTAAA ATGTAATTCA
AAAATTCTTG GAACTTCCCC CCTCTCAATT GATATTGCCG AAGATAGAGA AGAGTTTACA
AAAATCCTTA AAGACTTAAA TATTAGACAA CCTCTCAATG GAATAGCGCG CACTCAAGAG
GAAGCATTAT CAGTTTCACA AAATATAGGC TTCCCTCTGG TGGTTAGGCC TTCTTATGTT
TTGGGAGGAA GAGCTATGGA AATAGTTAAA GACAAAATTG AATTATCAAG GTATATATCT
GAGGCTGTTA AAGTTTCCCC AGATCATCCA ATCCTTTTGG ATCAATATTT AAGTAATGCA
ATAGAGATTG ATGTTGATGC ATTGTGTGAT GAAAATGGTT CTGTCGTAAT AGCAGGACTT
ATGGAACATG TAGAACCTGC TGGGATCCAT TCCGGTGATT CAGCCTGTTG CCTACCTTCT
ATTTCATTAT CACAACTTAC ATTAGAAACT GTAAAAGAAT GGACAAAATT AATTGCAAAT
AGATTAAACG TTGTTGGCTT AATTAATTTA CAATTTGCAG TTACAAATAT AAATCATCAA
GAAAATGAAC TATATATTCT TGAGGCTAAT CCACGAGCAT CTAGAACAAT TCCTTTTGTT
TCTAAAGCTA TTGGTAAGCC TGTTGCGAAA TTAGCAACTC AATTAATGCA AGGAAGTTCT
TTAAAGGATA TAAATTATAC TGAAGAGATA ATTCCAAAAT ATCAAGCTGT AAAAGAAGCT
GTTTTACCTT TTAAAAGGTT TCCTGGCTCG GATACTTTGC TTGGCCCTGA GATGCGTTCT
ACTGGCGAAG TTATGGGCTT GGCTGAGGAC TTCGGTTTAG CTTATGCAAA ATCAGAATTA
GCATCAGGGA ATGGAGTCCC TTCTGAGGGA GTTGCTTTTT TATCTACAAA TGATTTAGAT
AAAGATAAAT TAGAGGAGAT CGCCAGAGAA CTAATAGTTT TAGGTTTTAA ATTAATTGCA
ACAAAAGGTA CTGCAAGCTA TTTATTAAAT TTAGGTATTG AGGTAGATGA GGTTTTGAAA
GTTCATGAAG GAAGGCCCAA TATTGAAGAT TTAATTCGTT CTGGTCTTAT CCAATTAGTT
ATAAACACGC CTATTGGTTC TCAAGCATTT CATGATGATG CTTATCTCAG GCGTGCAGCC
TTAGAATATA ATATTCCAAC ATTTACAACT ATTCCAGGGG CAAAAGCAGC TTTACAAGCA
ATTAAATCTT TGCGTATAAA TAAAATTGAT ACACTATCAC TTCAAGAAAT TCATAAAGAT
TAG
 
Protein sequence
MPQRGDLKRI LILGSGPIVI GQACEFDYSG TQACKALRKA GYEIILINSN PASIMTDPDI 
ANKTYIEPLT PEIVSQIILK EKPDAILPTM GGQTALNLAV KLSESNFLKN NNVELIGADL
KAINKAEDRK LFKESMEKIN VNVCPSGIAS DLVEAMKVSK QINSYPLIIR PAFTLGGVGG
GIAYNLEEFN DLCKSGLDES PSNQILIEKS LIGWKEFELE VMRDTSDNVV IICSIENLDP
MGVHTGDSIT VAPAQTLTDK EYQRLRDLSL KIIREVGVET GGSNIQFAVN PNNGDVIVIE
MNPRVSRSSA LASKATGFPI AKIAALLSVG YTLDEIINDI TKKTPACFEP SIDYVVTKVP
RFAFEKFKGS SNILNTAMKS VGETMAIGRS FEESFQKALR SLEIGIFGWE CDSIEDFNNE
NDLKNNLRNP TAERILIVKK AMEAGKSDSY INEITNIDLW FLEKLRNIFN FQNQFLKEKN
LNEIDRDLML RAKQIGFSDQ QIAKLTDSDF FEVRNYRKKL NILPLYKTVD TCSAEFSSET
PYHYSTYEES FSESNLHFSD NESYSNKANP AKKIMILGGG PNRIGQGIEF DYCCCHASYQ
SSSNGYQTIM VNSNPETVST DYDTSDILYF EPVTFEDILN IIEAENPYGL IVQFGGQTPL
KLSLPLSNWL KSKEGLKCNS KILGTSPLSI DIAEDREEFT KILKDLNIRQ PLNGIARTQE
EALSVSQNIG FPLVVRPSYV LGGRAMEIVK DKIELSRYIS EAVKVSPDHP ILLDQYLSNA
IEIDVDALCD ENGSVVIAGL MEHVEPAGIH SGDSACCLPS ISLSQLTLET VKEWTKLIAN
RLNVVGLINL QFAVTNINHQ ENELYILEAN PRASRTIPFV SKAIGKPVAK LATQLMQGSS
LKDINYTEEI IPKYQAVKEA VLPFKRFPGS DTLLGPEMRS TGEVMGLAED FGLAYAKSEL
ASGNGVPSEG VAFLSTNDLD KDKLEEIARE LIVLGFKLIA TKGTASYLLN LGIEVDEVLK
VHEGRPNIED LIRSGLIQLV INTPIGSQAF HDDAYLRRAA LEYNIPTFTT IPGAKAALQA
IKSLRINKID TLSLQEIHKD