Gene P9211_09421 details


Gene Information

Locus tag: P9211_09421
Symbol: carB
ID: 5731110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 838243
End bp: 841551
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 37%
IMG OID: 641285308
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_001550827
Protein GI: 159903483
COG category: [E] Amino acid transport and metabolism; [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
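
The start, end, gene length, and protein length listed above can be cross-checked against each other. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length (the variable names are illustrative only):

```python
# Values copied from the record above.
start_bp = 838243
end_bp = 841551

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa, remainder)  # 3309 1102 0
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.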


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.48338
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00020479
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCCACGGC GTAAAGATAT ACGTCGCATT CTCATCCTAG GTTCTGGACC AATTGTGATA 
GGCCAGGCCT GTGAATTCGA TTATTCAGGA ACCCAAGCCT GTAAAGCCTT AAAAGAGGAA
GGATATGAAG TCGTATTAGT TAATTCTAAT CCGGCTTCAA TAATGACAGA TCCTGAAATT
GCCTATAGGA CCTATATAGA ACCAATAACA TCAGATGTCA TCGAAAAGGT TATTGAGGTG
GAAAGGCCTC AGGCACTCCT TCCGACAATG GGAGGTCAAA CGGCTTTAAA CATATCTGTA
GAACTTGCTG AGAAAGGAAT TCTTAAAAAA TTTGGCGTAG AACTTATCGG AGCAGATCTA
GCCTCAATAA AAAGGGCAGA AGATCGCCAA TTATTTAAAG ACTCTATGAA GAATATTGGA
GTAAATGTAT GTCCATCTGG TATTGCATCC AATATTGAGG AGTCTCTTTC TGTAGGAAAT
ACAATATCGA CTTTCCCAAG AATTATCCGA CCTGCCTTTA CCCTTGGTGG AAGTGGCGGA
GGCATTGCAT ATAACAAAGA AGAATTTATA TCTATTTGCA AGCAAGGATT GGAAGCTAGT
CCAGTTTCTC AAATACTTAT TGAAAAGTCT TTATTGGGAT GGAAGGAGTT TGAATTGGAA
GTAATGAGAG ATATTTCAGA CAATGTAGTT ATTGTTTGCA GTATTGAAAA TCTCGATCCT
ATGGGTATTC ACACTGGTGA TTCTATAACA GTAGCACCAG CGCAAACATT AACTGACCGT
GAATATCAAA GACTAAGGGA TTATTCAATA AAAATAATTA GAGAAATTGG AGTAGAAACT
GGAGGGAGCA ATATTCAATT TGCAGTGAAT CCTATTGATG GTGAAGTAAT AGTTATTGAA
ATGAACCCTA GGGTTAGTCG TTCATCAGCT TTGGCAAGTA AAGCTACTGG TTTCCCAATA
GCAAAGATCG CAGCATTGTT AGCAGTTGGT TACAGACTCG ATGAAATTGT TAACGACATC
ACAGGTAAAA CACCAGCTTG CTTTGAGCCA ACAATTGATT ATGTAGTCAC AAAAATTCCA
CGATTTGCAT TTGAGAAATT CGGTGGAAGT CCTGCAGTTC TGAACACATC AATGAAGTCT
GTAGGAGAAG CTATGGCAAT AGGCAGATCT TTTGAAGAAT CATTTCAGAA AGCATTAAGG
TCCCTAGAAG TAGGTCTATC AGGATGGGGA TGTGATGGTA CTGATCAATT AATTAAGTCA
GTTGATTTAG ATAAATTATT GAGGACACCT TCACCAGAAA GAATAATGGC AGTTAGAACT
GCAATGTTGG AAGGTAGGTC TGATGAAAAT ATATACAGAA TTTCTAATAT TGACCCCTGG
TTTTTATCTA AACTACGAAA CATTATTATA GCTGAACAGT CAATTCTAAC AGATAAGAAC
ATAGAGGATG TTAATGAAGA GGAATTATTT TTGCTCAAAC AACTAGGATT TTCTGATAGA
CAAATAGCAT GGGCTCTTAG AGTGAATGAA TTGCAAATCC GGAGTCTAAG AAAACGGTTT
AATATATTGC CAAAATTTAA AACAGTGGAT ACCTGTGCAG CTGAATTTAG CTCCTCCACT
CCCTACCATT ATTCAACTTA TGAAAGACAA GTTAAAAAAA TAGATTTAGA CGGAACAATT
AATAAGATTG AGCTAGATAG AGAAATTGAG GCAAGCTATA CTAATAAAGT TCTTATTTTA
GGTGGAGGGC CAAATAGAAT AGGGCAAGGG ATAGAATTTG ATTATTGCTG CTGTCACTCA
TCATATCAAT TTCAGGTAGA TGGTTACACA ACTATAATGG TTAATAGTAA TCCTGAAACT
GTTTCCACAG ACTATGATAC TAGCGATATT CTTTATTTTG AACCATTAAC TCTAGAGGAT
ATTTTAAATA TTATTGAATA TGAGCAACCT AATGGAGTAA TCGTTCAATT TGGAGGTCAG
ACACCACTAA AGTTATCAAT ACCAATCATG AACTGGCTTA ATTCTAGTGA TGGAATCAAA
ACCCAAACAA AAATATTAGG AACATCTCCT ATATCTATAG ATAAGGCGGA AGATCGTGAG
CAATTTGATA AAATTTTAAA TGACCTAAAG ATTAAACAAC CTCGAAATGG TATAGCAAGA
TCTATTAGTG AAGCTATAGC TATAGCACAA GATATTAAAT ATCCAATAGT AGTTCGTCCA
TCTTATGTAC TAGGGGGTCG AGCAATGGAA ATTGTCTATG AAGATGAGGA ATTAGTTAGA
TATATGAATG AGGCTGTAAA GGTTGAGCCA GATCATCCTG TATTGATAGA TCAATATCTA
CAGAATGCAA TTGAAGTTGA TGTTGATGCT CTTTGTGATC GTACAGGTAA TGTTGTTATT
GGCGGTTTGA TGGAACATAT AGAACCAGCA GGTATTCACT CTGGAGATTC TGCCTGCTGC
TTGCCATCTA TATCGTTATC AAAAGATTCA CAGCTAACTA TAAGAAAATG GACCGAGTCC
CTATCTAAAG CACTAAATGT TGAAGGTTTA ATAAACCTAC AATTTGCTGT ACAGATAGAT
ACTGAAGGTA TAGAGCAAGT TTTTATAATT GAAGCAAATC CAAGAGCATC TAGGACTGTT
CCATTCGTCT CTAAGGCTAC AGGCTTGCCA TTGGCACGTA TAGCTACAAG TCTAATTGGA
GGCAAAACAT TAAATCAACT TGGAGTGACT AAAGAACCAA TACCGCCTCT TCAAACAATA
AAAGAGGCAG TCATGCCCTT TAAGCGCTTT CCAGGATCAG ACAGTGTTCT AGGACCAGAA
ATGAGATCTA CAGGTGAGGT AATGGGCTCG GCTACTGATT TCGGTATGGC ATACGCCAAG
TCAGAGCTTG CAGCGGGAGA AGCTTTGCCC ACGAGTGGGG TAGTGTTTTT ATCCACTCAT
GACAGAGATA AACCTTCCCT CGTTCCTATT GCAAGATCAT TGATTGACTT AGGTTTTACC
CTGACTGCAA CTCTCGGCAC TTCAAAATAT CTTATGGACG CTGGAGTATT GGTAGAACCT
ATTCTAAAAG TTCATGAGGG ACGACCAAAT ATAGAAGACC TAATAAGATC TGCACAAATT
CAATTAATAA TAAATACTCC TATAGGAAGG CAAGCAGCCC ACGATGATAA GTATTTAAGA
AGAGCAGCTC TTGATTATTC AGTGCCTACA TTAACTACTG TTGCAGGAGC GAAGGCAGCT
GTCGAAGCTA TAACAGCATT GCAAAATCAC AAAATAACAA TTAATGCATT ACAAGACATT
CATCATTAA
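
The GC content field in the record (37%) is the share of G and C bases in the gene sequence. A minimal sketch of that calculation in Python, assuming the sequence is pasted as a string with the 10-base grouping spaces left in. `gc_content` is a hypothetical helper name, and only the first line of the sequence is used as sample input, so the printed figure applies to that line and not to the whole gene:

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)

# First line of the gene sequence above, used only as a short example input.
first_line = "ATGCCACGGC GTAAAGATAT ACGTCGCATT CTCATCCTAG GTTCTGGACC AATTGTGATA"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 3309 bp sequence instead of `first_line` would give the gene-wide value to compare with the record.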
 
Protein sequence
MPRRKDIRRI LILGSGPIVI GQACEFDYSG TQACKALKEE GYEVVLVNSN PASIMTDPEI 
AYRTYIEPIT SDVIEKVIEV ERPQALLPTM GGQTALNISV ELAEKGILKK FGVELIGADL
ASIKRAEDRQ LFKDSMKNIG VNVCPSGIAS NIEESLSVGN TISTFPRIIR PAFTLGGSGG
GIAYNKEEFI SICKQGLEAS PVSQILIEKS LLGWKEFELE VMRDISDNVV IVCSIENLDP
MGIHTGDSIT VAPAQTLTDR EYQRLRDYSI KIIREIGVET GGSNIQFAVN PIDGEVIVIE
MNPRVSRSSA LASKATGFPI AKIAALLAVG YRLDEIVNDI TGKTPACFEP TIDYVVTKIP
RFAFEKFGGS PAVLNTSMKS VGEAMAIGRS FEESFQKALR SLEVGLSGWG CDGTDQLIKS
VDLDKLLRTP SPERIMAVRT AMLEGRSDEN IYRISNIDPW FLSKLRNIII AEQSILTDKN
IEDVNEEELF LLKQLGFSDR QIAWALRVNE LQIRSLRKRF NILPKFKTVD TCAAEFSSST
PYHYSTYERQ VKKIDLDGTI NKIELDREIE ASYTNKVLIL GGGPNRIGQG IEFDYCCCHS
SYQFQVDGYT TIMVNSNPET VSTDYDTSDI LYFEPLTLED ILNIIEYEQP NGVIVQFGGQ
TPLKLSIPIM NWLNSSDGIK TQTKILGTSP ISIDKAEDRE QFDKILNDLK IKQPRNGIAR
SISEAIAIAQ DIKYPIVVRP SYVLGGRAME IVYEDEELVR YMNEAVKVEP DHPVLIDQYL
QNAIEVDVDA LCDRTGNVVI GGLMEHIEPA GIHSGDSACC LPSISLSKDS QLTIRKWTES
LSKALNVEGL INLQFAVQID TEGIEQVFII EANPRASRTV PFVSKATGLP LARIATSLIG
GKTLNQLGVT KEPIPPLQTI KEAVMPFKRF PGSDSVLGPE MRSTGEVMGS ATDFGMAYAK
SELAAGEALP TSGVVFLSTH DRDKPSLVPI ARSLIDLGFT LTATLGTSKY LMDAGVLVEP
ILKVHEGRPN IEDLIRSAQI QLIINTPIGR QAAHDDKYLR RAALDYSVPT LTTVAGAKAA
VEAITALQNH KITINALQDI HH
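
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch of that step in Python, assuming the standard codon-to-amino-acid assignments that NCBI lists for table 11. `translate` is a hypothetical helper, and it deliberately ignores table 11's alternative start codons, which does not matter here because the gene begins with ATG:

```python
# Amino acids for NCBI translation table 11, with codons ordered T, C, A, G
# at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First and last lines of the gene sequence above.
first_line = "ATGCCACGGC GTAAAGATAT ACGTCGCATT CTCATCCTAG GTTCTGGACC AATTGTGATA"
print(translate(first_line))   # MPRRKDIRRILILGSGPIVI
print(translate("CATCATTAA"))  # HH
```

The two outputs match the first 20 and the last 2 residues of the protein sequence listed above.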