Gene A9601_02541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02541 
SymbolpyrB 
ID4716938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp236198 
End bp237214 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content34% 
IMG OID640077953 
Productaspartate carbamoyltransferase catalytic subunit 
Protein accessionYP_001008649 
Protein GI123967791 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0540] Aspartate carbamoyltransferase, catalytic chain 
TIGRFAM ID[TIGR00670] aspartate carbamoyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.290207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATTT GGCCTCATAA ACATATACAC ACACTAGCTA ATTTTTCAAT TAAAGATTAT 
GAGTCAGTAT TTGAATTAGC TAATAGATTT GATGCACTAA AGAATGCAGG AACAAAAAAG
ATACCGGCTT TACAAGGGAC TTTGGTAACT TCTTTATTTT TTGAAGCAAG TACAAGAACA
AAAAATAGTT TTGAGCTTGC AGCAAAAAGA CTTTCTGCTG ATGTCCAAAC GTTTGCGCCA
TCCTCCAGCT CTTTAACAAA AGGCGAAACA ATAATTGATA CCGCTATAAC TTATTCAGCT
ATGGGGGCGG ATACATTAGT TATCAGACAT TCATCAAGTT ACATAACCTT TGAAATCGCA
AAAAAACTTG ATGCAATAAA TTCCAAGACA TCGGTTCTTA ATGCGGGAGA TGGATTACAT
AGTCACCCCA GCCAAGGATT GCTTGACATC TATACATTGA TAAAATTCTT TTCCCCACAA
ACATTGAATC CAGAGGTTTT AAATTCCAAA AAAATTTTAA TAATTGGAGA CGTAAATCAT
TCAAGGGTTG CGAGGTCAAA TCTTTGGGCT TTAAGTGCAT TCGGCGCGGA TATAATCTTA
TGTGGTCCTA AGGCATTAAT ACCTGATGAA TTTATCAATT TTTTAAAAAC CCCCGCGCCA
AATCAAACAG AAGATCCTGT TAAATCAAGA GGTTCCATAA CAATTTCTAG ATCATTGGAA
GAATCAATAA AAACTGCAGA TGCGATTATT GTTTTAAGAC TCCAGAAAGA GAGAATGATG
GAAAATTTAC TAAGTAGCAT TGATTCATAT AGTTTGGATT ATGGCTTAAC CCCAGAGAAA
TTATCTTTAA ATAATAAAGA AATTCCAATT CTACATCCTG GTCCCATTAA CAGAGATATT
GAAATAAGCA GCAAAGTGGT AGATCGATAT CCTAATTGCT TAATAAATAA TCAAGTTGCA
AATGGAATCC CCATAAGAAT GGCTTTGCTT TATCTATTAC AAAAACACAA CAAGTAA
 
Protein sequence
MQIWPHKHIH TLANFSIKDY ESVFELANRF DALKNAGTKK IPALQGTLVT SLFFEASTRT 
KNSFELAAKR LSADVQTFAP SSSSLTKGET IIDTAITYSA MGADTLVIRH SSSYITFEIA
KKLDAINSKT SVLNAGDGLH SHPSQGLLDI YTLIKFFSPQ TLNPEVLNSK KILIIGDVNH
SRVARSNLWA LSAFGADIIL CGPKALIPDE FINFLKTPAP NQTEDPVKSR GSITISRSLE
ESIKTADAII VLRLQKERMM ENLLSSIDSY SLDYGLTPEK LSLNNKEIPI LHPGPINRDI
EISSKVVDRY PNCLINNQVA NGIPIRMALL YLLQKHNK