Gene NATL1_03121 details


Gene Information

Locus tag: NATL1_03121
Symbol: pyrB
ID: 4780421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 289712
End bp: 290731
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 34%
IMG OID: 640083577
Product: aspartate carbamoyltransferase catalytic subunit
Protein accession: YP_001014141
Protein GI: 124025025
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0540] Aspartate carbamoyltransferase, catalytic chain
TIGRFAM ID: [TIGR00670] aspartate carbamoyltransferase
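
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 289712
end_bp = 290731
gene_length_bp = 1020
protein_length_aa = 339

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not
# counted as a residue in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp)                  # 1020
print(codons)                   # 340
print(expected_protein_length)  # 339
```

Under these assumptions the span equals the reported gene length, and 340 codons minus one stop codon gives the reported 339 residues.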


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.789221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATT GGAAACATAA TCACGTTCTT GATTTATCTA CGTTTTCTTT AGAGGATTAC 
AAAACAGTTT TAGAATTAAC AACAAGATTT AAAGATGTCC ATAAATCAAG TTCTAAGAAA
CTTCCCGCTC TTCATGGAAG ATTAATTGCC AACTTATTTT TCGAGCCAAG TACAAGGACA
AGAACTAGCT TTGAGCTTGC AGCAAAGAGA CTATCTGCGG ATGTTCAGAA TTTTTCTGTA
TCTTCAAGTT CTCTAAGCAA AGGAGAGACT CCTTTAGACA CTATTCTCAC ATACATCTCA
ATGGGGGCTG ATATCTTAGT AATTAGGCAT GAGTCAACAA ATGTACCCGC AGAGTTAGCT
AATTACGTTG ACATAAATAA CATCAATACA TCCATTCTCA ATGCAGGTGA TGGATTTCAT
AGCCATCCAA GTCAAGGTCT TTTGGATCTA TTCACACTTG CTACTTTTTT TAATCCAAAT
GAACCATCAA CTAATAGTCT TTTAAATAAA AAAATTACAA TAGTTGGAGA TATTCTTCAT
TCTAGAGTTG CCAGATCAAA CCTTTGGGCT CTAACTGCAT GTGGAGCAGA GGTAACACTA
TGTGGACCTC CAAGCCTTCT TCCAGAAGAA TTCATTGATT TTGTTCAGAA TCCTCGGCTA
GGGCAAAATT TTGATCCCAT TAATAAGAGA GGTTCTGTTT TCATAAAAAG ATCGCTAAAA
GACGCATTAA AAAATAGTGA TGCTGTTATG ACACTTCGAT TACAGAAAGA GAGAATGAAG
CAAAATATGC TTAAGGATCT TGATAGTTAT TATGCACAAT ATGGGATAAC CCATGAAAGT
TTAAAATGGT GTGAAAAGAA AGTTCCTGTC CTTCATCCTG GACCAGTAAA TAGAGGGGTA
GAAATAAGTA ACCGATTAGT TGAAGATAAT TCAATCAATC TTATAAGTAA ACAAGTAGAA
AATGGTATTC CAACAAGAAT GGCTTTGTTG TATTTACTAG GGTTAAATAA AAAAGATTAA
 
Protein sequence
MNNWKHNHVL DLSTFSLEDY KTVLELTTRF KDVHKSSSKK LPALHGRLIA NLFFEPSTRT 
RTSFELAAKR LSADVQNFSV SSSSLSKGET PLDTILTYIS MGADILVIRH ESTNVPAELA
NYVDINNINT SILNAGDGFH SHPSQGLLDL FTLATFFNPN EPSTNSLLNK KITIVGDILH
SRVARSNLWA LTACGAEVTL CGPPSLLPEE FIDFVQNPRL GQNFDPINKR GSVFIKRSLK
DALKNSDAVM TLRLQKERMK QNMLKDLDSY YAQYGITHES LKWCEKKVPV LHPGPVNRGV
EISNRLVEDN SINLISKQVE NGIPTRMALL YLLGLNKKD
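
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method: it builds the codon table by hand in the NCBI base order (T, C, A, G) and translates only the first and last 60-base rows of the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard table; the two differ only in which codons are permitted as starts. The helper name `translate` and the variable names are hypothetical.

```python
# Codon-to-amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in NCBI order with bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence above. Each row holds 60 bases
# (20 codons), so both rows are in frame with the start codon.
first_row = "ATGAATAATT GGAAACATAA TCACGTTCTT GATTTATCTA CGTTTTCTTT AGAGGATTAC"
last_row = "AATGGTATTC CAACAAGAAT GGCTTTGTTG TATTTACTAG GGTTAAATAA AAAAGATTAA"

print(translate(first_row))  # MNNWKHNHVLDLSTFSLEDY
print(translate(last_row))   # NGIPTRMALLYLLGLNKKD*
```

The first result matches residues 1 to 20 of the protein sequence, and the second matches residues 321 to 339 followed by the stop codon (`*`).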