Gene NATL1_06041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06041 
SymbolrbcL 
ID4779721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp549278 
End bp550690 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content43% 
IMG OID640083881 
Productribulose bisophosphate carboxylase 
Protein accessionYP_001014431 
Protein GI124025315 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.337887 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAAA AGTATGATGC TGGAGTTAAG GAGTATAGAG ATACTTACTT CACTCCTGAT 
TACGTCCCCC TAGATACTGA TCTACTTGCA TGTTTTAAAT GCACAGGTCA GGAAGGTGTA
CCTAAAGAAG AAGTTGCAGC AGCTGTTGCG GCTGAATCAT CTACAGGTAC ATGGTCAACA
GTTTGGTCAG AATTACTTGT AGATCTTGAA TTCTACAAAG GCCGCTGTTA CCGCATTGAA
GATGTCCCTG GTGACAAGGA TGCCTTCTAT GCATTTATTG CTTATCCCTT AGATCTTTTT
GAAGAAGGAT CTATAACTAA CGTTTTGACA TCACTTGTTG GAAACGTCTT TGGTTTTAAA
GCTTTGCGTC ATCTTCGACT TGAAGATATT CGCTTCCCAA TGGCATTTAT CAAAACCTGC
GGCGGACCAC CTAGTGGAAT AGTAGTTGAA CGTGATCGTC TTAATAAATA CGGTCGTCCA
TTACTTGGTT GTACTATTAA GCCAAAACTT GGTCTTTCTG GTAAAAACTA CGGTCGTGTT
GTTTACGAAT GCCTTCGTGG TGGTCTTGAT CTAACTAAAG ATGATGAGAA TATCAATTCT
CAGCCATTCC AGCGTTGGAG AGAGCGTTTT GAATTTGTTG CTGAAGCTGT AAAGCTAGCT
CAACAGGAAA CTGGTGAAGT TAAAGGTCAC TACCTGAATT GCACAGCAAC TACTCCTGAA
GAGATGTATA AGCGTGCTGA GTTCGCTAAA GAACTTGACA TGCCAATCAT CATGCATGAC
TACATAACTG GTGGTTTTAC TGCTAATACA GGTCTTGCTA ATTGGTGCCG TGAAAATGGC
ATGCTTCTTC ATATTCACCG TGCTATGCAT GCGGTTATCG ACCGTCATCC TCAGCATGGT
ATCCACTTCA GAGTTCTTGC TAAGTGTTTG CGTCTATCTG GCGGAGATCA ACTTCATACC
GGAACAGTTG TTGGAAAACT AGAAGGAGAT CGTCAAACAA CTCTTGGTTA TATCGATAAC
CTACGTGAAT CCTTTGTTCC TGAAGACCGT ACTCGCGGTA ACTTCTTTGA TCAAGATTGG
GGTTCTATGC CTGGTGTATT TGCTGTGGCA TCTGGTGGTA TTCATGTTTG GCATATGCCA
GCATTGCTTG CAATCTTTGG AGATGATTCA TGTCTCCAAT TTGGAGGTGG TACCCATGGA
CACCCTTGGG GATCAGCTGC TGGTGCAGCC GCTAACCGAG TAGCTCTAGA GGCATGTGTG
AAAGCGCGTA ATGCAGGCAG AGAAATCGAA AAAGAAAGTC GCGATATCCT TCTAGAAGCC
GCAAAGCATA GCCCTGAGTT GGCAATTGCT CTTGAGACAT GGAAGGAGAT TAAGTTCGAA
TTCGATACAG TCGATAAACT CGACGTTCAG TAG
 
Protein sequence
MAKKYDAGVK EYRDTYFTPD YVPLDTDLLA CFKCTGQEGV PKEEVAAAVA AESSTGTWST 
VWSELLVDLE FYKGRCYRIE DVPGDKDAFY AFIAYPLDLF EEGSITNVLT SLVGNVFGFK
ALRHLRLEDI RFPMAFIKTC GGPPSGIVVE RDRLNKYGRP LLGCTIKPKL GLSGKNYGRV
VYECLRGGLD LTKDDENINS QPFQRWRERF EFVAEAVKLA QQETGEVKGH YLNCTATTPE
EMYKRAEFAK ELDMPIIMHD YITGGFTANT GLANWCRENG MLLHIHRAMH AVIDRHPQHG
IHFRVLAKCL RLSGGDQLHT GTVVGKLEGD RQTTLGYIDN LRESFVPEDR TRGNFFDQDW
GSMPGVFAVA SGGIHVWHMP ALLAIFGDDS CLQFGGGTHG HPWGSAAGAA ANRVALEACV
KARNAGREIE KESRDILLEA AKHSPELAIA LETWKEIKFE FDTVDKLDVQ