Gene NATL1_09401 details


Gene Information

Locus tag: NATL1_09401
Symbol: dnaG
ID: 4779445
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 867908
End bp: 869785
Gene Length: 1878 bp
Protein Length: 625 aa
Translation table: 11
GC content: 35%
IMG OID: 640084217
Product: DNA primase
Protein accession: YP_001014763
Protein GI: 124025647
COG category: [L] Replication, recombination and repair
COG ID: [COG0358] DNA primase (bacterial type)
TIGRFAM ID: [TIGR01391] DNA primase, catalytic core
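
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record itself, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on the replicon.
start_bp = 867908
end_bp = 869785

# Inclusive coordinates, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1878 bp
print(protein_length, "aa")  # 625 aa
```

Both values agree with the Gene Length and Protein Length fields of the record.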


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.029627
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0569121
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTAA TTCCTGGCAA GCAATTTTAT TATTGTTTTT CATGTGGAGC TGGTGGAAAT 
GCAATTAAAT TCTTAATGGA GTTTCAGAGA CAAAGTTTTA GTGATGTTGT TCTTGAGCTG
GCTAAGAAAT ATCAAGTACC AATTGATACG ATTGAAGGAC CTCAACAAGA AAGACTCAAG
CAACAGCTTT CACGTAGAGA TACCCTTTAT CGTGTTTTAA AAACCGCTAC TGGTTGGTTT
AGAAATCAAT TGAATTCTCC ATGTGGAGAA AATGCACTTA ATTACCTTAA GAATAAGCGT
AATTTAAGTG ATGGAACTTT AATCAATTTT GAACTTGGTT TTGCTCCAGA TAATTGGGAT
TCACTACTTA AATATTTTGT AGACATAGAA AAAGTCAGTG TTGAAATCCT TGAATCGGCG
GGATTGATTG TTCCTCGAAA GGGTGGTAAT GGTTTCTATG ACAGATTTCG CAATCGAATA
ATTGTTCCTA TTCACGACAG GCAAAAAAGA GTTATTGGTT TCGGCGGACG AAGTCTTGAT
GGTTCAGAAC CTAAGTATTT AAATTCACCT GAGACTGAAA TCTTTGAAAA AGGAAAAAAT
CTTTTTGGTT TTGATAAATC CACACTTTCC ATTAGGAAAA AAGATTATGC AGTTGTTGTA
GAGGGATATT TTGATGTGAT GGCACTTCAT GATTCGGGTA TTACAAATGT TGTCGCTTCT
TTAGGAACAG CTTTAAGTCG CAATCAAATA ACGCTTCTTT CTCGTGCCAC CGATAGTAAA
AAGATCCTCT TAAATTTTGA CTCAGATAAT GCTGGAATTC GTGCGGCTAA TAGAGCCATT
AGTGAAGTAG AAAACCTTGC TATTCAAGGT CAACTAGATT TACGAGTCCT TCAATTACCT
TCAGGTAAAG ATCCAGATGA ATTTCTTAAG GGTAATTCTC CATCCGAATA TGAAGCATTA
GCGGCAAGAT CACCTCTTTG GATGGATTGG CAAATTGATC AATCATTGAA GGATTTAGAT
TTAAGTAAAT CTGATCAATT TCAGGAAGCC GTTAGCAGCT TAGTAAGTCT CCTTGGAAAG
CTTCCTCAAA CTGCAATAAG AACTCATTAT CTACAGAAGG TTGCTCAGCG TCTTAGTGGA
GGTCAAGGTA GATTCGCTCT ACAACTAGAG GAGGATTTAC GTAATCAAAT AAGTGGTCAA
AGATGGCACG GTCGCTCGAA AAAAATTGAT AAGCCTCAAG AAATTAGTCT CAGAGAAAGA
AGTGAGTCAG ATATACTTTT TACTTATATT CACTGTCCTA ATTACAGATC TTTTATTCGT
TATGAACTTC GCTTAAGGGA TCTTGATGAT TTTGCGATTA ATCATCATCG TGCAATATGG
TCTACAATAA GTAACATTGA GGAAAATATG TTTGGTCCAG AAACTGTTGA GAAGATTAAT
CGTTTTAATG ATTCTAATAA TATTTTAGCT GATGTTGATT TAATTAAAAA GTTGTTAGAC
AATTTCCTAT CCAGTGATAA TGAGCATCTT CCTAAACTTA CTCCTTTACT AGATGTTAAT
GAACTTCGTT TGGCAACATT AAATGACCCC GAGTCGTTCA TCCGTGGAGC TATGGCTGCT
CTTGAAAAGC AAAAATCCTT AAAACGTTGT AGACATTTAA TTGATGCATG GAGTTCACAG
AGATTGCAAA CTCTTGAGAA CTGTATAGCC TCTCTTATTG TTCAGGAAAA ATCTGAGCCT
AGCGATTCAT CTGATATGGA ACAGAGGGTT ATTAGCATGT TTGAAGACTT AAATAATGAT
GCTATAAATT TTCAACAACT TTATTACGCT GAAAGAAAAC ACATACTAAA TCTAGATCAA
CAGAGATGTT ATAAATAA
 
Protein sequence
MSVIPGKQFY YCFSCGAGGN AIKFLMEFQR QSFSDVVLEL AKKYQVPIDT IEGPQQERLK 
QQLSRRDTLY RVLKTATGWF RNQLNSPCGE NALNYLKNKR NLSDGTLINF ELGFAPDNWD
SLLKYFVDIE KVSVEILESA GLIVPRKGGN GFYDRFRNRI IVPIHDRQKR VIGFGGRSLD
GSEPKYLNSP ETEIFEKGKN LFGFDKSTLS IRKKDYAVVV EGYFDVMALH DSGITNVVAS
LGTALSRNQI TLLSRATDSK KILLNFDSDN AGIRAANRAI SEVENLAIQG QLDLRVLQLP
SGKDPDEFLK GNSPSEYEAL AARSPLWMDW QIDQSLKDLD LSKSDQFQEA VSSLVSLLGK
LPQTAIRTHY LQKVAQRLSG GQGRFALQLE EDLRNQISGQ RWHGRSKKID KPQEISLRER
SESDILFTYI HCPNYRSFIR YELRLRDLDD FAINHHRAIW STISNIEENM FGPETVEKIN
RFNDSNNILA DVDLIKKLLD NFLSSDNEHL PKLTPLLDVN ELRLATLNDP ESFIRGAMAA
LEKQKSLKRC RHLIDAWSSQ RLQTLENCIA SLIVQEKSEP SDSSDMEQRV ISMFEDLNND
AINFQQLYYA ERKHILNLDQ QRCYK
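
The relationship between the gene sequence and the protein sequence can be checked with a few lines of Python. This is a minimal sketch, not the tool used to generate the record. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs mainly in which codons may act as start codons), and it translates only the first and last blocks of the gene sequence shown above. The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
import itertools

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line and last line of the gene sequence in this record.
first_block = "ATGTCAGTAA TTCCTGGCAA GCAATTTTAT TATTGTTTTT CATGTGGAGC TGGTGGAAAT"
last_block = "CAGAGATGTT ATAAATAA"

print(translate(first_block))  # MSVIPGKQFYYCFSCGAGGN
print(translate(last_block))   # QRCYK
```

The two outputs match the first twenty and last five residues of the protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field; the result is not reproduced here.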