Gene NATL1_05551 details


Gene Information

Locus tag: NATL1_05551
Symbol: priA
ID: 4780307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 502621
End bp: 504864
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 37%
IMG OID: 640083832
Product: primosomal protein N' (replication factor Y)
Protein accession: YP_001014382
Protein GI: 124025266
COG category: [L] Replication, recombination and repair
COG ID: [COG1198] Primosomal protein N' (replication factor Y) - superfamily II helicase
TIGRFAM ID: [TIGR00595] primosomal protein N'
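
The gene length and protein length listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 502621
end_bp = 504864

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 2244
print(protein_length)  # 747
```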


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.319885
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCTT TTATATTTGA TATTTGGTTA CATGTAGGTC GTGAAGGGCG ATGTTTTTCT 
TATCAAGATG GAAATAATTT AGACATTGAT TTAGGCGATG TTGTGACAGT GCGTCTGAAA
GGGCAACGCA TGCAAGGGTT GGTTGTTAAG AAGATGAAAA AAAACATAAA TAGTACACAC
CAAAATTTAA ATAATTTTTC GCTGAATAAT GTAGAAACAT TGGTTCAAAA AGCAGCTATC
AAAAAAGAAT GGAGAGAGTG GCTAGAAGAA ATTGCTCTTG ATTTATATGT AAGTGATTTT
CAGATGCTTA AGACCGCGTT ACCTCCTGGT TGGTTAGGAA GATCGAAACT ATCGAATAGA
CCTAAAAAAC TCTGGTGGGT AAAATTGTCT AGCAATAATT ATGAGGGAAA GATATCTTCT
CGACAGATTG AGTTAAAGAA AAATCTTCTC TTAAATGGAG GAGGAAAATG GCAAAAGGAT
TTGGAGGCTG AAGGATTTTC TTCTGTATTG ATTAGAAACT TTGTCTCAGT TGGTTGCGGA
GAAAGAGAAA AACGTTTTTA TCTTTTTAAT TCTTTTGATA ATGAAGAATC TAATGAAAAA
AAGATGTTAA AGATTGAGGA ACCTCAACCT TTAACACTTG AGCAAAAATT AGCAAAAGAA
AAATATGAGT CTCTTCCAAA CGGATCAGCT CTTTTGCTTT GGGGTGTTAC TGGGTCTGGG
AAGACGGAGG TATACCTACA AATTGCAGCG CAAGAGTTAT CTGAAAGTAG ACATTGTCTT
ATCCTTACAC CAGAAATTGG CTTAGTACCA CAATTAGTTG ATCGCTTTCG AAAAAGATTT
GGGTTAAATG TTTTTGAATA TCATAGTAAT TGTTCTCCTA AAGAGAAAAT TGAGACATGG
AAGAGAGCTT TAGACAACAC AAAACCTAGT GTTTTTATTG GTACTCGATC AGCTATTTTT
CTGCCTTTAT CCAGCTTAGG ATTGATAGTT CTTGATGAAG AACACGATAG CTCTTTTAAA
CAGGAATCCC CTATGCCTTG CTATCATGCA AGAGAATTGG CAATTCATAG AGCAAAAAAA
ACAAGTGCTA AAGTAATACT TGGAACAGCA ACTCCATCTT TGAATGTTTG GAAAAATTTA
AAACCAAATG GAAATGTAGT TGTTGCAAAA TTAACCAAAA GGATTTCGAA TCGTAAATTA
CCAACGGTTA GTGTAGTAGA CATGCGAGAG GAATTAGCGC TTGGAAATCG AAGCTTAATT
AGTAGATATC TAAAAAAACA ACTTTTGAGT ATAAAAGAGA GTGGAAATCA AGCTATTGTT
TTAGTCCCTA GACGTGGATA TAGTAGTTTT TTAAGTTGCC GCAGTTGTGG AGAGGTCGTT
CAATGTCCAC ATTGTGATAT TTCACTAACT GTACATCGTT CTAAAGAGGG TAATCAATGG
TTGCGTTGTC ATTGGTGCGA CTTTCGTTCG AAAATTAGTG ATAAGTGTGG AGAATGTGGT
TCAAATGCTT TTAAACCGTT TGGGACTGGA ACACAAAGAG TGATGGATCA TTTAGAAAGA
GAACTAGAGG GTATAAGTTT ATTAAGGTTT GATAGAGATA CAACTAGAGG CCGTGATGGC
CATAGATTGT TGCTAGAAAG ATTTGCTGAT GGTGACGCCG ATATTTTAGT AGGTACTCAG
ATGCTATCTA AGGGAATGGA TTTACCAAAG GTAACTCTTG CCGTCGTCTT AGCAGCAGAT
GGTTTATTGT ATCGTCCTGA TTTAATGGCT ACTGAAGAAA CGCTTCAATT ATTTATGCAA
TTAGCTGGTC GTGCAGGGCG AGGTGAGCAA CCTGGAAAGG TTGTAGTGCA AACTTATTGT
CCTGATCATC CAGTGATTCT TCATTTGATT GATGGGAGTT ACGAAGAATT TCTGAAAAAA
GAAGAGAAGA CTAGAAAAGA AGCCTCGATG GTTCCATACA GTCGAGCCTG CTTATTAAGG
TTCTCTGGCG AATCTTCAGA GTTGGCATCA CAAGGAGCAT TTAATGTTTT ATCAAAAATA
AAGAATGCTT GCAGTCAAAA AGGTTGGAAA TTAGTCGGTC CAGCACCTTC ATTAGTTGAG
AGAGTCGCAG GTAAAAGCCG TTGGCAACTT CTTTTGTACG GTCCAGAATC AAGTCATATA
CCACTCCCTT ATGGACCTGA ATTATGGAAA GATTTACCAA AAGGAGTAAC TCTTTCTATT
GATCCTGATC CTCTACAATT ATGA
 
Protein sequence
MNSFIFDIWL HVGREGRCFS YQDGNNLDID LGDVVTVRLK GQRMQGLVVK KMKKNINSTH 
QNLNNFSLNN VETLVQKAAI KKEWREWLEE IALDLYVSDF QMLKTALPPG WLGRSKLSNR
PKKLWWVKLS SNNYEGKISS RQIELKKNLL LNGGGKWQKD LEAEGFSSVL IRNFVSVGCG
EREKRFYLFN SFDNEESNEK KMLKIEEPQP LTLEQKLAKE KYESLPNGSA LLLWGVTGSG
KTEVYLQIAA QELSESRHCL ILTPEIGLVP QLVDRFRKRF GLNVFEYHSN CSPKEKIETW
KRALDNTKPS VFIGTRSAIF LPLSSLGLIV LDEEHDSSFK QESPMPCYHA RELAIHRAKK
TSAKVILGTA TPSLNVWKNL KPNGNVVVAK LTKRISNRKL PTVSVVDMRE ELALGNRSLI
SRYLKKQLLS IKESGNQAIV LVPRRGYSSF LSCRSCGEVV QCPHCDISLT VHRSKEGNQW
LRCHWCDFRS KISDKCGECG SNAFKPFGTG TQRVMDHLER ELEGISLLRF DRDTTRGRDG
HRLLLERFAD GDADILVGTQ MLSKGMDLPK VTLAVVLAAD GLLYRPDLMA TEETLQLFMQ
LAGRAGRGEQ PGKVVVQTYC PDHPVILHLI DGSYEEFLKK EEKTRKEASM VPYSRACLLR
FSGESSELAS QGAFNVLSKI KNACSQKGWK LVGPAPSLVE RVAGKSRWQL LLYGPESSHI
PLPYGPELWK DLPKGVTLSI DPDPLQL
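
The gene and protein sequences above are printed in blocks of ten characters. As a rough sketch of how the two relate, the Python below strips the spacing from a block-formatted nucleotide string, translates it codon by codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and chosen for illustration. The codon-to-amino-acid mapping is the standard one, which translation table 11 shares for internal codons; the table's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon table in TCAG order (same codon assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence as printed in the record.
first_line = clean("ATGAATTCTT TTATATTTGA TATTTGGTTA CATGTAGGTC GTGAAGGGCG ATGTTTTTCT")
last_line = clean("GATCCTGATC CTCTACAATT ATGA")

print(translate(first_line))  # MNSFIFDIWLHVGREGRCFS
print(translate(last_line))   # DPDPLQL
```

Both outputs match the first twenty and last seven residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` would give the value to compare against the listed GC content.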