Gene NATL1_09341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_09341 
SymboldnaE 
ID4780080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp861391 
End bp864909 
Gene Length3519 bp 
Protein Length1172 aa 
Translation table11 
GC content35% 
IMG OID640084211 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_001014757 
Protein GI124025641 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.146562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTTTG TTCCTATTCA TAACCATAGT GACTACAGCC TTCTTGATGG AGCCAGTCAA 
CTCCCTTTAA TGGTTCAACG GGCAAAGGAA TTGGGGATGC CAGCTCTGGC TCTGACTGAT
CATGGAGTAA TGTATGGCGC GATCGAATTA TTGAAGTTAT GTAAGGCCGC GAATATAAAG
CCAATTATTG GGAATGAGAT GTACGTTATC AATGGTTCAA TTGATGACCC TCAGCCCAAA
AAGGAAAAAA GATATCATCT TGTTGTCGTA GCTAAAAACC AAATTGGTTA TGAAAATCTC
GTAAAGTTAA CTACGCTTAG TCATTTAAAC GGTGTTAGAG GAAGAGGGAT TTTTTCAAGA
CCTTGTATAG ATAAGTATTT ATTCAAAAAA TATAGCGAGG GGTTGATATG TTCAACAGCT
TGCTTAGGTG GTGAAATTCC ACAAGCGATT TTAAAAGGAA GAATTGACGT TGCTAGAGAA
GTAGCAGCTT GGTATAAAGA AGTTTTGGGT GATGATTTTT ATCTTGAAAT TCAAGACCAT
GGATCAATTG AGGATAGAAT TGTTAATAGT GAAATAGTCA AAATATCCGA AGAACTTGAT
ATTAAAATTA TTGCTACCAA TGATGCACAT TATTTATCAA AGAATGATAT TGAAGCTCAT
GATGCATTGA TTTGTGTTTT GACTGGAAAG TTAATAAGTG ATCACAAAAG ATTGAGATAT
ACAGGGACTG AATATATTAA ATCTGAGGAT GAAATGAGAA GTTTATTTAC TGATCATTTA
GACAAAAATG TCATAAACAG TGCAATAGAA AATACAGTTA AACTATCAAA TAAAGTTGAA
GAATATAAGA TATTAGGCAC TTATAAGATG CCTAATTTTC CTATACCTGA TGGTTATCAA
CCAATTGAAT ATCTTAAAGA GATAACTATC AAAGGTTTAC TAGAAATTTT AGATATTTCT
AAATTTGAAA ATCTTCCAAT CACATATAAA GAACGACTTG ATTATGAGTT GAAAGTAATA
GAACAAATGG GGTTTCCTAC ATATTTCCTT GTTGTATGGG ATTATATAAG ATTTGCAAGA
GAGCAAAATA TTCCTGTAGG CCCAGGTAGA GGATCAGCAG CTGGCTCTTT AGTCGCTTTT
TCTCTTCATA TAACTAATAT TGACCCAGTA GAGAATGGTT TGTTATTTGA AAGATTTCTC
AATCCTGAGA GAAAGTCAAT GCCTGATATT GATACTGATT TTTGTATTGA AAGACGTGGC
GAAGTTATAG ATTATGTAAC TAAAAAGTAT GGTGAAGATA AAGTTGCACA GATAATTACA
TTTAACAGAA TGACATCTAA GGCTGTTTTG AAAGATGTTG CTCGTGTCCT TGATATTCCC
TATGGAGATG CAGACCGATT AGCGAAATTA ATTCCAGTTG TGAGGGGAAA GCCTGCGAAA
TTGGCAGCTA TGATTTCTAA AGAATCGCCA AATAAAGATT TCTATGAAAA ATACAATAAT
GATTCAAAAG TAAAGAAATG GGTTGATATG GCAATGAGGA TAGAAGGGAC AAATAAGACT
TTTGGTGTTC ATGCAGCAGG TGTAGTTATT GCTGCTAATT CACTTGATAA TTTAGTTCCT
CTTCAAAGAA ACAATGATGG ACAAATAATT ACTCAATATT TTATGGAAGA TATTGAATCA
CTTGGACTTT TGAAGATGGA CTTTTTAGGA CTTAGAAATC TTACAATGAT CGAAAAGACA
ATTGATTTAG TTGAGAAATC AATTGGTAAG AGATTAGATC CTGATTCTTT GCCTTTCACA
GATGAAAAAA CATTCGAACT TCTTTCTAGG GGTGATTTAG AAGGAATTTT CCAACTTGAA
TCTAGTGGAA TGAGACAAAT AGTAAAAGAT CTAAAGCCTT CATCTCTTGA GGATATTTCT
TCAATTCTTG CTCTTTATCG TCCAGGTCCT CTTGATGCAG GATTGATTCC TAAATTTATA
AATAGAAAAC ATGGGAAGGA GAGTATTGAT TTTCAACATC AATCACTTGA GCCAATTTTA
AGTGAGACTT ATGGAATCAT GGTTTATCAA GAGCAGATCA TGAAGATTGC ACAGGATTTA
GCCGGATATA CGCTTGGGCA AGCAGATTTA TTGAGAAGGG CAATGGGTAA GAAAAAAGTA
TCCGAGATGC AGCGCCATAG AACGCTCTTT GTTGATGGAG CTGTTAAAAA TGGTGTCACA
GATGTCATCG CTGAACAGTT ATTTGATCAA ATGGTTTTAT TTGCTGAATA CTGCTTTAAC
AAAAGTCATT CAACTGCTTA TGGAGCGGTT ACTTATCAGA CTGCTTATTT AAAAGCACAT
TATCCTGTCG CTTATATGGC GGCATTGCTT ACCGTTAATG CTGGATCGGC TGACAAGATT
CAAAGATATA TTTCTAACTG TAATTCAATG GGCATAAACG TAATGCCTCC AAATATCAAT
ACCTCTGGTG TTGATTTCAC TCCAAAAGAT AATTCAATTC TTTTTGGTTT TTCGGCTGTC
AAAAATTTAG GTGATGGTGC AATTAGAAAA ATTATCACCT CTAGAGATGA AGATGGACAA
TTTACTTCTT TAGCACAATT CTGTGATCGA ATTTCACTTG GTTCCGTTAA CCGAAGAGGT
CTTGAGGCGT TGATACATAG TGGAGCGCTT GATTGTCTTG AAAAAAATGC AAATCGTGCT
CAGCTTATTG CTGATTTGGA TTTAACTATT GAATGGGCTT CTTCTAGAGC AAAAGATAGA
ACGAGCGGTC AAGGTAATCT CTTCGATTTA TCTAATTCCA CAAATAATGA ATCATCACCG
AATGATGATT ATTCATCAGC TCCAAAGGCG AAAGAAGTCC AAGAGTATCT TCCTTCAGAC
AAACTTAAAT TAGAAAAAGA GCATGTTGGA TTCTATCTAT CTGATCATCC TTTGAAGCAA
CTTTCAGAAC CAGCAAAATT GATTGCTCCT ATCAGCCTTA GTTCTTTAGA AGAGCAAAAA
GATAAGTCAA AGGTTAGTGT TATTGCAATG ATTCCAGAAA TGAGAGAAGT CACAACTAGA
AAAGGTGACA GGATGGCAAT TATTCAATTA GAGGATTTAA CTGGTTCTTG TGAAGCTGTT
GTTTTCCCAA AAAGCTATGA ACGATTATCA GATCATTTGA TGGTTGAAAC CAGGTTATTG
ATATGGGGCA GCGTGGACAG GAGAGATGAA ACTGTTCAAT TGCTTATTGA TGATTGTCGT
GAAATTGATG ACTTAAGATT TCTCTTGATT GATCTTCGTC CTGATCAAGC TACAGATATC
AATATTCAGC ATAAATTAAG AGAATGCCTT TCTAAAAACA GACCTAACAG AAATGAATTA
GGTGTACGTA TCCCAGTAGT AGCATGTCTA AAGGACAACA CTAATACTAG GTATGTAAGG
TTGGGCGATC AATTTTGCGT TAAGGATGCA GACCTGGCTT TGGAGGCATT ATCTAAGAAT
TCCTTCATTG CAAGATCAAG TAAAAGCCTC GTAATTTAA
 
Protein sequence
MAFVPIHNHS DYSLLDGASQ LPLMVQRAKE LGMPALALTD HGVMYGAIEL LKLCKAANIK 
PIIGNEMYVI NGSIDDPQPK KEKRYHLVVV AKNQIGYENL VKLTTLSHLN GVRGRGIFSR
PCIDKYLFKK YSEGLICSTA CLGGEIPQAI LKGRIDVARE VAAWYKEVLG DDFYLEIQDH
GSIEDRIVNS EIVKISEELD IKIIATNDAH YLSKNDIEAH DALICVLTGK LISDHKRLRY
TGTEYIKSED EMRSLFTDHL DKNVINSAIE NTVKLSNKVE EYKILGTYKM PNFPIPDGYQ
PIEYLKEITI KGLLEILDIS KFENLPITYK ERLDYELKVI EQMGFPTYFL VVWDYIRFAR
EQNIPVGPGR GSAAGSLVAF SLHITNIDPV ENGLLFERFL NPERKSMPDI DTDFCIERRG
EVIDYVTKKY GEDKVAQIIT FNRMTSKAVL KDVARVLDIP YGDADRLAKL IPVVRGKPAK
LAAMISKESP NKDFYEKYNN DSKVKKWVDM AMRIEGTNKT FGVHAAGVVI AANSLDNLVP
LQRNNDGQII TQYFMEDIES LGLLKMDFLG LRNLTMIEKT IDLVEKSIGK RLDPDSLPFT
DEKTFELLSR GDLEGIFQLE SSGMRQIVKD LKPSSLEDIS SILALYRPGP LDAGLIPKFI
NRKHGKESID FQHQSLEPIL SETYGIMVYQ EQIMKIAQDL AGYTLGQADL LRRAMGKKKV
SEMQRHRTLF VDGAVKNGVT DVIAEQLFDQ MVLFAEYCFN KSHSTAYGAV TYQTAYLKAH
YPVAYMAALL TVNAGSADKI QRYISNCNSM GINVMPPNIN TSGVDFTPKD NSILFGFSAV
KNLGDGAIRK IITSRDEDGQ FTSLAQFCDR ISLGSVNRRG LEALIHSGAL DCLEKNANRA
QLIADLDLTI EWASSRAKDR TSGQGNLFDL SNSTNNESSP NDDYSSAPKA KEVQEYLPSD
KLKLEKEHVG FYLSDHPLKQ LSEPAKLIAP ISLSSLEEQK DKSKVSVIAM IPEMREVTTR
KGDRMAIIQL EDLTGSCEAV VFPKSYERLS DHLMVETRLL IWGSVDRRDE TVQLLIDDCR
EIDDLRFLLI DLRPDQATDI NIQHKLRECL SKNRPNRNEL GVRIPVVACL KDNTNTRYVR
LGDQFCVKDA DLALEALSKN SFIARSSKSL VI