Gene NATL1_21151 details

Gene Information

Locus tag: NATL1_21151
Symbol: holA
ID: 4781047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1769647
End bp: 1770675
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 32%
IMG OID: 640085412
Product: DNA polymerase III subunit delta
Protein accession: YP_001015935
Protein GI: 124026820
COG category: [L] Replication, recombination and repair
COG ID: [COG1466] DNA polymerase III, delta subunit
TIGRFAM ID: [TIGR01128] DNA polymerase III, delta subunit
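
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names `gene_length` and `coding_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1769647
END_BP = 1770675
PROTEIN_LENGTH_AA = 342


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


print(gene_length(START_BP, END_BP))     # 1029
print(coding_length(PROTEIN_LENGTH_AA))  # 1029
```

Both values agree with the reported gene length of 1029 bp: 343 codons, of which the last is the stop codon.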


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCAATAC ATTTAATCTG GGGGAATGAT TATGAAGCCT GTAACAGAGA AATAGAAGAA 
CTAATTAATT CAGTCATTGA CTCCTCATGG AAAAGTTTCA ATTATAGTCA TCTAGATGGA
AATGATCCTA AGCAAAATCT AAGAGCACTA GAAGAGGCTC AAAGCGCTCC CTTAGGCAGC
GGAGGCAGGA TTGTGCTAGT TAGAAGAAGT CCATTTTGTA ACGGATGCTC TATTGAGCTT
AGTAATAAAC TTGAACAAGT AATCAAATTA ATTCCTGATA AGACGCATCT CATTTTAAGT
AATTCAAATA AACCTGATAA AAGACTTAAA ACTACTAAAT TAATAGAAAA AAGCATCCAA
TCAAATACTT TATCACAGGA AAAAAGTTTT ATTCTTCCAC TACCATGGGA TATCAATGGG
CAAAGGAATT TAGTGAAGAA TATTTTACAA CAATTAAATC TAAAAATGAA TTATGAAACA
ATTGATTTAA TAGTAGAAAG TATAGGTAAT GATAGCTCTT TAATCAATAC TGAGCTTCAA
AAGCTTTCAT TATTCTCAGA AGCAGTTAAT ACAAACTTAA CTACAGATAA ACCGCGAGAA
ATATCAAAAG AACTAGTCAA AAAACTAATT CAAAATAACT CGACTAATGC ACTAGAAATT
GCCAATTCAC TATTGAAAGG AGAGATAATT ATAGCTCTAA ATAAAATTCA ATCCTTACTT
AAAAATGGAG AGCCAGCTTT ACGATTAATA ACAACGTTGA CTGGTCAATC AAGAGGATGG
CTTTGGGTAC ATCTATTAGA TTCACAAGGG AATCAAGATG TCAAAGAAAT AGCCAAACTT
TCTGGGATTG CAAACCCAAA ACGTATTTTT GTAATTCGCA AACAAATTCA AGGTAAATCT
TTGGAAACAT TGCTTGAATT GATGAAAAAA CTTTTAAAAA TTGAAGCCTC AATAAAATCA
GGAATCAAGC CAATCGATTC TTTTAAAGAT AATCTGCTAA CACACAGTAA ATTTTTGGCT
AAGAACTGA
 
Protein sequence
MPIHLIWGND YEACNREIEE LINSVIDSSW KSFNYSHLDG NDPKQNLRAL EEAQSAPLGS 
GGRIVLVRRS PFCNGCSIEL SNKLEQVIKL IPDKTHLILS NSNKPDKRLK TTKLIEKSIQ
SNTLSQEKSF ILPLPWDING QRNLVKNILQ QLNLKMNYET IDLIVESIGN DSSLINTELQ
KLSLFSEAVN TNLTTDKPRE ISKELVKKLI QNNSTNALEI ANSLLKGEII IALNKIQSLL
KNGEPALRLI TTLTGQSRGW LWVHLLDSQG NQDVKEIAKL SGIANPKRIF VIRKQIQGKS
LETLLELMKK LLKIEASIKS GIKPIDSFKD NLLTHSKFLA KN
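
The relationship between the two sequences above can be reproduced in a few lines. This is a sketch rather than the database's own procedure: it assumes the 10-letter blocks are only a display convention, builds the codon table from the NCBI amino-acid string that the standard code shares with translation table 11, and uses the hypothetical helper names `clean`, `gc_percent` and `translate`.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order; table 11 uses the same
# assignments as the standard code and differs only in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as displayed above.
head = clean("ATGCCAATAC ATTTAATCTG GGGGAATGAT")
print(translate(head))  # MPIHLIWGND
```

Passing the full gene sequence through `clean` and then `translate` should give the protein sequence listed above, and `gc_percent` on the same cleaned string can be compared with the reported GC content.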