Gene NATL1_02011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_02011 
SymbolholB 
ID4779168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp186200 
End bp187159 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content28% 
IMG OID640083466 
ProductDNA polymerase III subunit delta' 
Protein accessionYP_001014030 
Protein GI124024914 
COG category[L] Replication, recombination and repair 
COG ID[COG2812] DNA polymerase III, gamma/tau subunits 
TIGRFAM ID[TIGR00678] DNA polymerase III, delta' subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.847421 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAATT TTCAAAATAT ATGTGGGCAA GATTTAGCAA TTAAAATTTT AAAATCTGCT 
ATTTCAAAAG AACATATTTC TCATGCTTAT TTATTTTCTG GTCCAGAGGG AGTTGGAAGA
AAAAAAACTG CTAAAATTTT CATTAAATCT ATTCTTGACC AAAATCAGGA AAAAAAAAGT
ACAAAAAGAA AAATAAATAA TAATAATCAT CCTGACTTAT TATGGGTAGA GCCATCTTAT
ATAGTCCAAG GTCAAAGTAT TTCTCAGACA AAAGCAATTT TAGATGGTAT AAGTATGAAA
TCGCCTCCGC AAATAAGGCT AAATCAAATT AAAGAAATAA TAGAATTTTT AGGGAAAAAG
CCATTTGAAT CAGAAAGAAG CATAGTAATA ATTGAGGATA TTGAAAGAAT AAATGAATCT
GCAGCAAATG CATTATTAAA AACTCTTGAA GAAACATGTA CAGGATTATT TATTCTTATC
ACACAAAGAC CAGAGAAATT ATTATCAACT ATTAGATCAA GATGTCAGAT AGTCCCATTC
ATACGCCTGA ATGATAATCA AGTAAACAAA ATTATTGAAA AATCAGAAGT TGTTCAAGAA
ATAGATGATA TTCCTAGTGA AAAAATTAAA GAATTAATAG ATTTCTCAAA TGGATCTCCA
GGGCGCTGTA TAGTGAATCT TCAATTTTGG TTATCTTTTT CAACTTCATT AAGACAAAAG
TTAGAAATCC AATTAAATAA TCCAATTGAG TTATTACAAT TAGCCAAGGA AATCACAGAT
GAACTAAACA TAGAGCAACA ACTATGGCTT ATTGATTTTC AACAGAATAG GGCATGGATA
AAAGAAAAAA ATTCAAATAT TGTTGTACAA CTTGAAGAGC TTAGAAAACA ATTATTGAAA
TTTGTGCAGC CTAGACTTGC TTGGGAAGTA ACTTTATTAG AAATAAACTT TCTTAATTAA
 
Protein sequence
MDNFQNICGQ DLAIKILKSA ISKEHISHAY LFSGPEGVGR KKTAKIFIKS ILDQNQEKKS 
TKRKINNNNH PDLLWVEPSY IVQGQSISQT KAILDGISMK SPPQIRLNQI KEIIEFLGKK
PFESERSIVI IEDIERINES AANALLKTLE ETCTGLFILI TQRPEKLLST IRSRCQIVPF
IRLNDNQVNK IIEKSEVVQE IDDIPSEKIK ELIDFSNGSP GRCIVNLQFW LSFSTSLRQK
LEIQLNNPIE LLQLAKEITD ELNIEQQLWL IDFQQNRAWI KEKNSNIVVQ LEELRKQLLK
FVQPRLAWEV TLLEINFLN