Gene P9303_19021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_19021 
SymboldnaQ 
ID4776092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1662829 
End bp1663794 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content52% 
IMG OID640087411 
Productputative DNA polymerase III, epsilon subunit 
Protein accessionYP_001017909 
Protein GI124023602 
COG category[L] Replication, recombination and repair 
COG ID[COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.182153 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGCGG GAGAATACAA GCAATCGATC GATGAACACG GCGTGGGGGA AGCAAAGCAG 
CATCTCTTTG GAGCTGGTCA GTTAGATCTT CTCCCCGATC TCAATGCAGA AGAGGCAGTT
GTCACTCCTT CAAAGGCTGT CTCTACCTCA AAGGCTGTCT CTACCTCAAA GGCTGTCTCT
ACTTCAAAGG CTGTCTCTTT ACCGCAACCT CATGCACCAG TGGCACTGCC TTCATTGCAT
CGAGAAGCTT TGAGCTCACT GCCAGAGATG CTGCTGATCA TCGATACGGA AACCACTGGA
TTGGATCCGA AGCGAGGTCA ATGCCTAGAG GTTGGAGCCA TCCTTTTCCA TGCACCGCAG
CGTGCTGTGC TTGCCCAGCA TTCCTTTTTG CTACCTGTGG AAACCAATGC GGCTGAATCA
ATCAATCGCA TCCCCGCTGA GGTCACTCGC TTGGATCAGC CTTGGCGACA AGGGCTGGAC
TATTTCCAAG CCTTACTGGA TGCCGCTGAT CTGTTAGTTG CTCACAATGC TGGCTTCGAT
CGTCAGTGGT TCGGGAAGGA TCAACTTCCA GCTGTCTCCA AGCCCTGGCT GTGCACGATG
GAAGACATCG CTTGGCCAGT TGATCGTCAG CTTCGTTCCA GGCCTTCTGT AAGAGATTTA
GCTCTCGCTT ATGGCGTGCC GGTATGGGCC GCACATCGTG CTCTCACCGA CTGCATTTAT
CTCGCCGAGG TGTTCGCCCG TTGCAAGGAT CTCGAAACTC TGCTGCTTCA TGGGCTAGAG
CCAAGGCGTT TGATGCGTGC CCAGGTGTCT TATGCACAAA GACATTTAGC CAAGGAAGCT
GGGTTTCGTT GGAATGATCC AATTCAAGGT GCCTGGACTC GACGCTTAAG TGATCGAGAG
GCCGCCAAAC TGGAATTCCA AGTGGTTTCC ATTGATCAAC AAGAGGAGCA GCCATTGAGT
GCATAA
 
Protein sequence
MGAGEYKQSI DEHGVGEAKQ HLFGAGQLDL LPDLNAEEAV VTPSKAVSTS KAVSTSKAVS 
TSKAVSLPQP HAPVALPSLH REALSSLPEM LLIIDTETTG LDPKRGQCLE VGAILFHAPQ
RAVLAQHSFL LPVETNAAES INRIPAEVTR LDQPWRQGLD YFQALLDAAD LLVAHNAGFD
RQWFGKDQLP AVSKPWLCTM EDIAWPVDRQ LRSRPSVRDL ALAYGVPVWA AHRALTDCIY
LAEVFARCKD LETLLLHGLE PRRLMRAQVS YAQRHLAKEA GFRWNDPIQG AWTRRLSDRE
AAKLEFQVVS IDQQEEQPLS A