Gene P9303_29381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29381 
SymbolmazG 
ID4776641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2598423 
End bp2599421 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content54% 
IMG OID640088462 
Productnucleoside triphosphate pyrophosphohydrolase 
Protein accessionYP_001018933 
Protein GI124024626 
COG category[R] General function prediction only 
COG ID[COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain 
TIGRFAM ID[TIGR00444] MazG family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCT GCGGCTTAAT CCCCAACCAA TCCTGTCACT GCTGCTCGGG CTGGAAGCCA 
GCGTTTGGCT ACACCTCATT CTCGATGGCG ATCCCCTCCC CGCCGAATGG CATCGCTGGC
GTCATCGATG AGAAGGTAGG CCAATACAAA CCAATGGCCA TGGACGCCGA ACAGCATCTG
GCTCCCACAG AAGCCATCGC GGAGCTGGTC AACATCGTGG CCCAGCTCAG GGATCCAAAG
GGAGGTTGCC CCTGGGATCT GGAGCAGACC CATACCTCCT TGATTCCATG CATGTTGGAA
GAAGCCCACG AAGTGGCCGA CGCCATCCGC AACGGCGATG ACAACCACCT CAGCGAAGAA
CTAGGGGACC TTCTGCTGCA GGTCGTGCTA CATGCTCAGA TCGCTAACGA AGAAGGACGC
TTCAATCTTG AAGACATCGC CCGAAGCATC AGCGCAAAGC TGATTCGCCG ACACCCACAT
GTGTTCGCAG AGGCAGTCGC AATCGACAGC GAAGCAGTTC GGCAAAGTTG GGAATCGATC
AAAGCGAGCG AGCAACCCAG CTCAGCCTCT AAAAGTCCGC TAAGCGATCG TCTACGTAGC
AAGGTCAGAG GTCAGCCAGC TCTGGCTGGA GCGATGGCCA TCTCCAAAAA GGTCGCGAAC
GTTGGCTTCG AGTGGAACAC CATCGATGGA GTATGGGGAA AAGTGCAAGA AGAGTTCGAG
GAACTCAAAG AGGCGGTAGA GCATGAAGAC CAAGCCCATG CACAAACAGA ACTTGGTGAT
GTGCTGTTCA CTCTTGTGAA TGTTGCTCGC TGGTGCGGCC TAAACCCAGA AGAAGGCCTT
GCAGGTACAA ACCAACGCTT TCTTGATCGC TTTTCTCGCG TTGAAGCAGC ACTCGAGGGC
CAGCTAAGCG GCCAATCACT GACGGAACTA GAACAACTTT GGCAAGAAGC CAAAGCAGCA
ATTCGAGAAG AGGCTGACGA CAAAAAGATA TCGAATTAA
 
Protein sequence
MSSCGLIPNQ SCHCCSGWKP AFGYTSFSMA IPSPPNGIAG VIDEKVGQYK PMAMDAEQHL 
APTEAIAELV NIVAQLRDPK GGCPWDLEQT HTSLIPCMLE EAHEVADAIR NGDDNHLSEE
LGDLLLQVVL HAQIANEEGR FNLEDIARSI SAKLIRRHPH VFAEAVAIDS EAVRQSWESI
KASEQPSSAS KSPLSDRLRS KVRGQPALAG AMAISKKVAN VGFEWNTIDG VWGKVQEEFE
ELKEAVEHED QAHAQTELGD VLFTLVNVAR WCGLNPEEGL AGTNQRFLDR FSRVEAALEG
QLSGQSLTEL EQLWQEAKAA IREEADDKKI SN