Gene A9601_05521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05521 
SymbolhemC 
ID4717251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp479244 
End bp480194 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content35% 
IMG OID640078264 
Productporphobilinogen deaminase 
Protein accessionYP_001008945 
Protein GI123968087 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0181] Porphobilinogen deaminase 
TIGRFAM ID[TIGR00212] porphobilinogen deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.132832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTAATT TTAAGCTAAA AATAGCTAGT AGAAGAAGTA AGCTAGCAAT GGTTCAAACT 
TTATGGGTTA AAGATCAACT AGAAAAAAAT ATTCCCAATT TAGAGGTATC TATAGAAGCC
ATGGCAACTC AAGGTGACAA AATCCTTGAT GTAGCCTTAG CAAAAATAGG CGACAAAGGC
TTATTTACAA AAGAGCTTGA AGCACAAATG CTCGTAGGCC ATGCAGATAT AGCAGTACAT
TCTCTAAAAG ATTTACCAAC CAATTTGCCT AATGGACTTA AATTAGGATG CATTACAAAA
AGGGAGGATC CTGCAGATGC TTTAGTAGTA AACAAGAAAA ATGACTGTTA TAAATTAGAA
AACTTACCTG AAGGTTCGAT TGTTGGAACA AGCTCTCTAA GAAGACTTGC ACAATTAAGA
AATAAGTACC CACATCTTGT TTTCAAAGAT ATCAGGGGAA ATGTTATTAC AAGAATTGAA
AAATTAGATG CAGGAGAATT TGATTGTATA ATTCTTGCGG CCGCTGGTTT AAAGAGATTA
GGCTTTGAAT CAAGAATTCA CCAGATTATC CCAAGTGAAG TTTCCCTTCA TGCCGTTGGC
CAAGGAGCAC TAGGCATTGA ATGTAAATCT GATGATAAAA AAGTTTTAGA AATTATAAGT
ATCTTAGAAG ATAAACCCAC CTGTCAAAGA TGTCTAGCAG AAAGGGCTTT TTTAAGAGAG
CTTGAAGGTG GATGCCAAGT CCCAATAGGT GTGAATAGTA AAATTCAAAA TGAACAACTT
TGCCTTACTG GTATGGTTGC ATCTCTTGAT GGAGAAAGGC TTATTAAAGA TCAATATATT
GGCGATATTA ATGATCCCGA AGAAGTAGGC AAAGAACTAG CTAAAAAATT AAAGCAGCAA
GGTGCCGAAG AAATACTAAG CGAAATATTT AAAAAATTTA GAGAAAAATA A
 
Protein sequence
MTNFKLKIAS RRSKLAMVQT LWVKDQLEKN IPNLEVSIEA MATQGDKILD VALAKIGDKG 
LFTKELEAQM LVGHADIAVH SLKDLPTNLP NGLKLGCITK REDPADALVV NKKNDCYKLE
NLPEGSIVGT SSLRRLAQLR NKYPHLVFKD IRGNVITRIE KLDAGEFDCI ILAAAGLKRL
GFESRIHQII PSEVSLHAVG QGALGIECKS DDKKVLEIIS ILEDKPTCQR CLAERAFLRE
LEGGCQVPIG VNSKIQNEQL CLTGMVASLD GERLIKDQYI GDINDPEEVG KELAKKLKQQ
GAEEILSEIF KKFREK