Gene NATL1_05521 details

Gene Information

Locus tag: NATL1_05521
Symbol: hemC
ID: 4780319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 499840
End bp: 500787
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 41%
IMG OID: 640083829
Product: porphobilinogen deaminase
Protein accession: YP_001014379
Protein GI: 124025263
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
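
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function name cds_lengths is hypothetical.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 499840
END_BP = 500787


def cds_lengths(start, end):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and a terminal stop codon that is not
    counted in the protein length.
    """
    gene_len = end - start + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, "bp;", protein_len, "aa")
```

With the values from this record the function returns 948 bp and 315 aa, matching the Gene Length and Protein Length fields.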


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.276961
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACTAG ACCAACTGCG CATCGCCTCT CGCCGAAGCC AGCTGGCCAT GGTTCAGACG 
AACTGGGTAC GAGATGAACT ACAAAGAGCA CATCCTGATC TTGCTATAAC TATCGAAGCA
ATGGCAACGC AGGGTGACAA AATACTTGAT GTGGCTTTAG CAAAAATAGG AGATAAAGGT
CTTTTCACTA AAGAGCTTGA AGCTCAAATG CTTCTTGGTC ATGCTGAAAT TGCAGTCCAC
TCACTTAAGG ATTTACCCAC GAACCTTCCA GAGGGATTGA TTCTTGGCTG CATCACAGAA
AGAGAAGATC CTTCAGACGC GTTAGTCGTT AATGAGAAAA ATCAAATTCA CAAACTAGAG
ACTCTGCCAG AAGGTTCTGT AGTGGGGACT AGTTCATTAA GAAGGCTTGC TCAATTGAGA
TACCATTATC CACATCTTGT TTTCAAAGAT GTAAGAGGAA ATGTTATTAC ACGATTAGAA
AAGCTTGACT CAGGAGAATA TGACTGTCTT ATCTTGGCTG CTGCTGGATT GCAAAGACTT
GGTTTTGCTA ATCGAATACA TCAATTAATC CCAACAGATA TTTCACTCCA TGCTGTGGGG
CAGGGAGCTC TTGGTATTGA ATGTGTAAGT GGTCAGCAAA AGGTACTAGA TATTTTAAAA
ACACTTGAAC ACGAATCAAC ATCTAAAAGA TGTTTAGCTG AAAGGTCTTT CCTCAGAGAG
CTTGAAGGAG GATGTCAAGT TCCAATAGGC GTTAGAACGG AAATTAACAA CAATGAATTA
ATTCTTGAAG GAATGGTCGC AAGTTTAGAT GGTAAAAGAT TAATTAGAGA TATAAAAAAA
GGATCGGTAA GCTCTGCGGA GGAAATAGGA ATAGACTTAG CCAATGAATT AAAAGGCCGT
GGAGCAGGGG AAATATTAGA AGAGATATTC AAGTCTGCAA GGGCCTAA
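
The GC content field can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the full sequence length; the function name gc_percent is hypothetical, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = "ATGACACTAG ACCAACTGCG"
print(gc_percent(first_blocks))
```

Passing the complete gene sequence instead of the two-block example gives the value to compare against the GC content field.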
 
Protein sequence
MTLDQLRIAS RRSQLAMVQT NWVRDELQRA HPDLAITIEA MATQGDKILD VALAKIGDKG 
LFTKELEAQM LLGHAEIAVH SLKDLPTNLP EGLILGCITE REDPSDALVV NEKNQIHKLE
TLPEGSVVGT SSLRRLAQLR YHYPHLVFKD VRGNVITRLE KLDSGEYDCL ILAAAGLQRL
GFANRIHQLI PTDISLHAVG QGALGIECVS GQQKVLDILK TLEHESTSKR CLAERSFLRE
LEGGCQVPIG VRTEINNNEL ILEGMVASLD GKRLIRDIKK GSVSSAEEIG IDLANELKGR
GAGEILEEIF KSARA
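
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, assuming the sense codon assignments of NCBI translation table 11, which are the same as the standard code. The alternative start codons that table 11 permits are not handled, which does not matter here because the gene starts with ATG. The names CODON_TABLE and translate are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same assignments for sense and stop codons).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string up to, and not including, the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACACTAG ACCAACTGCG CATCGCCTCT"))
```

The three-block example yields MTLDQLRIAS, the first ten residues of the protein sequence listed above.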