Gene P9211_18391 details


Gene Information

Locus tag: P9211_18391
Symbol:
ID: 5731691
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1670668
End bp: 1671783
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 37%
IMG OID: 641286226
Product: NAD binding site:D-amino acid oxidase
Protein accession: YP_001551724
Protein GI: 159904380
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID:
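
The coordinate and length fields above agree with one another. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon; `gene_length` and `protein_length` are hypothetical helper names used only for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(1670668, 1671783)
print(length)                  # 1116
print(protein_length(length))  # 371
```

Both values match the Gene Length and Protein Length fields listed in the record.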


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.299081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.626898
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAACA GCAATCCAAG TATTAAACGA AATAATCATT TCAAAAATAT TGCAGTAATT 
GGTGGAGGAG TTATAGGAAG CACAACTGCA CTACAGCTAG CAACTCTAGG GTATGAAGTG
GAAATTATTG ATCCTGAACT CAATCAATCA ACAAACTTTT CCAAACTGCT AACAGGTACT
CAAGCTTCCT TAGGAGTGCT CATGGGAAAT GTATTTAGAA GAAATACAGG GCGCAGCTGG
GCTTTAAGGC AAAGAAGTAT GGAGCTTTGG CCAAAGTTAA TCTCCAAGCT AAGTACACAA
ACAAGTCCTC TGAAGCTTTA TACACCACTA GTTCAATTGG CGAGATCAGA ACATGAAGCA
ACACTAATGA ATGAACTAAT CATTAAACGA AGCCACCTAG GTTTAGAACA CTTAACCAAT
CATTCACCTA CTAAAGTTAG TCGTTTATGG CCTAAAGCAA AGTATGGTGG GTTAATCTCA
AATAATGATG GACGCATTAA TCCATTAAAT TTAATGGTTT GTCTAATGAA AGCCTTGGAC
AAATATAAAG TAAGCAAAGT CAATAGGAAA GTATCTAGCC TTGAAAGATT ACCCTCTAGT
CAAAATAAAA GATGGCAACT TCAATTAGAC AATAAAAGAA TCCTTCAAAA AGATTGTGTT
GTTATTTGTG CGGCTATAGG CAGTGAAGCA TTACTAAAAC CATTAGGTCA TCACTATCCT
ATTGAATCAG TCTTGGGCCA AGCAATAAGG CTAACAATCA AAAGTGACTT TAAAAATTGG
TCTGGATGGC CAGCTGTATT AATTAATCAT GGAGTAAATT TAATACCGGA AAATACTAAT
CAGCTTTTAA TAGGTGCAAC CCTAGAACCT GGAATAACTA CAAGTAATAC AGCTCTAACA
ACAATGAGAG AAATGCATGG GTCCTCTCCT GAATGGATGC AATTTGCTTC GATTGAGAAT
AAATGGACAG GGATCAGAGG AAAACCTAGC AACGAACCTG CTCCCTTGCT CAAAAACCTT
GAGCAAGGGC TAATTCTTAA CACTGCACAC TATCGTAATG GTATTCTCCT TGCTCCAGCA
TGTGCTGAAT GGGTTGGATA TGAATTAATT AAATAA
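
The GC content reported in the Gene Information section can in principle be recomputed from a listing like the one above. A minimal sketch, assuming the sequence is pasted as whitespace-separated blocks of bases as shown here; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool:

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first line of the listing is used here, to keep the example short.
first_line = "ATGACTAACA GCAATCCAAG TATTAAACGA AATAATCATT TCAAAAATAT TGCAGTAATT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the GC content field.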
 
Protein sequence
MTNSNPSIKR NNHFKNIAVI GGGVIGSTTA LQLATLGYEV EIIDPELNQS TNFSKLLTGT 
QASLGVLMGN VFRRNTGRSW ALRQRSMELW PKLISKLSTQ TSPLKLYTPL VQLARSEHEA
TLMNELIIKR SHLGLEHLTN HSPTKVSRLW PKAKYGGLIS NNDGRINPLN LMVCLMKALD
KYKVSKVNRK VSSLERLPSS QNKRWQLQLD NKRILQKDCV VICAAIGSEA LLKPLGHHYP
IESVLGQAIR LTIKSDFKNW SGWPAVLINH GVNLIPENTN QLLIGATLEP GITTSNTALT
TMREMHGSSP EWMQFASIEN KWTGIRGKPS NEPAPLLKNL EQGLILNTAH YRNGILLAPA
CAEWVGYELI K
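
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation, assuming the amino acid assignments of the standard genetic code (which translation table 11 shares; table 11 differs in its permitted start codons, and this gene begins with ATG, so that difference does not come into play here). `translate` and `CODON_TABLE` are hypothetical names used only for illustration:

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate blocked listings
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First seven codons of the gene sequence listed above.
print(translate("ATGACTAACA GCAATCCAAG T"))  # MTNSNPS
```

The output matches the first seven residues of the protein listing.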