Gene P9211_10961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10961 
SymbollysA 
ID5731074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp996900 
End bp998267 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content38% 
IMG OID641285463 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001550981 
Protein GI159903637 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.297348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.231621 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTTT CAAGGCTCTT CGACTTAAAT AAGGATCAAA ACAGTCCCAA TAGAAACATA 
ACTCCAATTA CTGCTGAACT GGACAATTCA GATAGAATGA CAGTTGGCGG ATGTTTGCTT
AGTGATCTTG CCAATCAATA TGGGACACCT CTTTATGTAA TTGATGAGGC CAGTATTAGA
AGCTCTTGCA GGGCATATAG GAAGGCTCTC AAACAAAGCT ACCCTGGGGA TTCTTTTGTT
TTATACGCCT CTAAAGCAAA TAGTTCTTTG GCTATTGATA GGATTGTTGC TTCAGAAGGA
CTGGGTATAG ACGTAGTTTC CGAGGGTGAG TTAATTACAG CCTTAAAAAG TGGTGTCGCA
GGGGAACAAA TTGTTTTACA TGGAAATAAC AAGTCTGACA AGGAGTTGTT ACTAGCGCAT
GAGAGTAACG CAACTATTAT CATTGATAAT CAACATGATA TACAGCGCTT GGATAAACTA
ATTAGCCATA AAGAAGGTAG TGTTAGATTA ATGTTACGCT TTACCCCAGG AATAGAATGC
CATACGCATG AATATATTCG TACAGGTCAT TTAGATAGTA AGTTTGGGTT TGACCCTGAG
CAAGTTTCAG ATACATTTGC ACAATTAAAG GATTATAAAT GGGCAAAATT AGTTGGTCTT
CATGCACATA TAGGCTCTCA AATTTTCGAA TTATCACCTC ATATGGATTT GGTAGAAGTT
ATGGCAGATT TCTTTTTAAG AGCAAAAGAT CTAGGTCATC CTATAAAAGA CTTAAATATT
GGCGGAGGAC TTGGTGTTAA ATATATTCCT TCTGATGATC CCCCAGATAT TTATAGTTGG
GTAGAAACTG TATCAAATGC TGTTATTAAA GCATTTGATA CAAGAAATAT TGAATTGCCA
AGACTGATTT GTGAGCCTGG GAGGTCAATT ATTGCTACTG CAGGGCTAAC TCTCTATAGA
ATTGGAGCTC GTAAAGATAT TCCAGGAGGA AAGACCTATC TGTCAATAGA TGGAGGAATG
AGTGATAACC CTCGCCCAAT AACTTACCAA TCCACTTATA CAGCTTGTTT AGTTGACAGA
CCATTGGCTA ATACTGATCA GGTCGTCACA ATTGCTGGAA AACATTGTGA ATCAGGAGAT
ATTCTTTTAA ATAATATTGC TTTACCCACC GCTTCTAGTG GTGATGTCCT GGGTGTTTTT
GGCACAGGAG CTTATAACCT TTCTATGAGC TCTAATTACA ATAGAATTCC AAGACCGGCT
TCAGTTTTGG TTAATAATGC ACAGTCAGAT CTTGTTCAAG TAAGAGAGTT GCCTGAAGAT
CTATTGCGAT ATGACCGCCT TCCAGATCGC TTTATTGCCA AAGGGTAG
 
Protein sequence
MGVSRLFDLN KDQNSPNRNI TPITAELDNS DRMTVGGCLL SDLANQYGTP LYVIDEASIR 
SSCRAYRKAL KQSYPGDSFV LYASKANSSL AIDRIVASEG LGIDVVSEGE LITALKSGVA
GEQIVLHGNN KSDKELLLAH ESNATIIIDN QHDIQRLDKL ISHKEGSVRL MLRFTPGIEC
HTHEYIRTGH LDSKFGFDPE QVSDTFAQLK DYKWAKLVGL HAHIGSQIFE LSPHMDLVEV
MADFFLRAKD LGHPIKDLNI GGGLGVKYIP SDDPPDIYSW VETVSNAVIK AFDTRNIELP
RLICEPGRSI IATAGLTLYR IGARKDIPGG KTYLSIDGGM SDNPRPITYQ STYTACLVDR
PLANTDQVVT IAGKHCESGD ILLNNIALPT ASSGDVLGVF GTGAYNLSMS SNYNRIPRPA
SVLVNNAQSD LVQVRELPED LLRYDRLPDR FIAKG