Gene NATL1_14901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_14901 
SymbollysA 
ID4781148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1200167 
End bp1201531 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content36% 
IMG OID640084771 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001015312 
Protein GI124026196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.813083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGAT CAAAGGCTTA TGAACCTAAT GTGGATATTG ATAGTCCAAA TCGAAATATA 
GCTCCGATCA CTTCAGAAAT TAATGAGAGT AAAAAATTAG TTGTTGGAGG ATGTCAACTC
AGTGAACTAG CGAAAAAATA TGGCACACCT CTTTATGTTT TAGATGAGTT TTCACTTAGG
ACTGCATGCA AAACTTATAT TTCTTCTTTA AATAAACATT ACCCAGGGAA GTCACTTCCT
CTATTTGCTT CAAAAGCAAA TAGCTCCCTA GCTATTTGTG CAGTTATTGC TTCTGAAGGT
TTTGGGCTTG ATGCAGTATC AGAGGGTGAA TTACTTACTG CAATAAATGG AGGTGTAAAA
GAGAAAGATA TTGTTTTTCA TGGAAATAAC AAATCTCAGG ATGAATTGAA TTTTGCCTAC
AGTAATAATG TGACGATTGT TTTAGATAAT TATCATGATA TTGAGTTACT AAAAAATATT
GCCTCCGATA ACAAGCCAGC AAAGTTAATG TTGAGGTTTA CTCCTGGAAT TGAATGTCAT
ACTCATGAAT ATATAAGGAC TGGGCATTTA GATAGTAAAT TTGGTTTTGA TCCTGATGAT
CTTAAGTCAA TTTTAGAAGA ATTAAAAACG TATAAGTGGG CTAATTTAAC TGGTTTACAT
GCACATATAG GGTCTCAAAT TTTTGAAGTT CAACCCCATA TCGATCTTGC TGGCGTTATG
GCTGATGCTT TAAAGCTTGC TAAGGAAATT GGTCATCCAG TTGTTGATCT AAATTTAGGA
GGCGGTTTAG GGATTAAATA TGTTCAAGAA GATAATCCTC CCTCTATTGA AAAATGGGTT
GAAATTATTT CTAAGGCTGT TGTTAAGGCT TGTAGGGAAA GAAATCTTGA TTTACCAAGA
TTAATGTGTG AACCGGGAAG ATCTCTTGTC GCTAATTCGG GGCTCACTAT TTACAAGATT
GGAGCTAAAA AAGTTGTCCC TGGTGTCAGA ACTTATTTAT CTGTTGATGG AGGGATGAGT
GATAATCCTC GTCCAATAAC CTATCAGTCT CTTTACAGTG CATGTTTAGT CGATAAACCA
ATGAATACAA ATTTTGAAAA AGTCACAATA GCCGGGAAGC ATTGTGAGTC TGGAGATGTT
TTATTGAAAG ATTTTCTACT TCCTTCTTGT GAAAGTGGCG ATTTTCTTGC TGTGTTTGGA
ACGGGTGCAT ACAACTATTC AATGAGTTCC AATTACAACA GAATACCTAG ACCTGCGACA
ATTATGGTTG GGAAAGGTTC GGCCGAGTTG ACTCAAAGGA GAGAACTTCC TGAAGATCTA
TTGCAATTAG ATGTATTGCC CGATCGCTTT ATTCCCAAGA ATTAG
 
Protein sequence
MHGSKAYEPN VDIDSPNRNI APITSEINES KKLVVGGCQL SELAKKYGTP LYVLDEFSLR 
TACKTYISSL NKHYPGKSLP LFASKANSSL AICAVIASEG FGLDAVSEGE LLTAINGGVK
EKDIVFHGNN KSQDELNFAY SNNVTIVLDN YHDIELLKNI ASDNKPAKLM LRFTPGIECH
THEYIRTGHL DSKFGFDPDD LKSILEELKT YKWANLTGLH AHIGSQIFEV QPHIDLAGVM
ADALKLAKEI GHPVVDLNLG GGLGIKYVQE DNPPSIEKWV EIISKAVVKA CRERNLDLPR
LMCEPGRSLV ANSGLTIYKI GAKKVVPGVR TYLSVDGGMS DNPRPITYQS LYSACLVDKP
MNTNFEKVTI AGKHCESGDV LLKDFLLPSC ESGDFLAVFG TGAYNYSMSS NYNRIPRPAT
IMVGKGSAEL TQRRELPEDL LQLDVLPDRF IPKN