Gene NATL1_15381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15381 
SymbolddpA 
ID4780496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1251281 
End bp1252858 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content31% 
IMG OID640084820 
ProductABC transporter substrate-binding protein 
Protein accessionYP_001015360 
Protein GI124026244 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.318492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAAAC TAAATTGTAT AAAAAATATT TTAAAAGTAT CAAGTATTCT CCTAATACTC 
ATACAGTTAT CTTGCTCCCA ATATAAGAAA AGAGAAAATA TTATTGTTGC AAGTGCAGGT
AAAATTGAAT CACTTGACCC TGCTCAAGCA AATACACTCA GGACATTACA AATATTAAGC
GCTCTTGGAG ATACTCTATA CAAAATAAAT AAGGAAGGGA ATCTATCACC AAGCTTAGCT
AAAGATTTAC CAAAAGTAAG TAAGAATGGT TTGCTAATAG ATATTCCACT CAAAGAAAAT
ATTTCTTTTC ACGATGGAAG TATTTTCAAT GCAGAAGCGA TGGCGTTTAG TCTTAATCGA
TTCAGAAAAA TTGGAACTTT AAATTACCTA TTAAATGACA AAATAGAGGA TATTGAAGTC
AAAGGAAAAT TTCTTTTAAG AATAAAATTA AAAAAACCAT CGAGTTCATT AGCAAGTCTT
TTAACATCAG TAAATTTGAC ACCTGTCTCT CCTGATTCAT ATTCAAACTA TAAAGATAGT
TTCAATAATA AAAAGTTTGT AGGGACAGGA CCTTATTTCT TAGAAAGTTT CAACTCAAGT
CAACAAATAA TAAAGCCATT CAAAAATTAT TGGGGAGAAA AACCCCTAAA TAAAGGTATT
AACTTTATAA ATTATAGTAA TTCTAGTACT CTTTTTGGAG CTATAAAAAC AAAGGAAGTT
GACGTCCTCA TCTCAAATTC TATAGATGAT TTGCAGCGAT TAACATTAAA TAATATGGCT
AAGAAAGATC AACTAAAATC CGGAGAGGGT GATCCAATAG AGATAGGATA CATTACATTT
AAAAGCAATA AATTACCTTT AGAAAATAAA GTAGTTAGGA AGGCTCTTTC CTACACTATT
GATAGAGAAT TAATTAGTCA ACAAGTAAGT TTCGGAACAA GAGAACCATT AAGATCAATT
GTGCCTCCTC AACTACATAA AAAAGAATTT AAGCCATGGC CTAAATATAA TCCTAATACT
GCAAGATCTT TATTAAAAAC AGAAGGCTAC TGTGTAACAG AGATTCTTTC TATTCCATTA
ACATTTAGAT CTAATGTACC TGCAGATAAA TTACTTGCCC TTACTTGGAG AGATCAAATC
AAAAGAGATT TATCTGATTG TTTAGAAATA ACTTTAAATG GAATTGAGTC AACCACAGTC
TACAAACAAC TTTCCGAAGG GGCTTTTGAA GCGGTTATAT TAGATTGGAC TGGGGCATAT
CCTGACCCAG AAGCATATTT AACTCCCTTA CTAAGTTGTA ATGAACTAAA TAATAATTCT
TGCCTCAAGG GTGAAGCTGT ATTCAGTGGT AGTTTTTGGG GTGATAAAAA ATTACAAGAA
CTCTTGGAGA AAAGTGAAGA ACTAGATGGA GAAAACAGAC TAAATAATTT AATAAAAGTT
GAAAAACTTG CAGCACAAGG AGGTGCCTAC TTACCAATTT GGCTCGTTAA TCCTAAAGCT
TGGTCTCTAA AAGATATAAG CCAACCAGAA TTTTCAAAAG ATGGATTAAT TATCCTGAAA
AACTTAGAGA GAGACTAG
 
Protein sequence
MIKLNCIKNI LKVSSILLIL IQLSCSQYKK RENIIVASAG KIESLDPAQA NTLRTLQILS 
ALGDTLYKIN KEGNLSPSLA KDLPKVSKNG LLIDIPLKEN ISFHDGSIFN AEAMAFSLNR
FRKIGTLNYL LNDKIEDIEV KGKFLLRIKL KKPSSSLASL LTSVNLTPVS PDSYSNYKDS
FNNKKFVGTG PYFLESFNSS QQIIKPFKNY WGEKPLNKGI NFINYSNSST LFGAIKTKEV
DVLISNSIDD LQRLTLNNMA KKDQLKSGEG DPIEIGYITF KSNKLPLENK VVRKALSYTI
DRELISQQVS FGTREPLRSI VPPQLHKKEF KPWPKYNPNT ARSLLKTEGY CVTEILSIPL
TFRSNVPADK LLALTWRDQI KRDLSDCLEI TLNGIESTTV YKQLSEGAFE AVILDWTGAY
PDPEAYLTPL LSCNELNNNS CLKGEAVFSG SFWGDKKLQE LLEKSEELDG ENRLNNLIKV
EKLAAQGGAY LPIWLVNPKA WSLKDISQPE FSKDGLIILK NLERD