Gene P9301_11551 details


Gene Information

Locus tag: P9301_11551
Symbol: ddpA
ID: 4912038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 968408
End bp: 969979
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 30%
IMG OID: 640160741
Product: ABC transporter substrate-binding protein
Protein accession: YP_001091379
Protein GI: 126696493
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
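
The start, end, gene length, and protein length fields above are arithmetically related, and a short check can make that relationship explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 968408
END_BP = 969979
GENE_LENGTH_BP = 1572
PROTEIN_LENGTH_AA = 523


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the coordinate span gives the listed gene length, and the codon count minus the stop codon gives the listed protein length.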


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA AAATTGTTTT ATCAATATTT ATAATTTTAA TTTCTTTTTT ACAGAATTCT 
TGCGGCTCAA AAAGAATATC TAAAAAAATT ATAGTAGCAA GTTCTGGAAA AATTGAATCT
TTAGATCCAG CTAGAGCAAA TACTCTTAAA GCAATTCAAT TAATCAGTTC TCTTGGAGAC
ACATTATATG AATTAAATTC TAAGGGAGAA TTAATACCTG AATTGGCCTC GGGGATGCCA
GTTATTTCAA AGGATAGACT TCAAATAACT ATCAATTTAA GAAAGAATGT TTTTTTTCAC
GATGGAACTG CTTTTAACTC AAATGCTATG AAGTTTACCT TTGATAGATT CAAAAGAATT
GGAACTATGA ACTACATTTT AGGAAATAAG ATTAAATCAA TAGAAACGCC AAGTGAATAT
TCAGTCATAA TAAATTTGAA TAAACCATCA AGTTCTTTAA ATGGTTTACT CACATCAGTA
AATTTAACTC CAATATCCCC TACATTTTAC AAACAATATT CTGATAAGTT TCTAAATGAA
AAATTTGTTG GTACTGGCAA GTATGTGCTG ACCAGTTTTT CTAATGAAGT TCAATCAATT
GATCCATATT TGAATTATTG GGGTGAAAAG CCCTTCAATA ACGGCGTTAA TTTTGTGGGC
TATTCAAATT CATCCTCTCT TTTTGGGGCT TTAAAAAGTA AACAAATTGA CGTGCTTTTA
TCAAATTCAA TTGATGATAG TCAGAGAAAA AGTTTAAATG ATTTAAGCAA AAATAAACAG
TTTAATGAAG GTAATAGCCC TTTCACTGAA TTAAGTTTTA TAAGCCTCAA AACTAGTTCT
TATCCCTTAA GTAATCTTAA TTTAAGATTG GCTTTGGCAA AAAGTCTTAA TAGAAAATTG
ATTAGTGAGA AAGTAAGTTA TGGATTAAGG AAGCCATCTA GATCAATTAT TCCTCCGATA
TTAAAAAAAG ATAATCAAGA ACTGTGGCCT AAATATGATT ATTTAGAAGC AAGAAGGTTA
TTGCAAAAAG AAAATTATTG CAATGGAAAT ATTCTAAAAA TACCCCTTAC TTATAGATCT
AATGTACCAG CTGACAAGCT TATTGCTCTG ACATGGCAAG AAGAAATTAA AAATTCTTTG
AAAGATTGTA TTGATATTGA ACTCAATGGG GTTGAATCTA CAACAGTTTA TAAGAATCTA
AGTTTAGGAA TTTATACGGC AGTCCTTCTC GATTGGACTG GGGCTTATTC AGATCCAGAG
GCTTATCTTA CCCCTCTTTT AAGTTGTAAT GAAATAGTTG ACGGCATATG TAAAAAAGGA
GAATCAGTTT ATAGCGGGAG TTTTTGGGGA TCTAATAAAG TGGAAAGTTT ATTTCTTGAG
AGTGAAAAAA TAAGTGGAAT TAAAAGATTA GAAAAACTTG TTGAAATTGA AAAAATAGCA
GCAAGTTCAA TACCTTATAT TCCTATTTGG ATCTCCTCTC AAAAAGCATG GTCACAAAAT
AAAATATCAA AACCTATTTT TAATGGCGCA GGAATAATTT CATTGAGTAA TCTTGAGTTA
ATTAATGAGT AG
 
Protein sequence
MKKKIVLSIF IILISFLQNS CGSKRISKKI IVASSGKIES LDPARANTLK AIQLISSLGD 
TLYELNSKGE LIPELASGMP VISKDRLQIT INLRKNVFFH DGTAFNSNAM KFTFDRFKRI
GTMNYILGNK IKSIETPSEY SVIINLNKPS SSLNGLLTSV NLTPISPTFY KQYSDKFLNE
KFVGTGKYVL TSFSNEVQSI DPYLNYWGEK PFNNGVNFVG YSNSSSLFGA LKSKQIDVLL
SNSIDDSQRK SLNDLSKNKQ FNEGNSPFTE LSFISLKTSS YPLSNLNLRL ALAKSLNRKL
ISEKVSYGLR KPSRSIIPPI LKKDNQELWP KYDYLEARRL LQKENYCNGN ILKIPLTYRS
NVPADKLIAL TWQEEIKNSL KDCIDIELNG VESTTVYKNL SLGIYTAVLL DWTGAYSDPE
AYLTPLLSCN EIVDGICKKG ESVYSGSFWG SNKVESLFLE SEKISGIKRL EKLVEIEKIA
ASSIPYIPIW ISSQKAWSQN KISKPIFNGA GIISLSNLEL INE
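
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how one might strip the block formatting, compute GC content, and translate the CDS. It builds the codon-to-amino-acid mapping from the standard NCBI ordering, which translation table 11 shares with table 1 for amino acid assignments (the two differ in permitted start codons, which this sketch ignores). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the first two protein blocks from above.
FIRST_DNA_LINE = (
    "ATGAAAAAAA AAATTGTTTT ATCAATATTT ATAATTTTAA TTTCTTTTTT ACAGAATTCT"
)
FIRST_PROTEIN_BLOCKS = "MKKKIVLSIF IILISFLQNS"

if __name__ == "__main__":
    print(translate(FIRST_DNA_LINE) == clean(FIRST_PROTEIN_BLOCKS))
```

Passing the full gene sequence to `translate` should yield the listed protein followed by a trailing `*` for the terminal TAG codon, and `gc_percent` on the full sequence can be compared with the GC content field in the Gene Information section.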