Gene P9211_11431 details

Gene Information

Locus tag: P9211_11431
Symbol: ddpA
ID: 5730260
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1045242
End bp: 1046846
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 37%
IMG OID: 641285511
Product: ABC transporter substrate-binding protein
Protein accession: YP_001551028
Protein GI: 159903684
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
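
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are inferred from the numbers in this record, not stated by it. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1045242
end_bp = 1046846
gene_length_bp = 1605
protein_length_aa = 534

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons, and the final
# codon is a stop codon that does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```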


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.88428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTAT TATTTAAATC CAAAGCCAAG GCCTACAGAA AGCTATTAAC ATTACTAGCT 
ACTTCTCTAA TAATCATCAC TCAAACCTCT TGTAAGGCAA CTAAAGAATC AGATCGAATA
ATCGTTGCAA GCAAAGGGAA AATTGAATCG TTAGATCCTG CCCAAGCCAA TAAGTTGTTG
GCAATTCAAC TTATTAGTGC TTTAGGAGAC CCTCTATACA GAATTAATGA ATCAGGTTTA
CTTGAGCCAA GACTTGCAAA AGACTATCCT CAGATAAGCA AAGATGGTTT AACTATTTCA
ATAGCTTTGA GAGAAGATGT ACTTTTTCAT GACGGGACAC CTTTTAATGC AGATGCTATG
GCATTTAGTA TTAAACGTTT TATGGAAATA GGTACTCTTA ATTATGTAAT AGGAGAAAAA
ATAACCAAAA TTGAAACACC AGGTCCTTTT TTGATTCGCC TTAGACTAAA TAAACCTTCA
AGTTCTATAA AAGGATTACT CAGTTCTATA AACCTGACCC CAGTATCTCC CAAAGCATAT
TTAGAGCATC AGGATAAATT CTTAAACAAA AAATTCATTG GAACAGGTCC CTATCAATTA
GATAGCTTTA CTCCAGAAAG ACAACGTTTA ATACCATTCC CACGTTATTG GGATAAAGCC
CCAAAAAACC TTGGAATTGA CTATGTAAGT TACACAACTT CTACATCTCT TTTTAGCGCA
ATTAAAACTG GTCAGGTAGA TGTTTTATTG TCCAACTCAG TTGAAGATGG GCACCGATTA
GCTCTTCATA AACTTTCAAA GAAAGGAAAG CTTATAGAAG GGATTGGACC GGCTATGCAA
ATTGGATATA TTGCCTTTCG TAGTAATTCA GCGCCACTGG ATAATAAGAT TTTAAGATCG
GCCCTTTCAT ACAGCCTTGA TAGAAATCTT ATTTCTAGAA AAGTAGGTTA TGGGTTAAGA
GAGCCTTTAA GATCAATAGT TCCACCTATT CTAAAAACCA GTAAAACATC ACCTTGGCCA
AAATATAATC CTCAAATTGC AAGAGACTTG TTTAAGAAAG CAGGCTTATG TGATTCCAAC
AAAGTGACAA TCCCTTTGAC ATTTAGATCA AACGTTCCTG CAGACAAACT ATTAGCGCTT
ACTTGGCAAG AACAATTAGG AAGAGATTTT TCAGATTGTA TAACAGTTAA ACTCAATGGA
GTTGAGTCTA CGACTGTATA TAAGCAGCTT GCAGAAGGAG CCTATGAAGC AGTCATACTT
GGCTGGACTG GAGAATACCC TGATCCAGAG GCATATTTAT CACCGTTACT GGACTGCACT
AAAAGTCAAG GTTCCACTTG CTTAGAGGGA GAGGCTGTTT ATGGTGGAAC TTTTTGGGCA
TCAAATCAAA TTAAAACCTC ACTTGAACAA AGCGAAAATC TACAAGGTTC TCAAAGACTA
GAAAAATTAT CCGAAATTGA ATTTTTAGCG GCCGATGGAG CCGCAATATT ACCTGTTTGG
CTTGTAAAGC CTAGAGCTTG GGCTCAACCT AATTTTTCTC GACCAAAGTT TGATGGGAAC
GGAAGATTAT TACTAGACCG TCTTCAAAAA GCAATAAATG AATAG
 
Protein sequence
MTLLFKSKAK AYRKLLTLLA TSLIIITQTS CKATKESDRI IVASKGKIES LDPAQANKLL 
AIQLISALGD PLYRINESGL LEPRLAKDYP QISKDGLTIS IALREDVLFH DGTPFNADAM
AFSIKRFMEI GTLNYVIGEK ITKIETPGPF LIRLRLNKPS SSIKGLLSSI NLTPVSPKAY
LEHQDKFLNK KFIGTGPYQL DSFTPERQRL IPFPRYWDKA PKNLGIDYVS YTTSTSLFSA
IKTGQVDVLL SNSVEDGHRL ALHKLSKKGK LIEGIGPAMQ IGYIAFRSNS APLDNKILRS
ALSYSLDRNL ISRKVGYGLR EPLRSIVPPI LKTSKTSPWP KYNPQIARDL FKKAGLCDSN
KVTIPLTFRS NVPADKLLAL TWQEQLGRDF SDCITVKLNG VESTTVYKQL AEGAYEAVIL
GWTGEYPDPE AYLSPLLDCT KSQGSTCLEG EAVYGGTFWA SNQIKTSLEQ SENLQGSQRL
EKLSEIEFLA ADGAAILPVW LVKPRAWAQP NFSRPKFDGN GRLLLDRLQK AINE
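
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, together with a plain GC-fraction calculation of the kind that could be used to check the reported GC content if it were run on the full gene sequence. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool. Only the first and last lines of the gene sequence are used here, to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGACCTTAT TATTTAAATC CAAAGCCAAG GCCTACAGAA AGCTATTAAC ATTACTAGCT"
last_line = "GGAAGATTAT TACTAGACCG TCTTCAAAAA GCAATAAATG AATAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(head[:3])   # start codon: ATG
print(tail[-3:])  # stop codon: TAG
print(round(gc_fraction(head), 2))  # GC fraction of the first 60 bases only
```

Passing the complete gene sequence through `join_blocks` and then `gc_fraction` would give a value to compare against the GC content field. The figure printed here covers only the first line and is not expected to match it.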