Gene P9303_08861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_08861 
SymbolddpA 
ID4776187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp803195 
End bp804670 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content57% 
IMG OID640086395 
ProductABC transporter substrate-binding protein 
Protein accessionYP_001016902 
Protein GI124022595 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAGTG CCGGCAAGAT CACCTCTCTG GATCCTGCCC AGGCCAGTAC TTTTGATGCT 
CTGCAACTGC TGAGCGCCCT CGGGGACCCC CTCTACCGAC TTGATCACAA AGGGGACCTA
GAGCCACGAC TGGCATCCGC CCCACCTCAG ATCAGTGATG GCGGCTTCAC CATCTCGATC
CCTCTGCGCA AGGATGTGCT GTTTCACGAC GGCACCCAAT TCAACGCCGC CGCGATGGCC
TTCAGCCTGC GGCGATTTCT GCGTATCGGC ACCCTCAACT ACGTGGTAGG GGGACGGATT
GCCGCCGTGG AGGCAGCCGG TCCCTACCTG CTAAGCCTTC GACTAACACG ACCCTCCACT
TCTCTAGAGG GACTACTCAC CTCAATCAAT CTGACGCCGG TCTCACCAAC GGCCTACGCC
AAGCACAGAA ATCAGTTCCT CAATAAGCAA TTCATTGGAA CCGGCCCTTA CCGACTCACT
AGCTTCCAGA CCCAGCAACA ACGCCTTGAG CCTTTTCAGC AGTACTGGAG TACCGAGGCC
AGTAATGCTG GAATCGATTT CATCAATCTG AGCAACTCCA CTGCCCTATT CGGTGCCCTA
CGAAGCGGTG AAGTGGATGT ACTACTTTCC AATTCACTGG ATGAAGACCA ACGCCTTGCC
CTCCATCGCC TTGCCAAGCA AGGGAAGCTC CGCGAAGGAA CGGGGCCAGC ACTTGAGATT
GGTTACATCA CTCTACTCAG TAATACCACT CCACTCAATC AACCACTCCT GCGAAAGGCC
CTCGCTTACA GCCTCGACCG TCAGCTGATG GTTGAGCGCG TGAGCTATGG ACTCAGACGC
CCTTTGCGGT CTTTAGTGCC GCCAAACTTG CAGGCTGAAC CAATCACACC TTGGCCCAGC
TACAACCCTC AACGAGCCAA GCAGCTGCTT CAAAAGGCAG GTTATTGCAC AACTCAAAAG
CTGACACTCC CTTTCACGTT CCGTTCCAAC GTGCCAGCAG ACAAGCTATT GGCACTGACG
TGGCAGGCAC AGGTGGAACG TGACCTCTCG GATTGCCTGA CTCTGAAGCT CAATAGCGTC
GAATCGACCA CCGTTTACCG CCAACTGGGG GAGGGCGCAT TCCAGGCAGT CATCCTTGAG
TGGCGAGGCG CCTATCCCGA CCCGGAAGCC TATTTAGCTC CATTATTAAG CTGCAGCAAA
GCCAATGGAC CTGTATGCGA AGAGGGGGAA GCGGCAATTA GCGGCAGCTT CTGGACCGCA
AATGGGCTGG AAGCGAGCCT TCGCCACAGC GATGAACTTC GCGGACCCGA TCGTCTCCAC
CAGCTCGGAG AGATTGAGCA CCGCGCCGCC GCAGGAGCGG CCTATCTACC AATCTGGCTG
GTAGCACCAC GAGCTTGGGC CCAGCTGCGC TTATCCAAGC CAGAATTTGA CGGTAGTGGC
CAGTTAATGC TCCCCCGTCT ACGGGAGCTG CACTGA
 
Protein sequence
MASAGKITSL DPAQASTFDA LQLLSALGDP LYRLDHKGDL EPRLASAPPQ ISDGGFTISI 
PLRKDVLFHD GTQFNAAAMA FSLRRFLRIG TLNYVVGGRI AAVEAAGPYL LSLRLTRPST
SLEGLLTSIN LTPVSPTAYA KHRNQFLNKQ FIGTGPYRLT SFQTQQQRLE PFQQYWSTEA
SNAGIDFINL SNSTALFGAL RSGEVDVLLS NSLDEDQRLA LHRLAKQGKL REGTGPALEI
GYITLLSNTT PLNQPLLRKA LAYSLDRQLM VERVSYGLRR PLRSLVPPNL QAEPITPWPS
YNPQRAKQLL QKAGYCTTQK LTLPFTFRSN VPADKLLALT WQAQVERDLS DCLTLKLNSV
ESTTVYRQLG EGAFQAVILE WRGAYPDPEA YLAPLLSCSK ANGPVCEEGE AAISGSFWTA
NGLEASLRHS DELRGPDRLH QLGEIEHRAA AGAAYLPIWL VAPRAWAQLR LSKPEFDGSG
QLMLPRLREL H