Gene A9601_11541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_11541 
SymbolddpA 
ID4717867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp969540 
End bp971111 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content30% 
IMG OID640078869 
ProductABC transporter substrate-binding protein 
Protein accessionYP_001009545 
Protein GI123968687 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA AAATTGTTTT ATCAATATTT ATAATTTTAA TTTCTTTTTT ACAGAATTCT 
TGCGGCTCAA AAAGAATATC TGAAAAAATC ATAGTAGCAA GTTCTGGAAA AATTGAATCT
TTAGATCCAG CTAGAGCAAG TACTCTTAAA GCAATTCAAT TAATCAGTTC TCTTGGAGAC
ACATTATATG AATTAAATTC TAACGGAGAA TTAATACCTG AATTGGCCTC GGGGATGCCA
GTTATTTCAA AGGATAGACT TCAAATAACT ATCAATTTAA GAAAGAATGT TTTCTTTCAC
GATGGAACTG CATTTAACTC AAATGCTATG AAGTTTACCT TTGATAGATT CAAAAGAATT
GGAACGATGA ACTATATTTT AGGAAATAAG ATTAAATCAA TAGAAACGCC AAGTGAATAT
TCAGTGATAA TAAATTTGAA TAAACCATCA AGTTCTTTAA ATGGTTTACT CACATCAGTA
AATTTAACTC CAATATCTCC TACATTTTAC AAACAATATT CTGATAAGTT TCTAAATGAA
AAATTTGTTG GTACTGGCAA GTATGTGCTG ACCAGTTTTT CTAATGAAGT TCAATCAATT
GATCCATATT TGAATTATTG GGGTGAAAAG CCCTCAAATA ACGGCGTTAA TTTTGTGGGC
TATTCAAACT CATCCTCTCT TTTTGGGGCT TTAAAAAGTA AACAAATTGA CGTGCTTTTA
TCAAATTCAA TTGATGATAG TCAGAGAAAA AGTTTAAATG ATTTAAGCAA AAATAAACAT
TTTAATGAAG GTAATAGCCC TTTCACTGAA TTAAGTTTTA TAAGCCTCAA AACTAGTTCT
TATCCCTTAA GTAATTTTAA TTTAAGATTG GCTTTAGCAA AAAGTCTTAA TAGAAAATTG
ATTAGTGAGA AAGTAAGTTA TGGATTAAGG AAGCCATCTA GATCAATTAT TCCTCCGATA
TTAAAAAAAG ATAATCAAGA ACTGTGGCCT AAATATGATT ATTTAGAAGC GAGAAGGTTA
TTGCAAAAAG AAAATTATTG CAATGGAAAT ATTCTAAAAA TACCCCTTAC TTATAGATCG
AACGTGCCAG CTGACAAGCT TATTGCTCTG ACATGGCAAG AAGAAATTAA AAATTCTTTG
AAAGATTGTA TTGATATTGA ACTCAATGGG GTTGAATCTA CAACAGTTTA TAAGAATCTA
AGTTTAGGAA TTTATACGGC AGTTCTTCTC GATTGGACTG GGGCTTATTC AGATCCAGAG
GCTTATCTTA CCCCTCTTTT AAGTTGTAAT AAAATAGTTG ACGGCATATG TAAAAAAGGA
GAATCAGTTT ACAGCGGTAG TTTTTGGGGA TCTAAGAAAG TGGAAAGTTT ATTTCTTGAA
AGTGAAAAAA TAAGTGGAAT TAAAAGATTA GAAAAACTTG TTGAAATTGA AAAAATAGCA
GCAAGTTCAA TACCATATAT TCCTATTTGG ATCTCCTCTC AAAAAGCATG GTCACAAAAT
AAAATATCAA AACCTATTTT TAATGGTGCA GGAATAATCT CATTGAGTGA TCTTGAGTTA
ATTAATGAGT AG
 
Protein sequence
MKKKIVLSIF IILISFLQNS CGSKRISEKI IVASSGKIES LDPARASTLK AIQLISSLGD 
TLYELNSNGE LIPELASGMP VISKDRLQIT INLRKNVFFH DGTAFNSNAM KFTFDRFKRI
GTMNYILGNK IKSIETPSEY SVIINLNKPS SSLNGLLTSV NLTPISPTFY KQYSDKFLNE
KFVGTGKYVL TSFSNEVQSI DPYLNYWGEK PSNNGVNFVG YSNSSSLFGA LKSKQIDVLL
SNSIDDSQRK SLNDLSKNKH FNEGNSPFTE LSFISLKTSS YPLSNFNLRL ALAKSLNRKL
ISEKVSYGLR KPSRSIIPPI LKKDNQELWP KYDYLEARRL LQKENYCNGN ILKIPLTYRS
NVPADKLIAL TWQEEIKNSL KDCIDIELNG VESTTVYKNL SLGIYTAVLL DWTGAYSDPE
AYLTPLLSCN KIVDGICKKG ESVYSGSFWG SKKVESLFLE SEKISGIKRL EKLVEIEKIA
ASSIPYIPIW ISSQKAWSQN KISKPIFNGA GIISLSDLEL INE