Gene P9211_02101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_02101 
Symbol 
ID5731791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp201543 
End bp203153 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content41% 
IMG OID641284554 
ProductABC transporter, ATP binding component 
Protein accessionYP_001550095 
Protein GI159902751 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAGTT CTTTAGGGGA TGTTTTAAAT ATTCAAAACT TGAGGGTTTG TTATCCCAAC 
ACTTCTAAAT GGGTATTGGA TCGCTTCAAT TTAAATATTC GAGCTGGAGA ACGTGTTGCT
CTCATTGGTA GCTCAGGGTC TGGGAAAAGT ACTGTTGCTA AGGCTTTAAT GCAAATTCTT
CCTTCAGGGA GTATCTGTCA AGGTTCTCTA TTAGTTGCTG GACAAGATCT CATGAACTTA
GAGCCTAAAA GCTTGGTTCA ACTTAGAGGA GAGTTGGTTG GTTTGGTTTT TCAAGATCCA
ATGAGTCGCC TGAATCCATT AATGACAATT GGAGATCATA TCTTGGATAC ATTAAAGGCA
CATAGACCAG AGAAGACTTC TTCTTGGCGC AGATTTCGGG CTGAAGAATT GTTGATAAAA
GTTGGGATCA ATCCTGCTCG TTTCAATGCT TTTCCTCATC AATTTAGTGG TGGCATGCGT
CAACGATTAG CCATTGCTTT GGCAATTGCT TTGAATCCAC CTTTAGTCAT TGCAGATGAG
CCTACTTCTA GCTTGGATGT CGCAGTGGCA AACCAGGTAA TGAGAGAGTT GAACAACCTT
TGCAATGAAC TTGGCACTAG TCTCTTATTA ATTACCCATG ACCTTGCTCT GGCAGCCAGA
TGGTGTGAAC GCATGGCAAT CCTTGGGGAA GGCAATATAG TTGAGGAAGG TTTCAGTAGA
GATGTTGTAG AGCAGCCATT ATCTTTGCTG GGGAAGAGCT TGGTTGGTGC TGTTAAAGCG
CGAGAACAAA AATCTTTAAA GTCTCAAATT GAGGGAAAAG TTGTATTAAA GGTTGATCGA
TTGCGATGTT GGCATGCTGG GGGTTGGTTG CCTTGGCAAA CTAATTGGAT TAAAGCTGTT
GATGAAGTTA GTTTTTCTTT GCTACAAGGG GAAACATTAG GAGTAGTAGG AGTATCAGGC
TGTGGGAAGA GCACTTTGTG TAGAGCCCTC GTGGGCTTGT TGCCTATCAG AGGAGGTGAT
GTGATGTTGT TTGGGCAAAA TTTAGCAAGA TTAAATAGGT CCTCTGTTAA ACAAGCTAGA
CAGGCATTAC AGATGATCTT TCAGGATCCT TTTGGTTCTA TGAATCCCAA GATGACAGTT
TTAGACACCA TCTCCGATCC ACTACTTGCT CATAATTTAT GCAATAAAGC AAGTGCAAAA
GAGCAATCAA GAAAGCTGTT GGATCAAGTA GGTTTAAGCC CACCTGAAAA CTTTCAACAC
CGTTTACCTC ATGAACTTTC TGGTGGTCAG CAACAAAGAG TTGCGATTGC GCGTGCTCTT
GCTTTGACTC CCAAGGTACT TATTTGTGAT GAAAGTGTGA GTATGCTAGA TGCTGAAATG
CAAGCAGATG TCCTAAATTT ATTGAGCTCA CTGCAAAAAA AACTTGGATT AGCAATTCTT
TTTATCACGC ATGATTTATC GGTTGCCCAT AGCTTTTGCC ATAGGTTGAT TGTTTTAGAT
AAGGGAAAAA TTGTTGAAGA AGGTTTGTCA CATCAGATAT TTAATAAACC TCAGAATGAA
CTTACTAAAA CACTAGTTAG TGCTTGCCCA AGGATTAAAT CCTTTAATTG A
 
Protein sequence
MTSSLGDVLN IQNLRVCYPN TSKWVLDRFN LNIRAGERVA LIGSSGSGKS TVAKALMQIL 
PSGSICQGSL LVAGQDLMNL EPKSLVQLRG ELVGLVFQDP MSRLNPLMTI GDHILDTLKA
HRPEKTSSWR RFRAEELLIK VGINPARFNA FPHQFSGGMR QRLAIALAIA LNPPLVIADE
PTSSLDVAVA NQVMRELNNL CNELGTSLLL ITHDLALAAR WCERMAILGE GNIVEEGFSR
DVVEQPLSLL GKSLVGAVKA REQKSLKSQI EGKVVLKVDR LRCWHAGGWL PWQTNWIKAV
DEVSFSLLQG ETLGVVGVSG CGKSTLCRAL VGLLPIRGGD VMLFGQNLAR LNRSSVKQAR
QALQMIFQDP FGSMNPKMTV LDTISDPLLA HNLCNKASAK EQSRKLLDQV GLSPPENFQH
RLPHELSGGQ QQRVAIARAL ALTPKVLICD ESVSMLDAEM QADVLNLLSS LQKKLGLAIL
FITHDLSVAH SFCHRLIVLD KGKIVEEGLS HQIFNKPQNE LTKTLVSACP RIKSFN