Gene NATL1_15391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15391 
SymboldppB 
ID4780495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1252780 
End bp1253883 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content33% 
IMG OID640084821 
Productputative ABC transporter, oligopeptides 
Protein accessionYP_001015361 
Protein GI124026245 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.134058 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTCTCTA AAAGATATAA GCCAACCAGA ATTTTCAAAA GATGGATTAA TTATCCTGAA 
AAACTTAGAG AGAGACTAGT TTTGTCAACT AAAAAGGCAC TTTTCAAATA TATCCTTTCA
AGATTAACTC TATTGCCAAT AATGCTTTGG ATCATTTCGA GCTTAGTTTT TATATTATTA
AGAATTGCTC CAGGCGATCC GGTAGATGCA ATCCTAGGCA CTCGAGCGAA TGAATTTGCA
AGGGAAAGCC TAAGAATTAA ACTTGGATTA GATAAGCCTC TAATTAATCA ATATATTGAA
TATTTAAATC AATTAATTCA TGGCAATTTA GGAATCTCAT TAAACACACA AGAACCTGTA
AAAGTAATTA TTTCAAAGGC TCTTCCAGCA AGTTTGGAAT TAGCTATATT TTCAATATTA
ATAGCATCAT TCGTAGGTTA TTTAATTGGT TTTTTAGGAG CAGTTAAGCC AGAAAGCAAA
ATAGATTTTT CAGGAAGGAT TTTTGGCATT GGTACCTATG CTCTCCCCCC TTTCTGGGCA
GCAATGTTAA TTCAAATTAT CTTCGCTGTT TTTTTAGGTT GGTTACCCAT TGGTGGAAGA
TTACCTCCCG GAGCTATTCC TCCGCCCCCA ATCACTGGTT TCTTACTTTT AGATAGTATT
TTAGATAAAA ACGTTGAAAT CATTTTTAGT TCTATTCAAC ATTTAATATT ACCCTCCGTC
ACTCTAGGGA TCTTATTAAG CGGAATATTT AGTCGAGCAT TAAGATTAAA TCTAGAAGAA
GTTTTAAAAA AAGATTACAT TGAAGCCGCT AAAAGTAGAG GTATAAATAA CTCCAGAGTA
TTAGTTAAAC ATGCCTTACC AAATACTCTT CTCCCAATAT TGACAATTAC TGGCTTAACA
GTTTCTTCTT TAGTTGGTGG AGCACTTTTA ATTGAAATAA CATTCTCATG GCCTGGAATA
GCTCTTGGAC TTCAAGAAGC TATAAATCAA AGAGATTATC CTGTTGTACA AGGAATAGTT
GTTGTAATAT CTAGCCTTGT TGTAATGATT AGTGTTTGTA TAGATATCGC TATTGCATAT
ATTGATCCAA GGGTTAGTTA TTGA
 
Protein sequence
MVSKRYKPTR IFKRWINYPE KLRERLVLST KKALFKYILS RLTLLPIMLW IISSLVFILL 
RIAPGDPVDA ILGTRANEFA RESLRIKLGL DKPLINQYIE YLNQLIHGNL GISLNTQEPV
KVIISKALPA SLELAIFSIL IASFVGYLIG FLGAVKPESK IDFSGRIFGI GTYALPPFWA
AMLIQIIFAV FLGWLPIGGR LPPGAIPPPP ITGFLLLDSI LDKNVEIIFS SIQHLILPSV
TLGILLSGIF SRALRLNLEE VLKKDYIEAA KSRGINNSRV LVKHALPNTL LPILTITGLT
VSSLVGGALL IEITFSWPGI ALGLQEAINQ RDYPVVQGIV VVISSLVVMI SVCIDIAIAY
IDPRVSY