Gene NATL1_17651 details


Gene Information

Locus tag: NATL1_17651
Symbol:
ID: 4779757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1445087
End bp: 1447390
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 35%
IMG OID: 640085053
Product: outer envelope membrane protein-like protein
Protein accession: YP_001015585
Protein GI: 124026470
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4775] Outer membrane protein/protective antigen OMA87
TIGRFAM ID: [TIGR03304] outer membrane insertion C-terminal signal
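
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers for illustration only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1445087, 1447390)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under those assumptions the values agree with the Gene Length and Protein Length fields listed in the record.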


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.773387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAAA GACACACAAG TTCTAATAGA AAATTAATAG GCAGAGGTGC TTATGCCATT 
GCTTTCGCAT TCCCTTTTAT CTGCCTTTTT GGAGAGACTA AAGCCTCTAA AATAGTCTCA
AGTGAAAATC AATTTTTTGT TCAAAATAAG CAGAGTAAGA GATTACAATT TAGTCATCAA
TTAAATCCTA ATAAACTTAA TTTAAATTTC GATGATTTTT TGATTTCTGA GGGGAAAAAT
CAAAAAGATG AAAATACTGT TGATGAAGAA AAAAGAGTAT TAATTTCAGA GATTGTTATT
GAAGGACTTG AAGATCATCC TGACAAAGAG CGCTTGGAAG TTATAGCTTA TGATGCAATG
TTAATTAGAC CAGGTAGTAA AGTTACAAGT GAAGATGTTA AGAAGGATTT AGATCGAATT
TATTCAACAG GCTGGTTTTC TGGTGCTAAA ATAGAGTCTT TGCAGAGTGC TCTAGGAGTT
CAACTACTGA TTAAGATTGA CCCTAATCCG ATATTAAATA AGATAACTAT ACTCCCGCTT
GAAAGCAAAC TATCAAATAC AAAATTAAAT GAGATTTTTA ATAATGATTA TGGGAAAACC
TTAAATCTCA ATACTCTACA AATAAAGATT AAAGAAATTA AGGATTGGTA TAGTAGTCAG
GGATACTCTT TGACGAGAAT AAATGGTCCA AGTCGTGTAA CTAAAGAAGG TAGTGTTGAG
CTTAATATTC AAGAAGGATA TATTGCTGGC ATTCAGATAA ATTTTATAGA TGAAGATGGA
AATTCTGAGG ACGAAAAAGG AAGGTTAATA AAAGGAAAAA CAAAGAAATG GGTGATTAAA
AGAGAATTAG TAACTAAAGT TGGAGATATT TTTAATAGAA ACAAATTAGA ATCAGATATT
AAAAGATTAT ACTCAACTTC ACTTTTTAGT GATGTAAAAG TAACTCTTAA ACCAGTAAGT
TCGGAGCCTG GAAAGATTAT AGTTTCACTA GGTATAACTG AGCAAAGAAC AGGCTCTTTG
ACTGGTGGAC TTGGTTATAG TGGAGGTCAA GGTGTGTTTG GACAGATCGG ATTGCAAGAG
TCCAATCTGG TTGGAAGAGC ATGGTCATCA AATATGAATT TGACTTATGG AGAATATGGG
GCACTTCTAA ATCTTTCCTT ATATGATCCA TGGATTAAAA ATGATAAACA TAGAACATCT
TTTAGAACTT CATTATATTT AAGCAGGGAA GTTCCTCAAG AATTTAGAAG TCAAGAGGGT
GGAAGTATTA GAGGAGTTAC AGATAGATAT GAAGCCCCGA ATTCATTAAC AAGTTTCGAC
ACTAATCAAT CTCAAAACTT AGACCTGAAT AATAATAATA CTTTAATAGA AACTGGGCCA
TTTACTAATC TTTATTCTGC TCAGTTATCT GCACCTCAGT TTAGCTGGTT TGATTATGAG
GGTGATTCTA TTGTCTTAGA GCGAACAGGA GGAGGCTTCT CTTTTGCAAG GCCTTTAAAT
GGTGGTCAGC CTCTAAAAAA GGTTCCTTGG AGTGTGCTTA TTGGTGCAAA CTTCCAGAAA
GTTAAGCCAA TAGACTATGC CGGAGATAAA AGACCTTATG GCGTAGCATC AACAAATTTT
ATAGATGGGA AAGTTCCTAA GAATGAAGTT ATTTGCGTTG CGTATAATTG CTCTACAGAA
AACACACTAG TTAGTTTTAA AAGTGCAGTT ACTTATAATA ATTTAGATAA TTCAAGAAAT
CCAACCTCTG GAGATTATTT AAATATTGGT TCTGAACAAT TTATTAGCTT AGGTGAGAAT
TCACCGACTT TTAATAGAAC TCGGGTTAGT TATTCTCGTT TTTATCCTGT TAATTGGCTG
AAGTTTCATA GTGGTTGTCG ACCAAAGCCT GGAGAAAAAT CAGATTGCTC TCAATCGATA
GGTTTTCAAG CAAAGATTGG AACAATCATC GGAGATTTAC CTCCTTATGA AGCATTTTGC
TTAGGAGGAG CCAGCTCTGT TCGGGGTTGG AACTCTTGTG ACTTAGGCGT CGCAAGAAAT
TTTGGAGAGG CAACAGGAGA ATATAGATTT CCTATTTGGC GATTAGTTTC TGGGGTCCTA
TTTGTTGACG CAGGATCTGA TTTTGGCTCT CAATCAAATG TTCCAGGGAA GCCTGGAAAA
ATTCTAGAGA AACCAGGCTC TGGTTTTTCT GTTGGACCTG GTGCAGTCAT TAATACACCT
GTTGGCCCCA TTCGAATAGA GGCGGCGACT CAAGATTTCA GTGGTAATTG GCGCTATAAC
ATCGGAATTG GCTGGAAATT CTAG
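
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed; `clean_sequence` and `gc_percent` are hypothetical names, and the record does not say how its own GC content figure was calculated or rounded.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string.

    Splitting on any whitespace removes both the spaces between
    ten-base blocks and the line breaks.
    """
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage on the first line of the gene sequence only (not the full gene).
first_line = "ATGTTCAAAA GACACACAAG TTCTAATAGA AAATTAATAG GCAGAGGTGC TTATGCCATT"
seq = clean_sequence(first_line)
print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC")
```

Pasting all of the sequence lines into `clean_sequence` would give the full-length string, whose length could then be compared with the Gene Length field.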
 
Protein sequence
MFKRHTSSNR KLIGRGAYAI AFAFPFICLF GETKASKIVS SENQFFVQNK QSKRLQFSHQ 
LNPNKLNLNF DDFLISEGKN QKDENTVDEE KRVLISEIVI EGLEDHPDKE RLEVIAYDAM
LIRPGSKVTS EDVKKDLDRI YSTGWFSGAK IESLQSALGV QLLIKIDPNP ILNKITILPL
ESKLSNTKLN EIFNNDYGKT LNLNTLQIKI KEIKDWYSSQ GYSLTRINGP SRVTKEGSVE
LNIQEGYIAG IQINFIDEDG NSEDEKGRLI KGKTKKWVIK RELVTKVGDI FNRNKLESDI
KRLYSTSLFS DVKVTLKPVS SEPGKIIVSL GITEQRTGSL TGGLGYSGGQ GVFGQIGLQE
SNLVGRAWSS NMNLTYGEYG ALLNLSLYDP WIKNDKHRTS FRTSLYLSRE VPQEFRSQEG
GSIRGVTDRY EAPNSLTSFD TNQSQNLDLN NNNTLIETGP FTNLYSAQLS APQFSWFDYE
GDSIVLERTG GGFSFARPLN GGQPLKKVPW SVLIGANFQK VKPIDYAGDK RPYGVASTNF
IDGKVPKNEV ICVAYNCSTE NTLVSFKSAV TYNNLDNSRN PTSGDYLNIG SEQFISLGEN
SPTFNRTRVS YSRFYPVNWL KFHSGCRPKP GEKSDCSQSI GFQAKIGTII GDLPPYEAFC
LGGASSVRGW NSCDLGVARN FGEATGEYRF PIWRLVSGVL FVDAGSDFGS QSNVPGKPGK
ILEKPGSGFS VGPGAVINTP VGPIRIEAAT QDFSGNWRYN IGIGWKF
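
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is given in the coding orientation and uses the standard codon assignments, which translation table 11 shares for non-initiator codons. It does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here), and it raises `KeyError` on ambiguous bases. `translate` is a hypothetical helper, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text):
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# The first 18 bases of the gene and its last 15 bases, taken from the record.
print(translate("ATGTTCAAAA GACACACA"))   # MFKRHT
print(translate("GGCTGGAAAT TCTAG"))      # GWKF
```

The two fragments reproduce the first six and last four residues of the protein sequence shown above, which is consistent with the protein being a direct translation of the listed gene sequence.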