Gene NATL1_08781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_08781 
Symbolwza 
ID4780032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp816931 
End bp818178 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content33% 
IMG OID640084153 
Producthypothetical protein 
Protein accessionYP_001014701 
Protein GI124025585 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000165811 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCACATAA GAAAACCAAC TCAAGTAAAA GCTCTTTCCT TTAAGGTTAT TGGTTTTATT 
TGCTTGTCTA GCATTATTGG TGTAAACGCG CAGCAAGTCA TTGAAGAGAG ATCAGTTCCC
TTAGATACCT CATACTTAGA ATCAAAAAAT GAACTTGAGG ACTATATTTT AGATACTGGA
GATGTATTGA ATATTGAATT TGTGAATGTT CCTGAACTTA ATGGCTTATT TAAAATTAAT
GAGCTAGGAG AGATATATTT TAAAAGAATA AAATCTACTT ATGTTAGAGG TCTAACCATT
AATGAACTAA CACAATTACT AGAGGAACGT TATAAAGAAT TTCTTGTAAA CCCAGAGATT
TATATAAGAA TTAATACATA CAAGTCTATT AGAGTTTCTA TAAGAGGTGA GGTGAAAGCA
CCTGGAGTGA TATCACTTCC TGCTTATATT TCAACATCTT TTGCAACATC CTTAGATGTT
TTTGATAATA AACAATCAAG CTTAGATTCT GATAATAACA TCAGCAAAAG AAATAAAAAC
TCAAGCTATC TATCATTGTC TACCAACAAA AATGTGAATG GAGATTCTTT AATTAATTCT
AACAATTTAA TTAAAAGAAA TAATGATTAT ATAACTACTC TCTCAAATGC AATTCAAAAA
GCAGGTGGTC TAACTTCTTC TAGTGATATT AGCAAGTTAG AAATTACCAG AGAAATACCT
CTTGGGAACG GAGGTGGCAA AAAACGAGCG ATAGTTAACT TTCTACCTTA TATCCGAAAT
GCAGACGCCT CATCAGATAT GAGACTATTT GATGGAGACG ATATCTTTAT TCCTCGTCTT
AAAGAAAAAG ATCTAACTAT TATTCCTGAC TCAATACTGT CCGGTCTATC TCCCAGGTTT
ATAAATGTAT CAGTTGGGGG GCGAATAGAA AATCCAAGTA CCGTAAAGAT TCCAATTGAA
GGAAGTCTTT CTGATGCAAT GAATTTAACA GGTCCAAGGA AGCCTTTGTC AGGAGAAATT
TATTTAATTA GATACAATCA AGACGGAACT TTATTAAGAA AAGGTATTAG TTATTCTTCA
AGTGCCCCTC CAGGATCTCC AAAGAATCCA TACTTATTAG CTGGTGATTC AATAACTGTT
AAAAATAGTA TTTTAGGAAG AACATCCGGA ACATTAAGAG CAATAACTGA ACCATTTGCG
GGCATTTTTG CCACCAAAGA GTTAATGGAG GGTCTCTACG AGAAATAA
 
Protein sequence
MHIRKPTQVK ALSFKVIGFI CLSSIIGVNA QQVIEERSVP LDTSYLESKN ELEDYILDTG 
DVLNIEFVNV PELNGLFKIN ELGEIYFKRI KSTYVRGLTI NELTQLLEER YKEFLVNPEI
YIRINTYKSI RVSIRGEVKA PGVISLPAYI STSFATSLDV FDNKQSSLDS DNNISKRNKN
SSYLSLSTNK NVNGDSLINS NNLIKRNNDY ITTLSNAIQK AGGLTSSSDI SKLEITREIP
LGNGGGKKRA IVNFLPYIRN ADASSDMRLF DGDDIFIPRL KEKDLTIIPD SILSGLSPRF
INVSVGGRIE NPSTVKIPIE GSLSDAMNLT GPRKPLSGEI YLIRYNQDGT LLRKGISYSS
SAPPGSPKNP YLLAGDSITV KNSILGRTSG TLRAITEPFA GIFATKELME GLYEK