Gene NATL1_11501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_11501 
Symbol 
ID4780834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1025936 
End bp1028200 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content38% 
IMG OID640084429 
Productalkaline phosphatase 
Protein accessionYP_001014973 
Protein GI124025857 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.991815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0150777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GGGTTGTACT AGCATGTGGC TCAGCAGTGC TAGCTCTCTT AGCCTCCTCT 
AGCACATCAA GTATTGCAGG TTGGCGGGAT GCTTCAAATA AGAATTTCAA TAGAGTTTCT
TCTTTCGCTA TCAATAGGAA TCTTCCATCT GGTGTAAAGT CCACAACTAA AACATCTGCA
GAGACCATCA CAGCAACTAA GGATGGAAAA ACACTTATCT ACACTGATAG TGATCTTGGT
GTTGTAGGAA TAATAGATAT TACCGATCCT TCAGATCCTA AAGGTGAAGG GATTATCGAG
TTAGATGCAG AACCAACATC AGTGATGGAG CGTAAAGGTA AAATCTTTGT AGGTATTAAT
ACATCTGAAA GCTACACTAA TCCATCGGGA TCAATAACCT CATATGATTT AAAATCAGGT
ATTAAGTCCA AAGAATGCAA CGTTGGTGGA CAACCTGACA GCGTCGCAAT TGCCCCAAGT
GGAAAATTTA TCGCAGTTGC GATTGAGAAT GAAAGAGATG AAGAATTTAA TGATGGTGTA
ATTCCACAAA TGCCTGCAGG TAATGTTGCT TTCGTAAAGC TAAAAGGAGG AGATCTTGAT
TGTGATTCTA TGTTCTTCGC AGATGTCTCT GGTCTTTCTG AAATAGCACC AAGTGACCCT
GAACCAGAAT TTCTAACGAT TAATAAAAAA GGTGAAACAG TTGTTTCACT TCAAGAAAAT
AATCACTTGG TGGTACTAAA TAAAAAAGGT GAAGTTATCT CCCATTTTTC AGCAGGTCTA
GTAAGTCAAA TGGCAGGAAT GGATACTAAA AAAGATGGAG CACATAAATT CAAGAAAAAG
CTAAAAAACG TTAGACGTGA ACCAGACGGA CTAACTTGGA TTGATAACGA TCATTTTGCG
ACAGCTAACG AGGGAGATTA CAAGTTAAAG AAAGAAGGTC AGGCAAAAAG AGGAGGTTCT
AGATCTTGGA CGATTTGGAA TAAAGATGGT TCTGTTGTTT ACGAAGATGG AAACAGACTA
GAAAGAGCTA TTGCTCAAAT TGGTCATTTC CAAGACGATC GTGCTGGAAA GAAAGGAGTT
GAACCTGAAT CAGTGACTTA TGCGAAAATC AAAGGTACTC CTTATATGTT TGTAGGAGCT
GAAAGAGCTG GAGTTGTAGC TGTATATGAT GTATCAGATT TAAGTCAGCC AACACTCCTT
CAACTATTAC CATCTGGTAT TGCACCAGAG GGATTTGTCG CAATTCCTAA ACGTGGTTTA
ATCGCATCTT CTAACGAAAA AGATTACAAC AAGAAAGAAC CTGGCTTGGC TTCACATGTA
ATGACTTACG AGCTCCAAAA GGCTGATGCA ATTTATCCAC ACATCACTAA TGAAGGTGGA
TATGACTTTG TTAGTTGGGG ATCGATTAGT GGAATGGTTG ATGGAGGCGA TGGAAAGATT
TATGCCGTTA ACGACAGCAC ATTTTCGTCA CAGCCAAGAA TCTATGTGAT CGATACGAAT
TACAGCCCTG CCATACTTGA TACTGCAATA GACATCAAGT TAAAAGGAAA GACTGCTCCA
TTTATGGATA TGGAGGGAAT TACACTTGAT GGTAAAGGTG GTTTTTATGT TTCCACAGAA
GGATTCAAAG AGAAAGGTGG TCCAGGGATA GAACAAGCTC CAGCTGCTGT TTATCACATC
AGCTCAGATG GTGAAATTCT TGAAAAAATA GATGTACCCT ATTCCCTTAT TCAATATCAA
ACGAAAGCTG GCTTTGAAGG GATAGCAAAA GTTGGAGATA CTCTTTACAT GGCTCAACAA
AAGCCATGGG CAGATGATCC ATTTAATACA ACAAAAATTG TTACTTACAA CTTAGAAAAC
AAAGAATGGG GGGCCGTTAA CTATGTATTC GAAAAGCAAG GTAGAAAAGG TGGCGTAGGA
ATTTCCGAAT TGACTCATCA TGACGGATAT CTTTATGCAA TTGAAAGAGA TAGCAATTAC
GGAGCAAAGG CTAAATTAAA GTCTATTTTC CGTATCAAAG TTTCTGATAT CAATCCTGAT
CCAATTTCGA ATGATACTTC TACACCAATC CTATATCCAT TAGTTAAGAA GGAGTTCGTT
AAGGATTTGA GATCAGATCT AACTTCAACA GGTGGATTTA TTCTTGAAAA AGTTGAAGGC
CTAGCTATCA AAAAAGATGG TACCGCTTAC ATATCAACTG ACAATGATGG AACTTCAAAA
AAATCAACTG GAGAAACCTT ATTCCTAAAT ATAGGAAAGC TTTAA
 
Protein sequence
MKKRVVLACG SAVLALLASS STSSIAGWRD ASNKNFNRVS SFAINRNLPS GVKSTTKTSA 
ETITATKDGK TLIYTDSDLG VVGIIDITDP SDPKGEGIIE LDAEPTSVME RKGKIFVGIN
TSESYTNPSG SITSYDLKSG IKSKECNVGG QPDSVAIAPS GKFIAVAIEN ERDEEFNDGV
IPQMPAGNVA FVKLKGGDLD CDSMFFADVS GLSEIAPSDP EPEFLTINKK GETVVSLQEN
NHLVVLNKKG EVISHFSAGL VSQMAGMDTK KDGAHKFKKK LKNVRREPDG LTWIDNDHFA
TANEGDYKLK KEGQAKRGGS RSWTIWNKDG SVVYEDGNRL ERAIAQIGHF QDDRAGKKGV
EPESVTYAKI KGTPYMFVGA ERAGVVAVYD VSDLSQPTLL QLLPSGIAPE GFVAIPKRGL
IASSNEKDYN KKEPGLASHV MTYELQKADA IYPHITNEGG YDFVSWGSIS GMVDGGDGKI
YAVNDSTFSS QPRIYVIDTN YSPAILDTAI DIKLKGKTAP FMDMEGITLD GKGGFYVSTE
GFKEKGGPGI EQAPAAVYHI SSDGEILEKI DVPYSLIQYQ TKAGFEGIAK VGDTLYMAQQ
KPWADDPFNT TKIVTYNLEN KEWGAVNYVF EKQGRKGGVG ISELTHHDGY LYAIERDSNY
GAKAKLKSIF RIKVSDINPD PISNDTSTPI LYPLVKKEFV KDLRSDLTST GGFILEKVEG
LAIKKDGTAY ISTDNDGTSK KSTGETLFLN IGKL