Gene NATL1_21071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21071 
Symbol 
ID4781116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1758539 
End bp1762237 
Gene Length3699 bp 
Protein Length1232 aa 
Translation table11 
GC content30% 
IMG OID640085403 
Producthypothetical protein 
Protein accessionYP_001015927 
Protein GI124026812 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG GAATTGATAT TCAGGGTTGT CAATCTGAAG GCAGCAAAAG TCGAGGGATT 
GGTAGATACT CATTAACTTT AATTCGTTAT TTAATCAAAT ATTCAAGAGA TGATGATGAG
TTTATTCTTA TAGCTAATAA ATCTTTAGAT TCTGTTGATA TAGATTTTCT TTCTTTTATA
ACTTCTTGTG GTTCAAAAGT GCAATACTTT GAATGGATTT ACCCTGGGGC TACTTCAGGT
ATGTCTAGCT CTAATACCGA AAAAACAGCA ATAGCAGAGC AGTTAAGGTC TTATTCTTTT
TCTCTTTTAA ACTGCGACAT TATCTTAATT ACTAGTTTTT TTGAAGGATT CAGGGATAAT
TGCATAATTG AATTTGATAG CAACTTTGAT TTACCTCCTA TTGCGTCAAT ATTTTATGAT
CTAATTCCGT TGTTAAAACC AGAAAACTAT TTAGATAGCA ACAGTGAATT TAAACAATTC
TATCTTTCTA GATTAGATCT TTTAAACGAG GTTAGCTGTC TACTTTCGAT CTCAAACTCT
TCATCTAAAG AAGCAGAAAA ATATCTTTCT ATTGACAAAA AAGATATACA TAATATTTAT
GCTGGCTGTG ATCAGAGTAC TTTCTATCCC AGAAAATTAA TAGAAGATTC AAATAAAACC
ACATATAGTT TAGGTAAATA TATTTTGTAT TCTGGCGCAG GTGATCCTCG AAAAAATATT
CAAAGATTGG TAGAGGCCTA TTCTCTCTTA GCTAAAGATA TTATCTGGAA CTATAAGTTA
GTTCTCGTAG GTAAACTTTT GCCAGAAGAA ATTTCTCTTA TAAAAAGTTG GATTACCTCT
GTTAACTTAA CTGAAAATAA TATTGTTTTA CTTGGTTATG TTTCTGATGA CGAATTAGCA
TCTTTATATA GAAATTGTAC CCTTTTTGTA TTTCCCTCGT TGCATGAAGG ATTTGGGTTG
CCTGCTCTAG AAGCTATGTC TTGCGGGGCT GTTGTTCTTG GTTCGAATAC TACTAGTATT
CCAGAAGTTA TTCAGAACGA GTCTGCTTTA TTTGATCCAG AGAATGTTAA TGAAATGGCA
GAATTAATCT CTAAGGCTTT AACTAATAAA TTATTTTACA AAGAATTATC ATCTAATTTA
ATTAAAAGAG CATCTAAGTT TACCTGGGAG AATACTGCCT TAAAAGCTTT AAATGCTATG
AGAGTAACTA TAATGAAATC TCAAAAAAGT TCGTTATCAA ATAATGACAT TACTTCGATT
TTATCTTTTA AGAATAAACA ATATGATCTA ATGTTGGATA ATATTGTTAA TATTTTAGCA
ACCGCTAATA CTTTATCCAA TGATGAAGAC TATCTAGTTA ATTTAGCCGC TAGTATAAGT
TCTTGTGAAT TAAATTCAAA ATATCTTAAA CAATTTAAAA TTAATGATTT AGATTTCCCG
ACTTGGAGAA TAGAAGGACC ATTTGACTCG AACTATAGCC TTGCAATTCT AAATAGAGAA
TTATCATTGG CTTTAAAGAA AAGTATTCCT AATCTAAGTA TCTTATCTAC GGAGGGCCCT
GGTGATTATC ATCCAAATAT AAATTTTTTG AAAAAATTTC CTTCATTATA TAATTTATAC
ATAGATTCGT TAGAAGATAA TAATCAAACA CCTATTATAG TTTCAAGAAA TTTATATCCT
CCTCGAGTTA ATGATTTACA TTCTAGAATT AATATTCTTC ATGCCTATGG TTGGGAAGAA
TCAGAGATTC CGTCTGAATG GATAGCTGAA TTTAATCTAT ATTTAGATGG AATTACTGTA
ATGTCTACTC AGGTAAAAAA GAATCTAATT GATTCTGGTT TTTATAAACC AGTTTCTGTT
TGTGGCCTAG GAGTTGATCA TATAATTAGG TCTCCAGAAT GTAAAAACTT TAATCTTCCA
GCTAGAAATT TTAAATTTTT GCATATATCC TCTTGTTTCC CTAGAAAAGG AGTTAAGGCT
TTATTAGATG CTTATGGTAA AGCCTTTACT ATTAATGATG ATGTTTCTTT AATCATTAAA
ACCTTTACTA ACCCACATAA TAAAATTCGT CATTTTCTAC AGGAATATAA AGAAAAAAAT
GCCAGTTTTC CACACGTAAT CCTCATTGAA GATGAGTATT CACTTCCTGA GATTAAGGCA
CTATATAAGA TATCTAATGC ATATGTTTCG CCTAGTCATG GAGAAGGCTT TGGCTTACCA
ATCGCTGAGG CAATGTCAAA TCAAATACCT GTAATTACTA CTTCATGGGG TGGACAACTA
GATTTTGTTT CTGAGAAAAA TGCATGGTTA ATTGATTTTA AGTTTGCATA TTCTGAGACT
CATTTCAAAC AATTTAATTC CGTATGGGCA GAACCTTCTT CTAATCATTT AGCGCAACAA
ATGAAGTTGC TTAAAGACGC CGATTCATCT GAAATAATAA AAAAAACAGA GATTGCTTAT
GACGAAATAA CTAGTAATTA TTCTTGGGAA AAGACAGCGA ATATCAATGT TAAATTCGTA
AATAAACTTT TAAAACATGA CGATATCAAT CATGTCAAAC TTGGAGTAAT TACGACATGG
AATGTTATGT GTGGTGTTGC TAGCTATACA TCCAATTTAC TTAAAAATTT AGATCATCAG
AAGTTTATAT TTGCTCCTTA CTCTGAGTCA ATCTTGAAGG ATGATTCGCC CAATATATGT
AGGTGTTGGG ATATCAATAA GCCGTTTACA GATCAAATTA AAAATAATAT TTTAAAGTAT
AAAATTACTA CGATTATTAT TGAGTTTAAC TATGGTTTTT TTGATTTTTC AAGTTTAAAT
GAATTGATTA AATTTTTATA TAAAAATAAT ATACTGATAA TTATTCAAAT GCATTCTACT
ATTGATCCGA TTGAAATTAA AAATAAATCT CTGAACACTA TTGTTCAATC TTTACATTTA
GCTGATCGAA TTTTAGTTCA TACCTGTTCA GATCTGAATA GATTAAAAAA GATTAATATT
ATTAATAACG TTTCGATCTT TCCACATGGT GTCTTAGACT TTCCTCTAGT TAATGAATAT
TCATTAAATA ACTTCTCTTT GAATAATATA TTTAATTTTA AACAGAAAAA ATTTAATTTT
GCTACTTCTG GATTTTCTCT ACCAAATAAA GGCTTTCTAG AGTTGGTAAA GACTGTTTCT
ATACTAGTAG AACAGAACCT AGATATTCAT TTCACTTTTT TTACTCCTAA TTATAATAAT
AACTTTGCTT TTTACTCTAA GGAAGTATCA AATTTAATAA AAGAATTAAA TTTAGAAAAC
TATATCTCTT TTGATTATAC CTATTATGCT GAGCATGAAA TCGTGAATTT CTTATCAAAA
ATGGATGCTA TTGTATATCC ATATCAATTT AGTAATGAAT CCTCAAGTGC ATCAGTAAGG
CAGGGTATAG CCAGTGGATC TAGAGTGATT GTAACGCCAA TTTCAATTTT TGAAGATGTA
ATTGATGTTG TTGACGTTCT TCCTGGTATC TCTCCAGAAG AGATGGCAGT AGGTATTATT
GATTGGATTA ATAGCAATCG ATATGAAAAA TATACAAATT CTAAAAACAA TCAATCAGTT
TTATTAAATA AGTGGAGAAA AAGTCATCTA TTCAGTAATT TATCAAGCAG GTTAAGCCGA
TTAATTACTG CATTAGAAGT CGATAGAAAT TTCCTCTAA
 
Protein sequence
MKIGIDIQGC QSEGSKSRGI GRYSLTLIRY LIKYSRDDDE FILIANKSLD SVDIDFLSFI 
TSCGSKVQYF EWIYPGATSG MSSSNTEKTA IAEQLRSYSF SLLNCDIILI TSFFEGFRDN
CIIEFDSNFD LPPIASIFYD LIPLLKPENY LDSNSEFKQF YLSRLDLLNE VSCLLSISNS
SSKEAEKYLS IDKKDIHNIY AGCDQSTFYP RKLIEDSNKT TYSLGKYILY SGAGDPRKNI
QRLVEAYSLL AKDIIWNYKL VLVGKLLPEE ISLIKSWITS VNLTENNIVL LGYVSDDELA
SLYRNCTLFV FPSLHEGFGL PALEAMSCGA VVLGSNTTSI PEVIQNESAL FDPENVNEMA
ELISKALTNK LFYKELSSNL IKRASKFTWE NTALKALNAM RVTIMKSQKS SLSNNDITSI
LSFKNKQYDL MLDNIVNILA TANTLSNDED YLVNLAASIS SCELNSKYLK QFKINDLDFP
TWRIEGPFDS NYSLAILNRE LSLALKKSIP NLSILSTEGP GDYHPNINFL KKFPSLYNLY
IDSLEDNNQT PIIVSRNLYP PRVNDLHSRI NILHAYGWEE SEIPSEWIAE FNLYLDGITV
MSTQVKKNLI DSGFYKPVSV CGLGVDHIIR SPECKNFNLP ARNFKFLHIS SCFPRKGVKA
LLDAYGKAFT INDDVSLIIK TFTNPHNKIR HFLQEYKEKN ASFPHVILIE DEYSLPEIKA
LYKISNAYVS PSHGEGFGLP IAEAMSNQIP VITTSWGGQL DFVSEKNAWL IDFKFAYSET
HFKQFNSVWA EPSSNHLAQQ MKLLKDADSS EIIKKTEIAY DEITSNYSWE KTANINVKFV
NKLLKHDDIN HVKLGVITTW NVMCGVASYT SNLLKNLDHQ KFIFAPYSES ILKDDSPNIC
RCWDINKPFT DQIKNNILKY KITTIIIEFN YGFFDFSSLN ELIKFLYKNN ILIIIQMHST
IDPIEIKNKS LNTIVQSLHL ADRILVHTCS DLNRLKKINI INNVSIFPHG VLDFPLVNEY
SLNNFSLNNI FNFKQKKFNF ATSGFSLPNK GFLELVKTVS ILVEQNLDIH FTFFTPNYNN
NFAFYSKEVS NLIKELNLEN YISFDYTYYA EHEIVNFLSK MDAIVYPYQF SNESSSASVR
QGIASGSRVI VTPISIFEDV IDVVDVLPGI SPEEMAVGII DWINSNRYEK YTNSKNNQSV
LLNKWRKSHL FSNLSSRLSR LITALEVDRN FL