Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21071 |
Symbol | |
ID | 4781116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1758539 |
End bp | 1762237 |
Gene Length | 3699 bp |
Protein Length | 1232 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640085403 |
Product | hypothetical protein |
Protein accession | YP_001015927 |
Protein GI | 124026812 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG GAATTGATAT TCAGGGTTGT CAATCTGAAG GCAGCAAAAG TCGAGGGATT GGTAGATACT CATTAACTTT AATTCGTTAT TTAATCAAAT ATTCAAGAGA TGATGATGAG TTTATTCTTA TAGCTAATAA ATCTTTAGAT TCTGTTGATA TAGATTTTCT TTCTTTTATA ACTTCTTGTG GTTCAAAAGT GCAATACTTT GAATGGATTT ACCCTGGGGC TACTTCAGGT ATGTCTAGCT CTAATACCGA AAAAACAGCA ATAGCAGAGC AGTTAAGGTC TTATTCTTTT TCTCTTTTAA ACTGCGACAT TATCTTAATT ACTAGTTTTT TTGAAGGATT CAGGGATAAT TGCATAATTG AATTTGATAG CAACTTTGAT TTACCTCCTA TTGCGTCAAT ATTTTATGAT CTAATTCCGT TGTTAAAACC AGAAAACTAT TTAGATAGCA ACAGTGAATT TAAACAATTC TATCTTTCTA GATTAGATCT TTTAAACGAG GTTAGCTGTC TACTTTCGAT CTCAAACTCT TCATCTAAAG AAGCAGAAAA ATATCTTTCT ATTGACAAAA AAGATATACA TAATATTTAT GCTGGCTGTG ATCAGAGTAC TTTCTATCCC AGAAAATTAA TAGAAGATTC AAATAAAACC ACATATAGTT TAGGTAAATA TATTTTGTAT TCTGGCGCAG GTGATCCTCG AAAAAATATT CAAAGATTGG TAGAGGCCTA TTCTCTCTTA GCTAAAGATA TTATCTGGAA CTATAAGTTA GTTCTCGTAG GTAAACTTTT GCCAGAAGAA ATTTCTCTTA TAAAAAGTTG GATTACCTCT GTTAACTTAA CTGAAAATAA TATTGTTTTA CTTGGTTATG TTTCTGATGA CGAATTAGCA TCTTTATATA GAAATTGTAC CCTTTTTGTA TTTCCCTCGT TGCATGAAGG ATTTGGGTTG CCTGCTCTAG AAGCTATGTC TTGCGGGGCT GTTGTTCTTG GTTCGAATAC TACTAGTATT CCAGAAGTTA TTCAGAACGA GTCTGCTTTA TTTGATCCAG AGAATGTTAA TGAAATGGCA GAATTAATCT CTAAGGCTTT AACTAATAAA TTATTTTACA AAGAATTATC ATCTAATTTA ATTAAAAGAG CATCTAAGTT TACCTGGGAG AATACTGCCT TAAAAGCTTT AAATGCTATG AGAGTAACTA TAATGAAATC TCAAAAAAGT TCGTTATCAA ATAATGACAT TACTTCGATT TTATCTTTTA AGAATAAACA ATATGATCTA ATGTTGGATA ATATTGTTAA TATTTTAGCA ACCGCTAATA CTTTATCCAA TGATGAAGAC TATCTAGTTA ATTTAGCCGC TAGTATAAGT TCTTGTGAAT TAAATTCAAA ATATCTTAAA CAATTTAAAA TTAATGATTT AGATTTCCCG ACTTGGAGAA TAGAAGGACC ATTTGACTCG AACTATAGCC TTGCAATTCT AAATAGAGAA TTATCATTGG CTTTAAAGAA AAGTATTCCT AATCTAAGTA TCTTATCTAC GGAGGGCCCT GGTGATTATC ATCCAAATAT AAATTTTTTG AAAAAATTTC CTTCATTATA TAATTTATAC ATAGATTCGT TAGAAGATAA TAATCAAACA CCTATTATAG TTTCAAGAAA TTTATATCCT CCTCGAGTTA ATGATTTACA TTCTAGAATT AATATTCTTC ATGCCTATGG TTGGGAAGAA TCAGAGATTC CGTCTGAATG GATAGCTGAA TTTAATCTAT ATTTAGATGG AATTACTGTA ATGTCTACTC AGGTAAAAAA GAATCTAATT GATTCTGGTT TTTATAAACC AGTTTCTGTT TGTGGCCTAG GAGTTGATCA TATAATTAGG TCTCCAGAAT GTAAAAACTT TAATCTTCCA GCTAGAAATT TTAAATTTTT GCATATATCC TCTTGTTTCC CTAGAAAAGG AGTTAAGGCT TTATTAGATG CTTATGGTAA AGCCTTTACT ATTAATGATG ATGTTTCTTT AATCATTAAA ACCTTTACTA ACCCACATAA TAAAATTCGT CATTTTCTAC AGGAATATAA AGAAAAAAAT GCCAGTTTTC CACACGTAAT CCTCATTGAA GATGAGTATT CACTTCCTGA GATTAAGGCA CTATATAAGA TATCTAATGC ATATGTTTCG CCTAGTCATG GAGAAGGCTT TGGCTTACCA ATCGCTGAGG CAATGTCAAA TCAAATACCT GTAATTACTA CTTCATGGGG TGGACAACTA GATTTTGTTT CTGAGAAAAA TGCATGGTTA ATTGATTTTA AGTTTGCATA TTCTGAGACT CATTTCAAAC AATTTAATTC CGTATGGGCA GAACCTTCTT CTAATCATTT AGCGCAACAA ATGAAGTTGC TTAAAGACGC CGATTCATCT GAAATAATAA AAAAAACAGA GATTGCTTAT GACGAAATAA CTAGTAATTA TTCTTGGGAA AAGACAGCGA ATATCAATGT TAAATTCGTA AATAAACTTT TAAAACATGA CGATATCAAT CATGTCAAAC TTGGAGTAAT TACGACATGG AATGTTATGT GTGGTGTTGC TAGCTATACA TCCAATTTAC TTAAAAATTT AGATCATCAG AAGTTTATAT TTGCTCCTTA CTCTGAGTCA ATCTTGAAGG ATGATTCGCC CAATATATGT AGGTGTTGGG ATATCAATAA GCCGTTTACA GATCAAATTA AAAATAATAT TTTAAAGTAT AAAATTACTA CGATTATTAT TGAGTTTAAC TATGGTTTTT TTGATTTTTC AAGTTTAAAT GAATTGATTA AATTTTTATA TAAAAATAAT ATACTGATAA TTATTCAAAT GCATTCTACT ATTGATCCGA TTGAAATTAA AAATAAATCT CTGAACACTA TTGTTCAATC TTTACATTTA GCTGATCGAA TTTTAGTTCA TACCTGTTCA GATCTGAATA GATTAAAAAA GATTAATATT ATTAATAACG TTTCGATCTT TCCACATGGT GTCTTAGACT TTCCTCTAGT TAATGAATAT TCATTAAATA ACTTCTCTTT GAATAATATA TTTAATTTTA AACAGAAAAA ATTTAATTTT GCTACTTCTG GATTTTCTCT ACCAAATAAA GGCTTTCTAG AGTTGGTAAA GACTGTTTCT ATACTAGTAG AACAGAACCT AGATATTCAT TTCACTTTTT TTACTCCTAA TTATAATAAT AACTTTGCTT TTTACTCTAA GGAAGTATCA AATTTAATAA AAGAATTAAA TTTAGAAAAC TATATCTCTT TTGATTATAC CTATTATGCT GAGCATGAAA TCGTGAATTT CTTATCAAAA ATGGATGCTA TTGTATATCC ATATCAATTT AGTAATGAAT CCTCAAGTGC ATCAGTAAGG CAGGGTATAG CCAGTGGATC TAGAGTGATT GTAACGCCAA TTTCAATTTT TGAAGATGTA ATTGATGTTG TTGACGTTCT TCCTGGTATC TCTCCAGAAG AGATGGCAGT AGGTATTATT GATTGGATTA ATAGCAATCG ATATGAAAAA TATACAAATT CTAAAAACAA TCAATCAGTT TTATTAAATA AGTGGAGAAA AAGTCATCTA TTCAGTAATT TATCAAGCAG GTTAAGCCGA TTAATTACTG CATTAGAAGT CGATAGAAAT TTCCTCTAA
|
Protein sequence | MKIGIDIQGC QSEGSKSRGI GRYSLTLIRY LIKYSRDDDE FILIANKSLD SVDIDFLSFI TSCGSKVQYF EWIYPGATSG MSSSNTEKTA IAEQLRSYSF SLLNCDIILI TSFFEGFRDN CIIEFDSNFD LPPIASIFYD LIPLLKPENY LDSNSEFKQF YLSRLDLLNE VSCLLSISNS SSKEAEKYLS IDKKDIHNIY AGCDQSTFYP RKLIEDSNKT TYSLGKYILY SGAGDPRKNI QRLVEAYSLL AKDIIWNYKL VLVGKLLPEE ISLIKSWITS VNLTENNIVL LGYVSDDELA SLYRNCTLFV FPSLHEGFGL PALEAMSCGA VVLGSNTTSI PEVIQNESAL FDPENVNEMA ELISKALTNK LFYKELSSNL IKRASKFTWE NTALKALNAM RVTIMKSQKS SLSNNDITSI LSFKNKQYDL MLDNIVNILA TANTLSNDED YLVNLAASIS SCELNSKYLK QFKINDLDFP TWRIEGPFDS NYSLAILNRE LSLALKKSIP NLSILSTEGP GDYHPNINFL KKFPSLYNLY IDSLEDNNQT PIIVSRNLYP PRVNDLHSRI NILHAYGWEE SEIPSEWIAE FNLYLDGITV MSTQVKKNLI DSGFYKPVSV CGLGVDHIIR SPECKNFNLP ARNFKFLHIS SCFPRKGVKA LLDAYGKAFT INDDVSLIIK TFTNPHNKIR HFLQEYKEKN ASFPHVILIE DEYSLPEIKA LYKISNAYVS PSHGEGFGLP IAEAMSNQIP VITTSWGGQL DFVSEKNAWL IDFKFAYSET HFKQFNSVWA EPSSNHLAQQ MKLLKDADSS EIIKKTEIAY DEITSNYSWE KTANINVKFV NKLLKHDDIN HVKLGVITTW NVMCGVASYT SNLLKNLDHQ KFIFAPYSES ILKDDSPNIC RCWDINKPFT DQIKNNILKY KITTIIIEFN YGFFDFSSLN ELIKFLYKNN ILIIIQMHST IDPIEIKNKS LNTIVQSLHL ADRILVHTCS DLNRLKKINI INNVSIFPHG VLDFPLVNEY SLNNFSLNNI FNFKQKKFNF ATSGFSLPNK GFLELVKTVS ILVEQNLDIH FTFFTPNYNN NFAFYSKEVS NLIKELNLEN YISFDYTYYA EHEIVNFLSK MDAIVYPYQF SNESSSASVR QGIASGSRVI VTPISIFEDV IDVVDVLPGI SPEEMAVGII DWINSNRYEK YTNSKNNQSV LLNKWRKSHL FSNLSSRLSR LITALEVDRN FL
|
| |