Gene Information |
Locus tag | NATL1_00841 |
Symbol | |
ID | 4779378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 85539 |
End bp | 90293 |
Gene Length | 4755 bp |
Protein Length | 1584 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640083347 |
Product | hypothetical protein |
Protein accession | YP_001013913 |
Protein GI | 124024797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
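The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon; the function names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information table above.
START_BP = 85539
END_BP = 90293
STATED_GENE_LENGTH = 4755      # bp
STATED_PROTEIN_LENGTH = 1584   # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == STATED_GENE_LENGTH)        # True
print(computed_protein_length == STATED_PROTEIN_LENGTH)  # True
```

The strand field (`-`) does not affect this arithmetic: the coordinates refer to positions on the replicon, and the length is the same on either strand.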
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCT CTCCTGGTCC TGCTACTGAA GCTGCACCTG CCGGCGATCC TGCCCCCGTT GCTGAAGCTG CACCTGCCGG CGATCCTGCC CCCGTTGCTG AAGCTGCACC TGCCGGCGAT CCTGCCCCCG TTGCTGAAGC TGCTCCAGTT AGTGATGCCG CAGCTGAGGC AGCAGCCGCA GATCCTGACG CTGCAGCCGC AGCCGCTGCA CCTGTTGGCG ATGCTGCGCC GGTAGGAGAC GACAGCTCTC CTAACGACGA AGGTCCGACA GAAGACGCAC CGTCTCTTGG ACCGAAAGGA GATGAGCCAC CAACAGGAGA TGTAGCACCT GATCAATCAG GCGATCCTGC ACCCACAGGT GACGACAGCT CTCCTGTCGA CGAAGGCCCG ACAGAAGACG CACCGTCTCT TGGACCGAAA GGAGATGAGC CACCAACAGG AGATGTAGCA CCTGATCAAT CAGGCGATCC TGCACCCACA GGTGACGACA GCTCTCCTGT CGACGAAGGC CCGACAGAAG ACGCACCGTC TCTTGGACCG AAAGGAGATG AGCCACCAAC AGGAGATGTA GCACCTGATC AATCAGGCGA TCCTGCACCC ACAGGTGACG ACAGCTCTCC TGTCGACGAA GGCCCGACAG AAGACGCACC GTCTCTTGGA CCGAAAGGAG ATGAGCCACC AACAGGAGAT GTAGCACCTG ATCAATCAGG CGATCCTGCA CCCACAGGTG ACGCAGCAAC AGCTCCTGTT AGTGATGCCG TTGCTGAGGC GGCAGCCGCA GATCCTGACG CTGCAGCCGC AGCCGCTGCA CCAGTCCCTC CTCCAGCTGA GATTGGAGGA GTAGCTCCAA GTGAGTTAGC AAAAGATGAT ATTGCCAATC TTGATTCAGA TGTAATTGAA GACCTTAAGG AAGATCAAGT AGCCGCGCTT GATGCGACAG CTGTTGGCGG TTTTAAGGAA GATCAAGTGG CTGCGCTCGA TGCGACAGCT GTTGGCGGTT TTAAGGAAGA TCAAGTAGCC GCGCTTGATG CGACAGCTGT TGGCGGTTTT CAGGAAGATC AAATAGCCGC GCTTGATGCG ACAGCTGTTG GCGGTTTTCA GGAAGATCAA GTAGCCGCGC TTGATGCGAC AGCTGTTGGC GGTTTTCAGG AAGATCAAAT AGCCGCGCTT GATGCGACAG CTGTTGGCGG TTTTCAGGAA GATCAAATAG CCGCGCTTGA TGCGACAGCT GTTGGCGGTT TTCAGGAAGA TCAAATAGCC GCGCTTGATG CGACAGCTGT TGGCGGTTTT CAGGAAGATC AAGTAGCCGC GCTTGATGCG ACAGCTGTTG GCGGTTTTCA GGAAGATCAA GTAGCCGCGC TTGATGCGAC AGCTGTTGGC GGTTTTCAGG AAGATCAAAT AGCCGCGCTC GATGCGACAG CTGTTGGCGG TTTTCAGGAA GATCAAATAG CTGCGCTTGA TGCGACAGCT GTTGGCGGTT TTCAGGAAGA TCAAATAGCT GCGCTTGATG CGACAGCTGT TGGCGGTTTT CAGGAAGATC AAGTAGCCGC GCTCGATGCG ACAGCTGTTG GTGGTTTTCA GGAAGATCAA ATAGCTGCGC TCGATGCGAC AGCTGTTGGT GGTTTTCAGG AAGATCAAAT AGCCGCGCTC GATGCGACAG CTGTTGGCGG TTTTCAGGAA GATCAAATAG CTGCGCTTGA TGCGACAGCG GTAGCTGGTT TTGATGAAAC GCATGTTGCA GCATTTGATC CACTAGCGGT CGCTGGTTTT GATGAAACGC ATGTTGCAGC ATTTGATCCA 
ACAGCGGTCG CTGGTTTTGA TGAAACGCAT GTTGCAGCAT TTGATCCACT AGCGGTCGCT GGTTTTGATG AAACGCATGT TGCAGCATTT GATCCACTAG CGGTCGCTGG TTTTGACGAA AAACATATTG CTGCTATTGA TACACAAGCA ATTACTGGAT TTAATGCCGA TCATGTTGCT GCGCTTGATG CTCAGGCAAT GACGGGGTTA GCCAAAGATC AATTCGTAGC CTTTGAACCA ACTGCGATGG CTGGCTTTAA TGCTGATCAT ATTGCTGCGA TTGACCATAG CTACATGACA GGTCTTGGTA AAGATCATGT TGCAGGTTTT GATCCAACAG CGGTCGCTGG TTTTGATGAA GCGCATGTTG CAGCATTTGA TCCACTAGCG GTCGCTGGTT TTGATGAAAC CCATTTTGCA GCATTTGATC CACTAGCGGT CGCTGGTTTT GATGAAACGC ATGTTGCAGC ATTTGATCCA CTAGCGGTCG CTGGTTTTGA TGAAACGCAT GTTGCAGCAT TTGATCCAAC AGCGGTCGCT GGTTTTGATG AAACGCATGT TGCAGCATTT GATCCACTAG CGGTCGCTGG TTTTGACGAA AAACATATTG CTGCTATTGA TACACAAGCA ATTACTGGAT TTAATGCCGA TCATGTTGCT GCGCTTGATG CTCAGGCAAT GACGGGGTTA GCCAAAGATC AATTCGTAGC CTTTGAACCA ACTGCGATGG CTGGCTTTAA TGCTGATCAT ATTGCTGCGA TTGACCATAG CTACATGACA GGTCTTGGTA AAGATCATGT TGCAGGTTTT GATCCAACAG CGGTCGCTGG TTTTGATGAA GCGCATGTTG CAGCATTTGA TCCACTAGCG GTCGCTGGTT TTGATGAAAC CCATTTTGCA GCATTTGATC CAACAGCGGT CGCTGGTTTT GATGAAACGC ATGTTGCAGC ATTTGATCCA ACAGCGGTCG CTGGTTTTGA TGAAACGCAT GTTGCAGCAT TTGATCCACT AGCGGTCGCT GGTTTTGATG AAACGCATGT TGCAGCATTT GAGATATCTG CAATGGAAGG ATTCAATCCC ACACATGTTG CTTCATTTAA TCCAGAGGCA ATGGCTGGAT TTAAGGGTAC ACAACTTAAA GAATTAGATC CAGAATCTTT TGCTGCTCTA ACAACTGAGC AAGCCGCAGA AATGACGCCT GACGCAGCAG CAGCCTTTAA AAATTGGGTC CTTCCAGATG ACGATATTAT TCGCTGGGAA GGAGGGTTAA GAGGCCCTGA TGGTGAGTGG ATGTCATCTG ATATTTACTT TGATAAAAGA ACAACAGGAG ATATCCCCGC CGAAACAACT GTTTGGACTC CGCCAGAAGT GGATATAATC GCTAAGGGAG GGGGCTTTAC AAAAGCAGAT GGAACTTTTG TTGAAGAAGA TGATTATTTT AAAGACCCAA CATCAGCTGG ATGGACTGTT CCCCCAGATG ATTTAATTAA AAAGGATGGT GGTTTTAGGA ACCCAAATGG TAAGTGGGTT ACTGCAAAGG AATTTATAAA TAATACTGGA GAAATACCTG AAACTGATGA AATCAAAAAA AATGGTGGGT ATTTTGATGC AGATAAACAA TGGGTTTCAT CAATATCACA TTTTGGAGAA GGAAGTGATG AAGTTCAAAA TGTTGCTTGG ATACCACCCG AGACAACTGA GATTAATACT GATGGAGGTT TTTGGGATCC CTTTGGCCAA TGGGTAACAT CGGAAGATTA TGAAAAAGAA GGATGGAAAG CACCAGAAGA TACGGTTAAA CCTGTAGTCC 
CAGAGGGGAT TACGGCTGAA GAAATCAAAA TCGAAGGTGG TTATACGGCT ACTGATGGTA GTTGGGTTAC AGATGCCAGC TTTTTTGATA CGACTTTGGA TAATGAAGTC AATAAAGTTG CTGGATATTG GGGGGCAACC AAAGATCCAA TTAAAGAAGC TGCTTTAATT GGTAGTGAAG AAGACACACT TGTTGATCCA ACAGAGGTTG AGCTAGAGAA AAAATTAATT AGTGATGTCG CTTATCCATG GAAAGATATC AAGGACTTAA AGGGTTCTAC CGCCACTGAA AATAGTGATT CGACAAGTCA TTGGGCAACG TTAGTTAAAA GCAAAGAAGA TGGTGATGAC AGGTTGGAAG GGGGAGAATC TTCCGACAAG ATTTTTGGAG GTTTGGGCTC AGACTTTATT GATGGTGGAG AAGGTGAAGA TATCGCTTTT TATGCAGGTA ATTTTGAGGA TTATAAATTT GATAGAACGA AAGATACAGT CTCTTTAGAA GATCAAAGAG AGGGACTTAA TGATGGGAAC GACGTTTTAA AGAATGTCGA ATACATCCAA TTTGCTGATC AGAAAGTTGA TGTTTCAAAG CTAGATATTG TGAAAACATA TACCGGAGAA AGTAAAGATT TTAAATTTTA TAAAAGAGAT GATGGGAGTA TAGAAGTGAA AACAGAAGAT GGATTTGATG ATATTACTGG TGTACCCAAG CTGGAATTTG ACGATAAGAC ATTTAGTGGA ATCAGCGACA TTGAAGAGAC GTTTAATCAA GTAACATCTA AAGATGATGA GACGGGACAA ATGTTTAGGG TTTATAACGC AGCTTTTGCA AGATTTCCCG ATGCAGATGG TCTCGAATAT TGGATTGATA AAAATCAATC TGGAGAAAAT AGTAATAGGC AGGTGGCTGA TTCTTTTTTA GGCTCTGAGG AATTCAAAGA AACCTACGGT GAAGATGTTG ATACGGGTAC GTACGTTAAT ACGCTATATA AAAATATTTT AGGTAGGGAA GCTGATCAGG AAGGTTATAA TTATTGGGTA ACTCAATTAG ATAGTGGTCA AGAAAATAGA GGAGAATTGC TTTTAGGGTT TGCAGAATCA GTAGAGAACA AGGCTCTTTT CTCTGAGGTT ACAGGGTTAT TTTAA
|
Protein sequence | MTFSPGPATE AAPAGDPAPV AEAAPAGDPA PVAEAAPAGD PAPVAEAAPV SDAAAEAAAA DPDAAAAAAA PVGDAAPVGD DSSPNDEGPT EDAPSLGPKG DEPPTGDVAP DQSGDPAPTG DDSSPVDEGP TEDAPSLGPK GDEPPTGDVA PDQSGDPAPT GDDSSPVDEG PTEDAPSLGP KGDEPPTGDV APDQSGDPAP TGDDSSPVDE GPTEDAPSLG PKGDEPPTGD VAPDQSGDPA PTGDAATAPV SDAVAEAAAA DPDAAAAAAA PVPPPAEIGG VAPSELAKDD IANLDSDVIE DLKEDQVAAL DATAVGGFKE DQVAALDATA VGGFKEDQVA ALDATAVGGF QEDQIAALDA TAVGGFQEDQ VAALDATAVG GFQEDQIAAL DATAVGGFQE DQIAALDATA VGGFQEDQIA ALDATAVGGF QEDQVAALDA TAVGGFQEDQ VAALDATAVG GFQEDQIAAL DATAVGGFQE DQIAALDATA VGGFQEDQIA ALDATAVGGF QEDQVAALDA TAVGGFQEDQ IAALDATAVG GFQEDQIAAL DATAVGGFQE DQIAALDATA VAGFDETHVA AFDPLAVAGF DETHVAAFDP TAVAGFDETH VAAFDPLAVA GFDETHVAAF DPLAVAGFDE KHIAAIDTQA ITGFNADHVA ALDAQAMTGL AKDQFVAFEP TAMAGFNADH IAAIDHSYMT GLGKDHVAGF DPTAVAGFDE AHVAAFDPLA VAGFDETHFA AFDPLAVAGF DETHVAAFDP LAVAGFDETH VAAFDPTAVA GFDETHVAAF DPLAVAGFDE KHIAAIDTQA ITGFNADHVA ALDAQAMTGL AKDQFVAFEP TAMAGFNADH IAAIDHSYMT GLGKDHVAGF DPTAVAGFDE AHVAAFDPLA VAGFDETHFA AFDPTAVAGF DETHVAAFDP TAVAGFDETH VAAFDPLAVA GFDETHVAAF EISAMEGFNP THVASFNPEA MAGFKGTQLK ELDPESFAAL TTEQAAEMTP DAAAAFKNWV LPDDDIIRWE GGLRGPDGEW MSSDIYFDKR TTGDIPAETT VWTPPEVDII AKGGGFTKAD GTFVEEDDYF KDPTSAGWTV PPDDLIKKDG GFRNPNGKWV TAKEFINNTG EIPETDEIKK NGGYFDADKQ WVSSISHFGE GSDEVQNVAW IPPETTEINT DGGFWDPFGQ WVTSEDYEKE GWKAPEDTVK PVVPEGITAE EIKIEGGYTA TDGSWVTDAS FFDTTLDNEV NKVAGYWGAT KDPIKEAALI GSEEDTLVDP TEVELEKKLI SDVAYPWKDI KDLKGSTATE NSDSTSHWAT LVKSKEDGDD RLEGGESSDK IFGGLGSDFI DGGEGEDIAF YAGNFEDYKF DRTKDTVSLE DQREGLNDGN DVLKNVEYIQ FADQKVDVSK LDIVKTYTGE SKDFKFYKRD DGSIEVKTED GFDDITGVPK LEFDDKTFSG ISDIEETFNQ VTSKDDETGQ MFRVYNAAFA RFPDADGLEY WIDKNQSGEN SNRQVADSFL GSEEFKETYG EDVDTGTYVN TLYKNILGRE ADQEGYNYWV TQLDSGQENR GELLLGFAES VENKALFSEV TGLF
|
| |
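As a spot check, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene starts with ATG). The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 bases, copied from the gene sequence above.
first_30 = "ATGACCTTCT CTCCTGGTCC TGCTACTGAA"
last_15 = "ACAGGGTTAT TTTAA"

print(translate(first_30))  # MTFSPGPATE
print(translate(last_15))   # TGLF*
```

The outputs match the start (`MTFSPGPATE`) and end (`TGLF`) of the listed protein sequence, with `*` marking the TAA stop codon. The last 15 bases are in frame because the stated gene length, 4755 bp, minus 15 is a multiple of 3.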