Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01101 |
Symbol | |
ID | 4780843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 106831 |
End bp | 108666 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640083373 |
Product | hypothetical protein |
Protein accession | YP_001013939 |
Protein GI | 124024823 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGCA CTAGCAAAAC ATATCATTTA TTAATGCGCC TACTGAAGGC ACTCCCAGTT AGAAGAAGAA GTTCTCTGTT GAAATTAATT CCTGTAGCAG CCTTTACTGG TTTAGTCGAT GTGATTGTAG TTGGAATTGT TTCTAGATTA TTTACTGTTT TTATAGGTCA ACCTAATCAA CCGCCCTTAC CATTTCAACA TTTTATACCT GAAGACCCAA AAACAAAAGT CATCAGTCTA GTCGTCATAT ACATAGCAAT GAATTGGCTA GCATCATTTT CGAAATTATT TCTTAAAGCA GCACAAGAAA GACTTCGAGT AGCAGTTTGG AAGGACTTAT CAGAACTAGC TCAGAAAAAA TTATTATCTC AATCATATGA ATTCTTCTTA AACAAGAAAA AATCTGATTT ATCATCAAAA GTTTTAATCA ACATTTCTAG AGTCTCAGAA TTCCTAGTCA AACCAATCCT TCAGATTTCT AGTGGGTTAT GTGTGATTAC TTTTATATGT ATTGCTGTTC TTTTTATAGC AAAATCGATT GCTCTTTATT TAATAATAAG TCTCTTAATA TTTTATATAT TTATTTCATC TTTTGTAACC CCTTTTATAA GATATTCTTC TCGTAAAAGA ATCAAATTAG AAAAAGAAAC AAATAATATA CTCACAGAAT CAATGCGCAC CATTATAGAT GTTCACCTTA CTAGCTCAGA ACCATACTTT GAAAAACGCT ATAAATTAGC AAGCAAAAGC TCTTTCCCTT TCATGTGGAA AGCTGAGGTT TTACCTGAAT TACCTAGGTC ATTAATAGAG CCATTTGGTA TCACTTTAAT TTTTGCTATT GGTCTTTTCC CATACATCAC TGGGGAAAAC GATTCAATAC TTATTGAGAT AGTTCCATTC TTAGCAACAA TTGCAGTAGC TGCATTAAAA CTAACTCCGC CATTACAAGA TTCATTTAGA GCATTAACTT CAATGCGAGC ATCAATACCT GATTTAGAGG AGATACTAAA GTTGATAGAA CTTCCTTCTA CTAGGCTAAC TAAAAGATCC ATAGGCGTTC CGACAAAAGA AGGAATTCAG CCTAGAAACA ACATAAAGCT TGAGAAATTG AGTTATAAGT ATCCCAACAG CAACGAATAC ACCTTAAAGG GTATCAACCT TACTATTCCT ATTGGTTCAC GAATAGCTTT TGTAGGAGAA ACTGGAAGTG GAAAAACCAC TACCGCTAAT CAATTACTAT GTCTTCTTAG ACCAACAGAC GGACATTTAC TATTGGATGG AGTTGCAGTT ACTGATACAG AAGTGCCTGC TTGGCAAGAT TGTTGCTCTT ACGTTCCCCA ATCAATCACC TTATTAAATA GCAATATTAT TCAAAATATT GCATATGGTT TAGATGAAAA AATAATTGAT CATGGAAGGG TCTGGGATGC GCTTAGAGCA GCTCAATTAG CAGATTTGGT ATCAGAAATG CCAATGGGTT TACATTCCTC AGTTGGTGAT AATGGCATCA GATTATCTGG TGGACAAAGA CAGCGACTAG CCATAGCAAG AGCTTTTTAT AGGCAATCAA AATTATTAGT TTTAGATGAA GCAACTAGTG CCTTAGATAA CCGAACAGAA GCTGAGGTAA TGAATGCAAT AGAAATAATA GGTAGACGTT GCACAATAGT CACAATTGCT CACAGATTAT CTACAATCGA AAGATCAGAT TGTATATATG AATTTAAAGA TGGAGAAATA GTTTCCTTTG GAAATTACCA ACAATTACTA AAGCAATCTA AAACTTTTTT TAATATGGTA GAAATAGCAA AAAGAACATA CGGATCTAAT ATATAA
|
Protein sequence | MIGTSKTYHL LMRLLKALPV RRRSSLLKLI PVAAFTGLVD VIVVGIVSRL FTVFIGQPNQ PPLPFQHFIP EDPKTKVISL VVIYIAMNWL ASFSKLFLKA AQERLRVAVW KDLSELAQKK LLSQSYEFFL NKKKSDLSSK VLINISRVSE FLVKPILQIS SGLCVITFIC IAVLFIAKSI ALYLIISLLI FYIFISSFVT PFIRYSSRKR IKLEKETNNI LTESMRTIID VHLTSSEPYF EKRYKLASKS SFPFMWKAEV LPELPRSLIE PFGITLIFAI GLFPYITGEN DSILIEIVPF LATIAVAALK LTPPLQDSFR ALTSMRASIP DLEEILKLIE LPSTRLTKRS IGVPTKEGIQ PRNNIKLEKL SYKYPNSNEY TLKGINLTIP IGSRIAFVGE TGSGKTTTAN QLLCLLRPTD GHLLLDGVAV TDTEVPAWQD CCSYVPQSIT LLNSNIIQNI AYGLDEKIID HGRVWDALRA AQLADLVSEM PMGLHSSVGD NGIRLSGGQR QRLAIARAFY RQSKLLVLDE ATSALDNRTE AEVMNAIEII GRRCTIVTIA HRLSTIERSD CIYEFKDGEI VSFGNYQQLL KQSKTFFNMV EIAKRTYGSN I
|
| |