Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01861 |
Symbol | |
ID | 4780000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 173742 |
End bp | 175376 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640083450 |
Product | beta-N-acetylglucosaminidase |
Protein accession | YP_001014015 |
Protein GI | 124024899 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.314761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAT CTGATCTTAG AAGACAAGTA GCTGAATTAT TTATAGTTCG AGCAAGTGGA TTTAATCTTG ATTCGCAACG TCTATACCCT AACTTAGAGG AATCTAATTC AAATCTAAAA AGACTTTTAG AAGAAGGTGT TGGTGGAGTT ATTGTTCTAG GAGGAACTGT AAAAGAATTA GAAATTCGTT GCAATATCTT AAAAAAATGG TCTGGTAAAC CTCTTCTTTT ATGTGCTGAC ATTGAGGAAG GTGTTGGTCA AAGATTTTAT GGAGGAATAA AGTTTGTTCC TCCAATGGGT ATTTCTCAGA TTTATAAAAA AGATCAAACT TTAGCAATTT CTATTGCTGA GAAAATTGGT TATTTTACTG GTAAAGAAGC AAAAAAGATT GGTTTAAATT GGCTATTAGC CCCAGTCTGT GATATTAATA ACAATCCAAA TAACCCAGTT ATAAATCTAA GAGCTTGGGG AGAAGAGCCT GAAACAGTAA AAAGTTTAAC TTGTGCTTTT CAGCGCGGTG TTTCTAGATC AAAAATGCTG ACTTGTGCGA AACATTTTCC TGGGCATGGA AATTCTGAAG TTGACTCTCA CTTGGATTTG CCAGAAATAC ATAATGACTT ATCTAAATTA GAGAAATTTG AGTTAATTCC ATTTAAGTCT TTAATCAATC AAGGAGTAAA TAGTGTCATG ATCGGGCATT TACTTTTTCC AAAGATTGAT CCTATTTTTC CTGCAACACT TTCAAAAAGA GTGGTTACTG ATTTGTTACG TATCAAATTT AAATACGATG GTTTAGTAGT CAGTGATGCC TTAGTTATGA ATGCAATAAC AAATAAATAT AGTAGTGGTG AGGCTGCAGT TATGGCATTT GATGCAGGAA TTGATTTGAT TATGATGCCA AAAGATATTG ATGAGGCAAT TGATTCTCTT GCCGATGCTT TTTATTCAGG AAAAATTTCT TTAGAAAGGT TAAATATATC TAGAGAAAGA AGAAAAAAAC AACTTGATTT AGTTAGCAAC GAAGATGATT TTAAAAAAGA AGATTTGAAC AATGAAGATA TTAAAAATGA ATTTTTATTG GATGCTTCTA AATTTAGTAA TTCTATAATA AAAAGTTCAA TTTTTGTCCG AGAAGAAAGT ACTATAAAAG CTGAATTTAA TGATATAAAT CTTATACAAA TTGATAATTT TGATCAAGTA CCTAATAAAT TTATTCCTGC ATTAGATATC CCTAAGGCAG TAGGTTTTAA AAATTTAATT ATACATCCAC TTGGTATCAG TCCGTGGGGA AAAACTAATA AGAAATTTTT AGAAATGGGG CAATTTAGCA ATAGTAAGAT TCTTGTTCAG CTTTTTGTGA GAGGTAAACC ATTTATTGGA TTAGATTATC ATAATGATCA TTGGATAGAT GCACTAAAAA GTTTAGAAAT TGAGGAAAGA TTATCAGGAA TTATAATTTA TGGGTGTCCA TATTTATTTG ATAAAATAAA AAAATCTATT CATAAAAATA TTCCTTTAGC TTATAGCCCT AGTCAAACAG AGGAAGCACA AAATCAAATT TTAAGTCGTA TTTTGCAATC AAAAACAACT CAAAAGGAAA TTGATAAAGA ATCAAGCATA GAATTTACTG ATTGA
|
Protein sequence | MNKSDLRRQV AELFIVRASG FNLDSQRLYP NLEESNSNLK RLLEEGVGGV IVLGGTVKEL EIRCNILKKW SGKPLLLCAD IEEGVGQRFY GGIKFVPPMG ISQIYKKDQT LAISIAEKIG YFTGKEAKKI GLNWLLAPVC DINNNPNNPV INLRAWGEEP ETVKSLTCAF QRGVSRSKML TCAKHFPGHG NSEVDSHLDL PEIHNDLSKL EKFELIPFKS LINQGVNSVM IGHLLFPKID PIFPATLSKR VVTDLLRIKF KYDGLVVSDA LVMNAITNKY SSGEAAVMAF DAGIDLIMMP KDIDEAIDSL ADAFYSGKIS LERLNISRER RKKQLDLVSN EDDFKKEDLN NEDIKNEFLL DASKFSNSII KSSIFVREES TIKAEFNDIN LIQIDNFDQV PNKFIPALDI PKAVGFKNLI IHPLGISPWG KTNKKFLEMG QFSNSKILVQ LFVRGKPFIG LDYHNDHWID ALKSLEIEER LSGIIIYGCP YLFDKIKKSI HKNIPLAYSP SQTEEAQNQI LSRILQSKTT QKEIDKESSI EFTD
|
| |