Gene NATL1_01861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01861 
Symbol 
ID4780000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp173742 
End bp175376 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content30% 
IMG OID640083450 
Productbeta-N-acetylglucosaminidase 
Protein accessionYP_001014015 
Protein GI124024899 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.314761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT CTGATCTTAG AAGACAAGTA GCTGAATTAT TTATAGTTCG AGCAAGTGGA 
TTTAATCTTG ATTCGCAACG TCTATACCCT AACTTAGAGG AATCTAATTC AAATCTAAAA
AGACTTTTAG AAGAAGGTGT TGGTGGAGTT ATTGTTCTAG GAGGAACTGT AAAAGAATTA
GAAATTCGTT GCAATATCTT AAAAAAATGG TCTGGTAAAC CTCTTCTTTT ATGTGCTGAC
ATTGAGGAAG GTGTTGGTCA AAGATTTTAT GGAGGAATAA AGTTTGTTCC TCCAATGGGT
ATTTCTCAGA TTTATAAAAA AGATCAAACT TTAGCAATTT CTATTGCTGA GAAAATTGGT
TATTTTACTG GTAAAGAAGC AAAAAAGATT GGTTTAAATT GGCTATTAGC CCCAGTCTGT
GATATTAATA ACAATCCAAA TAACCCAGTT ATAAATCTAA GAGCTTGGGG AGAAGAGCCT
GAAACAGTAA AAAGTTTAAC TTGTGCTTTT CAGCGCGGTG TTTCTAGATC AAAAATGCTG
ACTTGTGCGA AACATTTTCC TGGGCATGGA AATTCTGAAG TTGACTCTCA CTTGGATTTG
CCAGAAATAC ATAATGACTT ATCTAAATTA GAGAAATTTG AGTTAATTCC ATTTAAGTCT
TTAATCAATC AAGGAGTAAA TAGTGTCATG ATCGGGCATT TACTTTTTCC AAAGATTGAT
CCTATTTTTC CTGCAACACT TTCAAAAAGA GTGGTTACTG ATTTGTTACG TATCAAATTT
AAATACGATG GTTTAGTAGT CAGTGATGCC TTAGTTATGA ATGCAATAAC AAATAAATAT
AGTAGTGGTG AGGCTGCAGT TATGGCATTT GATGCAGGAA TTGATTTGAT TATGATGCCA
AAAGATATTG ATGAGGCAAT TGATTCTCTT GCCGATGCTT TTTATTCAGG AAAAATTTCT
TTAGAAAGGT TAAATATATC TAGAGAAAGA AGAAAAAAAC AACTTGATTT AGTTAGCAAC
GAAGATGATT TTAAAAAAGA AGATTTGAAC AATGAAGATA TTAAAAATGA ATTTTTATTG
GATGCTTCTA AATTTAGTAA TTCTATAATA AAAAGTTCAA TTTTTGTCCG AGAAGAAAGT
ACTATAAAAG CTGAATTTAA TGATATAAAT CTTATACAAA TTGATAATTT TGATCAAGTA
CCTAATAAAT TTATTCCTGC ATTAGATATC CCTAAGGCAG TAGGTTTTAA AAATTTAATT
ATACATCCAC TTGGTATCAG TCCGTGGGGA AAAACTAATA AGAAATTTTT AGAAATGGGG
CAATTTAGCA ATAGTAAGAT TCTTGTTCAG CTTTTTGTGA GAGGTAAACC ATTTATTGGA
TTAGATTATC ATAATGATCA TTGGATAGAT GCACTAAAAA GTTTAGAAAT TGAGGAAAGA
TTATCAGGAA TTATAATTTA TGGGTGTCCA TATTTATTTG ATAAAATAAA AAAATCTATT
CATAAAAATA TTCCTTTAGC TTATAGCCCT AGTCAAACAG AGGAAGCACA AAATCAAATT
TTAAGTCGTA TTTTGCAATC AAAAACAACT CAAAAGGAAA TTGATAAAGA ATCAAGCATA
GAATTTACTG ATTGA
 
Protein sequence
MNKSDLRRQV AELFIVRASG FNLDSQRLYP NLEESNSNLK RLLEEGVGGV IVLGGTVKEL 
EIRCNILKKW SGKPLLLCAD IEEGVGQRFY GGIKFVPPMG ISQIYKKDQT LAISIAEKIG
YFTGKEAKKI GLNWLLAPVC DINNNPNNPV INLRAWGEEP ETVKSLTCAF QRGVSRSKML
TCAKHFPGHG NSEVDSHLDL PEIHNDLSKL EKFELIPFKS LINQGVNSVM IGHLLFPKID
PIFPATLSKR VVTDLLRIKF KYDGLVVSDA LVMNAITNKY SSGEAAVMAF DAGIDLIMMP
KDIDEAIDSL ADAFYSGKIS LERLNISRER RKKQLDLVSN EDDFKKEDLN NEDIKNEFLL
DASKFSNSII KSSIFVREES TIKAEFNDIN LIQIDNFDQV PNKFIPALDI PKAVGFKNLI
IHPLGISPWG KTNKKFLEMG QFSNSKILVQ LFVRGKPFIG LDYHNDHWID ALKSLEIEER
LSGIIIYGCP YLFDKIKKSI HKNIPLAYSP SQTEEAQNQI LSRILQSKTT QKEIDKESSI
EFTD