Gene NATL1_09141 details


Gene Information

Locus tag: NATL1_09141
Symbol: amyA
ID: 4780555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 845213
End bp: 846964
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 34%
IMG OID: 640084190
Product: glycoside hydrolase family protein
Protein accession: YP_001014737
Protein GI: 124025621
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
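
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 845213
end_bp = 846964
gene_length_bp = 1752
protein_length_aa = 583

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminating stop codon.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp == gene_length_bp)           # coordinate span vs. reported gene length
print(span_bp % 3 == 0)                    # a CDS should be a whole number of codons
print(residue_count == protein_length_aa)  # codons minus stop vs. reported protein length
```

Under these assumptions the three reported figures agree: 1752 bp is 584 codons, which leaves 583 residues once the stop codon is set aside.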


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000264002
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCTGAGT TGGATAGTGA ATTAGACGAG CTGAGATTGT CTCTCAGAGA AATTTATCCT 
GAGCACTCTG AACAAGAAAT CAATTCAGTG TGGTCGCAAT TGTTGCAGAT TCTTGATCCA
TTTTGTGTTA GCAAGGGCAC TGATGAATTT GAGATCGAAT CAATTTGGGA TTCTTCAAGT
GTTGTTTTGA TTACTTACCC TGATTCAATT TATAGGAAAG ATGAATCAAC TTTAAAAACA
TTAACTGAAT TCGTAAAAAA TAGATTAGGT GGCCTTTCAT CAGTCATACA TGTTTTACCA
TTTCTTCCTT CTACAAGTGA TGGAGGATTC GCTGTATCTA ATCATGAAAA AATCGATGAT
ACCTTTGGGA ATTGGAATGA TTTAAAAGAT TTATCTAGTA AGCATAAAAT AATGGCAGAT
CTAGTTTTAA ATCATGTCTC TTCTTCTCAT CCATGGGTTC ATCAATTTAT AAAATCAGAG
GATCCAGGTC CGTCTTACAT TGTTTCTCCT TCTGAGACTA ATGTTTGGGA AAATGTGATA
AGGCCCAGAA ATACATCACT CTTTACCAAT ATCAATACTA AACAAGGCTT TAAGAGTGTT
TGGACAACCT TTGGACCAGA TCAGATTGAT GTTGATTGGA GGAATCCACA TATCTTTTTA
GAGTTTTTGA AATTATTGGT TAGATACATA ACTAATGGAG CTGACTGGAT TCGACTTGAT
GCAATTGCAT TTATTTGGAA GGAGCCACAT ACTACTTGTT TACATTTAGA CCCAGTACAC
TCAATAGTTA AGCTGTTAAA TAAATGTTTG AAGATTATAA AACCTTCGGC AGTATTAATT
ACCGAGACTA ATGTGCCAGA GAAAGAAAAC CTTTCTTATT TGATTGAAGG AAATGAAGCT
AATCTTGCAT ATAACTTTAC TCTACCTCCT TTATTATTAG AAGCTATTTA TACCGGTAAA
ACAGATTTAT TGAAGAGTTG GTTGTCTACG TGGAAAGAGT TGCCAAGGCA CACTTCTTTG
CTCAACTTCA CTTCTTCACA CGATGGTATT GGATTACGAG CACTAGAAGG CATTATGGAC
GATCAGAGAA TACATAATCT CTTGGTTGAA TCAGAGAAAC GAGGAGGATT AGTTAGTCAT
CGTCGCTTGT CAAATGGAGA TGATCAACCT TATGAATTAA ACATTAGTTG GTGGAGTGCG
ATGTCAAACG AAGGCTCCGA TAAAACGGTA TTTCAATTTG AGCGTTTTTT ATTAAGTCAG
CTTTTTACTT TATCCATAAA AGGTGTTCCA GCTTTTTATC TTCCATCTGT ATTAGCTTCT
CCGAATGATA TAGATTCTTT TAGGAAAACA GGGCAAAGAA GAGATTTAAA TCGAGAAAAA
TTTGAAGCTA ACCAATTACT TGATGTACTT AAGAACTTTG ATTCTCCCGC TAGTAAAAAT
ATTTCATACC TCACTCATAT AGTTAAAGTC AGGTCAAGAC TTAAAGCTTT TCATCCGGAG
GCGAGTATGA AATGTATCTC TACGAATATA GCTAATTGTA TAATTCTTCA AAGAGGTTTG
GATGAAGATA CTGTCTATGT TATTTGTAAT ATGTCTAGTA AATTCTTATC TATTTCTCCA
TTAAATCAAA TTAATTCATT AGAATTAACC TCTGAAAAAC GTTTACTAGA TAATATTTCA
GGTTCTTATT TTAATACTGA TACTTTTAAA CTTAATCCTT ATCAAGTCGT TTGGCTTACA
TTAGCTGATT AG
 
Protein sequence
MSELDSELDE LRLSLREIYP EHSEQEINSV WSQLLQILDP FCVSKGTDEF EIESIWDSSS 
VVLITYPDSI YRKDESTLKT LTEFVKNRLG GLSSVIHVLP FLPSTSDGGF AVSNHEKIDD
TFGNWNDLKD LSSKHKIMAD LVLNHVSSSH PWVHQFIKSE DPGPSYIVSP SETNVWENVI
RPRNTSLFTN INTKQGFKSV WTTFGPDQID VDWRNPHIFL EFLKLLVRYI TNGADWIRLD
AIAFIWKEPH TTCLHLDPVH SIVKLLNKCL KIIKPSAVLI TETNVPEKEN LSYLIEGNEA
NLAYNFTLPP LLLEAIYTGK TDLLKSWLST WKELPRHTSL LNFTSSHDGI GLRALEGIMD
DQRIHNLLVE SEKRGGLVSH RRLSNGDDQP YELNISWWSA MSNEGSDKTV FQFERFLLSQ
LFTLSIKGVP AFYLPSVLAS PNDIDSFRKT GQRRDLNREK FEANQLLDVL KNFDSPASKN
ISYLTHIVKV RSRLKAFHPE ASMKCISTNI ANCIILQRGL DEDTVYVICN MSSKFLSISP
LNQINSLELT SEKRLLDNIS GSYFNTDTFK LNPYQVVWLT LAD
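
The sequences above are printed in blocks of ten with line breaks, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the grouping, compute a GC fraction, and translate the reading frame. It is an assumption-laden illustration rather than the database's own method: the helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical, the codon assignments are those of the standard code (which translation table 11 shares), and the first codon is assumed to be a valid initiator and is therefore read as M even when it is GTG.

```python
BASES = "TCAG"
# Amino-acid assignments of the standard genetic code in TCAG codon order.
# Translation table 11 uses the same assignments and differs in start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text):
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna):
    """Translate a coding sequence in frame, stopping at the first stop codon.

    Assumption: the first codon is an initiator, so it is emitted as M
    regardless of its usual amino-acid assignment (GTG would otherwise be V).
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTCTGAGT TGGATAGTGA ATTAGACGAG CTGAGATTGT CTCTCAGAGA AATTTATCCT"
dna = clean(first_line)
peptide = translate_cds(dna)
print(len(dna), peptide, gc_fraction(dna))
```

The translated fragment can be compared with the first two blocks of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the reported GC content.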