Gene Information |
Locus tag | NATL1_09141 |
Symbol | amyA |
ID | 4780555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 845213 |
End bp | 846964 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084190 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001014737 |
Protein GI | 124025621 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
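The length fields in this record follow from the coordinates by simple arithmetic. The following is a minimal Python sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(845213, 846964)
print(length)                     # 1752
print(protein_length_aa(length))  # 583
```

Both values agree with the Gene Length (1752 bp) and Protein Length (583 aa) fields listed in the record.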
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000264002 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTGAGT TGGATAGTGA ATTAGACGAG CTGAGATTGT CTCTCAGAGA AATTTATCCT GAGCACTCTG AACAAGAAAT CAATTCAGTG TGGTCGCAAT TGTTGCAGAT TCTTGATCCA TTTTGTGTTA GCAAGGGCAC TGATGAATTT GAGATCGAAT CAATTTGGGA TTCTTCAAGT GTTGTTTTGA TTACTTACCC TGATTCAATT TATAGGAAAG ATGAATCAAC TTTAAAAACA TTAACTGAAT TCGTAAAAAA TAGATTAGGT GGCCTTTCAT CAGTCATACA TGTTTTACCA TTTCTTCCTT CTACAAGTGA TGGAGGATTC GCTGTATCTA ATCATGAAAA AATCGATGAT ACCTTTGGGA ATTGGAATGA TTTAAAAGAT TTATCTAGTA AGCATAAAAT AATGGCAGAT CTAGTTTTAA ATCATGTCTC TTCTTCTCAT CCATGGGTTC ATCAATTTAT AAAATCAGAG GATCCAGGTC CGTCTTACAT TGTTTCTCCT TCTGAGACTA ATGTTTGGGA AAATGTGATA AGGCCCAGAA ATACATCACT CTTTACCAAT ATCAATACTA AACAAGGCTT TAAGAGTGTT TGGACAACCT TTGGACCAGA TCAGATTGAT GTTGATTGGA GGAATCCACA TATCTTTTTA GAGTTTTTGA AATTATTGGT TAGATACATA ACTAATGGAG CTGACTGGAT TCGACTTGAT GCAATTGCAT TTATTTGGAA GGAGCCACAT ACTACTTGTT TACATTTAGA CCCAGTACAC TCAATAGTTA AGCTGTTAAA TAAATGTTTG AAGATTATAA AACCTTCGGC AGTATTAATT ACCGAGACTA ATGTGCCAGA GAAAGAAAAC CTTTCTTATT TGATTGAAGG AAATGAAGCT AATCTTGCAT ATAACTTTAC TCTACCTCCT TTATTATTAG AAGCTATTTA TACCGGTAAA ACAGATTTAT TGAAGAGTTG GTTGTCTACG TGGAAAGAGT TGCCAAGGCA CACTTCTTTG CTCAACTTCA CTTCTTCACA CGATGGTATT GGATTACGAG CACTAGAAGG CATTATGGAC GATCAGAGAA TACATAATCT CTTGGTTGAA TCAGAGAAAC GAGGAGGATT AGTTAGTCAT CGTCGCTTGT CAAATGGAGA TGATCAACCT TATGAATTAA ACATTAGTTG GTGGAGTGCG ATGTCAAACG AAGGCTCCGA TAAAACGGTA TTTCAATTTG AGCGTTTTTT ATTAAGTCAG CTTTTTACTT TATCCATAAA AGGTGTTCCA GCTTTTTATC TTCCATCTGT ATTAGCTTCT CCGAATGATA TAGATTCTTT TAGGAAAACA GGGCAAAGAA GAGATTTAAA TCGAGAAAAA TTTGAAGCTA ACCAATTACT TGATGTACTT AAGAACTTTG ATTCTCCCGC TAGTAAAAAT ATTTCATACC TCACTCATAT AGTTAAAGTC AGGTCAAGAC TTAAAGCTTT TCATCCGGAG GCGAGTATGA AATGTATCTC TACGAATATA GCTAATTGTA TAATTCTTCA AAGAGGTTTG GATGAAGATA CTGTCTATGT TATTTGTAAT ATGTCTAGTA AATTCTTATC TATTTCTCCA TTAAATCAAA TTAATTCATT AGAATTAACC TCTGAAAAAC GTTTACTAGA TAATATTTCA GGTTCTTATT TTAATACTGA TACTTTTAAA CTTAATCCTT ATCAAGTCGT TTGGCTTACA TTAGCTGATT AG
|
Protein sequence | MSELDSELDE LRLSLREIYP EHSEQEINSV WSQLLQILDP FCVSKGTDEF EIESIWDSSS VVLITYPDSI YRKDESTLKT LTEFVKNRLG GLSSVIHVLP FLPSTSDGGF AVSNHEKIDD TFGNWNDLKD LSSKHKIMAD LVLNHVSSSH PWVHQFIKSE DPGPSYIVSP SETNVWENVI RPRNTSLFTN INTKQGFKSV WTTFGPDQID VDWRNPHIFL EFLKLLVRYI TNGADWIRLD AIAFIWKEPH TTCLHLDPVH SIVKLLNKCL KIIKPSAVLI TETNVPEKEN LSYLIEGNEA NLAYNFTLPP LLLEAIYTGK TDLLKSWLST WKELPRHTSL LNFTSSHDGI GLRALEGIMD DQRIHNLLVE SEKRGGLVSH RRLSNGDDQP YELNISWWSA MSNEGSDKTV FQFERFLLSQ LFTLSIKGVP AFYLPSVLAS PNDIDSFRKT GQRRDLNREK FEANQLLDVL KNFDSPASKN ISYLTHIVKV RSRLKAFHPE ASMKCISTNI ANCIILQRGL DEDTVYVICN MSSKFLSISP LNQINSLELT SEKRLLDNIS GSYFNTDTFK LNPYQVVWLT LAD
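The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with translation table 11 accepting alternative start codons. The sketch below illustrates how the protein could be derived from the gene sequence under that table. It is a hedged, standard-library-only illustration, not the method used by the database. The `START_CODONS` set is an assumed subset of the start codons allowed in table 11, and `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 shares these
# assignments with the standard genetic code; it differs in start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the alternative start codons accepted in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("GTGTCTGAGT TGGATAGTGA ATTAGACGAG"))  # MSELDSELDE
```

The printed residues match the first ten residues of the protein sequence in this record.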