Gene Information |
Locus tag | P9211_06751 |
Symbol | amyA |
ID | 5731626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 592932 |
End bp | 594668 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641285037 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001550560 |
Protein GI | 159903216 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
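The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(592932, 594668)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1737 578
```

Both results agree with the Gene Length and Protein Length fields of this record.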
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.335085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGC AACTAGTGAG ATTGAGTGAA TTACTCAATG AAGTTTACAG AGAACACTCT GCAGAAGAAA TTGATTATAT GTGGTCACAA TTGCTGCAGA TTTTGAATCA GCATAGTGAT AAACGAGATA ATTATGCTGA ACTCTCCGAA CTTTGGAACT CTTCTAGCGC CGTTTTGATT ACTTATGCTG ATGGTGTATA CAAGTCAGGA GAGCCAACTT TAAAGACCCT TAAAGATTTA ATCGATTTGC ATTTAAATGA CTTTGCATCG GTTATACATG TCTTGCCTTT TTTGTGTTCC ACAAGTGATG GTGGGTTTGC TGTATCAGAT TTTGAGAAAT TAGAAACACG TTTTGGCGAA TGGGATCATT TAAAAGCTCT CTCGAAGAAT CACATATTGA TGGCAGACTT GGTTCTAAAT CATGTTTCAT CTTCTCATCC ATGGGTCCAA CAATTTATTC AATCTAAGGA GCCTGGGAGT AAATATATTC TTTCCCCTTC ATCATCTGAA AACTGGGAAG ATGTTACTAG GCCAAGGAAT ACTTCTCTTT TTACTAACCT TTCTACTACT AAAGGTAAGA AAGATGTTTG GACAACATTT GGTCCAGATC AAATTGATAT TAATTGGAAA GAGCCATATG TTTTGATAGA ATTTTTAAGA TTAATTATTA GATATATAGA TTCTGGAATA AAATGGATTC GACTTGATGC CGTCGGCTTT ATATGGAAAG AGCCAGGTAC AACATGTTTG CACAGAAATG AAGTTCATAA GATAGTTAAG GCATTAAGAA TTCAGATTAA TGAACTTATA AACTCTAGTG TTTTAATTAC TGAAACTAAT GTTCCAGAAA AAGAGAATGT TTCATACCTT AGTTCAGGCG ATGAAGCGCA TCTTGCATAT AACTTTCCTC TCCCTCCCCT TTTACTGGAA AGTTTAATTA CCAATAAAGC AGATTTACTT AACAATTGGT TATCCTCATG GCCTGAGTTG CCTAAAAACA CAGGGTTTTT AAACTTTACG GCTTCCCATG ACGGTGTTGG GCTAAGAGCC TTGGAGGGTT TAATGGATCA GAAAAGAATT CGTGAATTAT TAATAGCTTG TGAGAAAAGA GGAGGTTTAA TCAGCCATAG AAGAATGTCT AATGGTGAGG ATCAACCTTA TGAATTAAAT ATTAGTTGGT GGAGTGCAAT GGCAGATAAA GGAAGAGATA CTTCCTTATT TCAGTTTGAG CGCTTTTTAT TGAGTCAACT TTTTGTAATG GCTTTAAAAG GTGTTCCAGC TTTTTATTTG CAGGCGTTAA TGGCATCGGA AAATGATTTA ACAACCTTTG CCAAATCTGG CCAAAGGAGA GATTTGAATC GTGAAAAGTT TGAAGCAAAT ACTTTACGCA TCAAACTGGA GGACGAAAAG TCACATCCAA GTAGAAATTT AACTTCTCTT AAGAAAGCAA TGCAGGTAAG AAGAAAATTA AATGCTTTTC ATCCCAACCA ACCAATGAAA TGCCTTAGTA AGAGTCGCAG TGATCTTGTG ATAATTTCTC GTGGTGAAGG TAATGAAACT ATTTGGGCAT TACATAATAT GACCAATTCA AAACTATGCT TTTCTCTTTC AGAAGGTTTA AATGTTAATG GAGAATCTAC TGTCTCTTGG GATGATTGCT TAAATGATTA TAAGCGACAT CAAAATAGAA TAGACTTGCA TCCCTACTCT GTTCATTGGT TAATGAAATC AAACTAA
|
Protein sequence | MEQQLVRLSE LLNEVYREHS AEEIDYMWSQ LLQILNQHSD KRDNYAELSE LWNSSSAVLI TYADGVYKSG EPTLKTLKDL IDLHLNDFAS VIHVLPFLCS TSDGGFAVSD FEKLETRFGE WDHLKALSKN HILMADLVLN HVSSSHPWVQ QFIQSKEPGS KYILSPSSSE NWEDVTRPRN TSLFTNLSTT KGKKDVWTTF GPDQIDINWK EPYVLIEFLR LIIRYIDSGI KWIRLDAVGF IWKEPGTTCL HRNEVHKIVK ALRIQINELI NSSVLITETN VPEKENVSYL SSGDEAHLAY NFPLPPLLLE SLITNKADLL NNWLSSWPEL PKNTGFLNFT ASHDGVGLRA LEGLMDQKRI RELLIACEKR GGLISHRRMS NGEDQPYELN ISWWSAMADK GRDTSLFQFE RFLLSQLFVM ALKGVPAFYL QALMASENDL TTFAKSGQRR DLNREKFEAN TLRIKLEDEK SHPSRNLTSL KKAMQVRRKL NAFHPNQPMK CLSKSRSDLV IISRGEGNET IWALHNMTNS KLCFSLSEGL NVNGESTVSW DDCLNDYKRH QNRIDLHPYS VHWLMKSN
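The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the helpers below strip that spacing, compute a GC percentage, and run a basic sanity check on a coding sequence. The function names are hypothetical. The stop codons TAA, TAG and TGA are those of translation table 11, and checking only for an ATG start is a simplification, since table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if the length is a multiple of 3, with an ATG start and a stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    seq = clean_sequence(raw)
    stop_codons = ("TAA", "TAG", "TGA")  # stop codons of translation table 11
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stop_codons


# Illustrative input only: the first two blocks of the gene sequence above.
fragment = "ATGGAACAGC AACTAGTGAG"
print(clean_sequence(fragment))
print(gc_percent(fragment))
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field of the record.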