Gene Information |
Locus tag | P9303_09791 |
Symbol | malQ |
ID | 4778344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 890079 |
End bp | 891644 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640086487 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_001016993 |
Protein GI | 124022686 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
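The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminating stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 890079, 891644
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1566
print(protein_length_aa(length))  # 521
```

Both results match the Gene Length (1566 bp) and Protein Length (521 aa) fields of the record.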
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGAGG GCAGCTCAGC ATTAGTCCGC ACGACCGGGG TGTTACTTCA TCCCACAGCA CTGCCTGCAA GTCCAGTTTG CGGCAGTTTC GGTGCTCCTT CCAGGGCTTG GTTGCAATCA CTTGCTCGCC ATGACATCGG CGTGTGGCAG TTGTTGCCTT TGGCTCCTCC TGATGCGACA GGGTCTCCCT ACAGCTCCCC TTCAAGCTTT GCTCTTAACC CTTGGTTGCT TGATGCCGAT GATCTTGTTG ATGAGGATTT CCTGTCCGCA TCCGTTCTTC GTGAGTTGCC AGGAACAGTC CCTATCCAGG AGCCTTGCGC ATCGGTTGAT TTTGCTCTGG CTGACCTTCG CAGTCAGCGA CTTGGCATAG CCTTAAGGGA GGCTTGGTCA GAGCAGGCTA GAGACCATCA TCTTGCCTTT GAGAGTTGGT GTGGCAAGCA GTTCTGGCTT GAGGACCATG TTTTATTTAT GGAGCTCCGT CGTCAGCATC AAGGCCTCCC TTGGTGGGAA TGGCCGGAAG GCTTGGCAGC TCATCAACGT AGGGCTCTCA ATGCCTGGAA AGGTCATCAT CAGGAGGCAC TGCTTGAACA TCGTTTGCTG CAATGGCAGC TTGATCGGCA GTGGCAAGCA CTGAGACATC TAGCAGGCGA GCTTGGCGTT TTGTTGTTTG GTGATCTGCC TTTCTATGTT GCGCGCGATA GCGCTGACGT CTGGAGTCAT CAGGGCCTGT TTTCGATCCT TCAAGGAGGG GAGCTTGAGA TCCAAAGTGG TGTTCCCCCT GATTATTTCT CCAGTACAGG ACAACTCTGG GGCACGCCTG TTTATCGCTG GTGGCGGCAT CGCTGGAGTG GATTCCATTG GTGGCGTAGC CGCTTTGTGC GTCAATGGCA ACAGGTGGAT CTCTTACGTC TCGACCATTT CAGGGCGCTT GCGTCTTATT GGGCAGTGCC TGGATCCGAT ACGACAGCGG AGCACGGGGA GTGGCGGCCT TCTCCAGGTG CAGCTCTATT GAAGCGGCTA AGGCGTGATG CTGGTGGCAT CTTGCCACTG GTTGCAGAAG ATCTGGGAGT GATTACCCCT GATGTGGAAG AGCTACGAGA CCAATTTGGG TTGCCTGGAA TGAAAATTCT CCAGTTTGCT TTTGATGGCA ATCCCAGTAA TCCCTATCTA CCTGAAAATA TTAAGGGTCA TCACTGGGTG GTTTATACAG GCACTCACGA CAATCCAACG ACTCTGGGTT GGTGGCAGCA GCTTGATCTC GACATCCGTG AGCGGGTCGT AGATCGATTG AAGGGTGTAG TTCATGCCCC TGGTTGGCAG TTGCTGGAGC TGGGCCTAGC GACAGAGGCG TGTTTAGTGA TAACCCCTGT GCAGGATTTG CTTCATTTAG ATGATGTAGC TCGTTTCAAC ACCCCTGGCA CTGTCGAGGG AAATTGGTGT TGGCGGTTGT CTTGCTTTGA TTCAGCTCTT GATGGAGCCC TGAGCGGCTA TGGCGAGAGA GGAGCTGTCT GGGGACGTTC GCTTGAATCG GCAGCCGCTT TGCTGACGGC ATCTTCGAGT CGGTAA
|
Protein sequence | MVEGSSALVR TTGVLLHPTA LPASPVCGSF GAPSRAWLQS LARHDIGVWQ LLPLAPPDAT GSPYSSPSSF ALNPWLLDAD DLVDEDFLSA SVLRELPGTV PIQEPCASVD FALADLRSQR LGIALREAWS EQARDHHLAF ESWCGKQFWL EDHVLFMELR RQHQGLPWWE WPEGLAAHQR RALNAWKGHH QEALLEHRLL QWQLDRQWQA LRHLAGELGV LLFGDLPFYV ARDSADVWSH QGLFSILQGG ELEIQSGVPP DYFSSTGQLW GTPVYRWWRH RWSGFHWWRS RFVRQWQQVD LLRLDHFRAL ASYWAVPGSD TTAEHGEWRP SPGAALLKRL RRDAGGILPL VAEDLGVITP DVEELRDQFG LPGMKILQFA FDGNPSNPYL PENIKGHHWV VYTGTHDNPT TLGWWQQLDL DIRERVVDRL KGVVHAPGWQ LLELGLATEA CLVITPVQDL LHLDDVARFN TPGTVEGNWC WRLSCFDSAL DGALSGYGER GAVWGRSLES AAALLTASSS R
|
| |
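The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing its GC content. The following is a minimal sketch, not the database's own method. It assumes the sequence is pasted with the 10-base spacing shown in the record, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs mainly in its set of permitted start codons, which this sketch does not model). The names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGGTTGAGG GCAGCTCAGC ATTAGTCCGC"))  # MVEGSSALVR
```

The translated fragment matches the first block of the protein sequence (MVEGSSALVR). Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.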