Gene Information |
Locus tag | P9211_11061 |
Symbol | malQ |
ID | 5731106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1008487 |
End bp | 1010034 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285473 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_001550991 |
Protein GI | 159903647 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
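The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1008487
end_bp = 1010034
gene_length_bp = 1548
protein_length_aa = 515

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is assumed to be a stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

All three checks agree with the table under these assumptions: 1010034 - 1008487 + 1 = 1548 bp, and 1548 / 3 - 1 = 515 aa.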
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0171419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.287417 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGAAG TTAATTCAGC TAAACCCCGA TTTAGTGGAG TATTGCTTCA CCCCACGTCT TTACCAGGTA AAAGGACATG TGGTGGGTTT GGACAAGAGG CAAGAGATTG GCTGGAGTTA CTTGCAAAGT CAGGAATCAG TGTTTGGCAA GTATTGCCAC TATCTCCCCC TGACTCAACT GGCTCTCCAT ATAGTTCTCC TTCAAGTTTT GCATTCAATC CTTGGCTGCT TGATGTAAAT GATCTTGCTC AACAGGGATT TATTGCGATG GATGTTTGCA ATGAATTATC CAGCAGTAAG CAAGACATAA ATTCAAGTAT GGACTTTGAC TTGGCTAATT TACATAGCAG AAAGTTAGGT CAGGCATTAA GAAAAGAATG GCCTATCCAA AATAGTTCTT CTCACAAGGA ATTTCTGGAT TGGTGTGCTG ATCAGTTTTG GCTTGAAGAT CATGTGATGT TTATGGAACT TCGTATTCAG AACAATCAAC TTCCTTGGTG GGAATGGCCT GAGGACTTAG CTCTGCATAA CAAAAAAGAA CTAAATAATT TTAAAGTTAA TTTTAAGGAG GCTCTTCTAG AACACTCTTT ATTGCAATGG CATTTAGATC GTCAGTGGTC ATCAATAAGG TCTTTAGCAA ATGATTTGGG GATATTGATT TTTGGAGATT TGCCTTTCTA TGTTTCAAGA GACAGTGCTG ATGTTTGGAG TAATAGGTCT TTATTCTCAA TTCTTGCAAA TGGAGAAATG TATATGCAAA GTGGAGTTCC ACCTGACTAT TTTTCAGAAA CAGGTCAGCT ATGGGGTACG CCTGTCTATC GTTGGCAAAG CAATAAAAGA TCTCACTTTA GATGGTGGCG CAGAAGGCTT TCCCGACAGT GGAATCAATT TGACTTATTA CGACTAGATC ATTTTCGAGC GCTTGATTCA TTCTGGGCTG TACCTGGTAA TGACAAAACT GCTCAAGATG GTTCGTGGAT TCCATCCCCT GGACTTAAGT TACTTAAGCT TCTTAAAAAA GATTATGGTC AAAAACTTCC ATTGATTGCT GAGGACCTTG GTGTTATTAC TCCTCGAGTT GAGAAATTAA GAAATTATTT TGGATTGCCC GGGATGAAAA TTCTTCAATT CGCTTTTGAT GGAAATCAAG AAAATCCCTA TTTGCCTGAA AACATTAGAG ATTATCGATC AATTGTCTAC ACAGGCACTC ATGACAATGA AACTACTACT GGTTGGTGGG CAGAGGTTGG ACCTGAAATT AAATCAAGGC TTAGAAAACA ATCCAATCAA GAAAATGATT CCCCTGCATG GCAATTAATA GAGCTTGGCT TACAGACACA AGCATGTTTA TTCATAGCTC CTATGCAGGA CATACTTGGA CTAGGCAATG AAGCCCGTTT TAATACTCCT GGCACTGTTA AAAGAAGTAA TTGGTCTTGG AGGCTAGAGG CATTTAATGA CTCAGTGTTG GCAGGTGTTG AAAAATATGG ACAATTGTCG AAATCATATG GTAGAAGCTT CGCAGAAGTT TCTACTTTAA TTGGATGA
|
Protein sequence | MIEVNSAKPR FSGVLLHPTS LPGKRTCGGF GQEARDWLEL LAKSGISVWQ VLPLSPPDST GSPYSSPSSF AFNPWLLDVN DLAQQGFIAM DVCNELSSSK QDINSSMDFD LANLHSRKLG QALRKEWPIQ NSSSHKEFLD WCADQFWLED HVMFMELRIQ NNQLPWWEWP EDLALHNKKE LNNFKVNFKE ALLEHSLLQW HLDRQWSSIR SLANDLGILI FGDLPFYVSR DSADVWSNRS LFSILANGEM YMQSGVPPDY FSETGQLWGT PVYRWQSNKR SHFRWWRRRL SRQWNQFDLL RLDHFRALDS FWAVPGNDKT AQDGSWIPSP GLKLLKLLKK DYGQKLPLIA EDLGVITPRV EKLRNYFGLP GMKILQFAFD GNQENPYLPE NIRDYRSIVY TGTHDNETTT GWWAEVGPEI KSRLRKQSNQ ENDSPAWQLI ELGLQTQACL FIAPMQDILG LGNEARFNTP GTVKRSNWSW RLEAFNDSVL AGVEKYGQLS KSYGRSFAEV STLIG
|
| |
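The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment with translation table 11. The sketch below is an illustration, not the database's own procedure. It builds the codon table from the usual TCAG-ordered amino acid string, which table 11 shares with the standard code (the two differ only in which start codons are permitted). The helper name `translate` and the two fragment variables are hypothetical; the fragments are the first 30 and last 18 bases of the gene sequence shown above.

```python
# Codon table in the conventional TCAG ordering (amino acid assignments of
# translation table 11, which match the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    # Drop the spaces that split the displayed sequence into blocks of ten.
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 18 bases of the gene sequence, copied from the record.
gene_head = "ATGATAGAAG TTAATTCAGC TAAACCCCGA"
gene_tail = "TCTACTTTAA TTGGATGA"

print(translate(gene_head))  # MIEVNSAKPR
print(translate(gene_tail))  # STLIG*
```

The two outputs match the first ten residues (MIEVNSAKPR) and the last five residues (STLIG) of the protein sequence, followed by the TGA stop codon. Because the displayed gene sequence already reads from the ATG start codon, no reverse complement is needed here even though the gene lies on the minus strand.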