Gene P9211_11061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_11061 
SymbolmalQ 
ID5731106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1008487 
End bp1010034 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content39% 
IMG OID641285473 
Product4-alpha-glucanotransferase 
Protein accessionYP_001550991 
Protein GI159903647 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0171419 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.287417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGAAG TTAATTCAGC TAAACCCCGA TTTAGTGGAG TATTGCTTCA CCCCACGTCT 
TTACCAGGTA AAAGGACATG TGGTGGGTTT GGACAAGAGG CAAGAGATTG GCTGGAGTTA
CTTGCAAAGT CAGGAATCAG TGTTTGGCAA GTATTGCCAC TATCTCCCCC TGACTCAACT
GGCTCTCCAT ATAGTTCTCC TTCAAGTTTT GCATTCAATC CTTGGCTGCT TGATGTAAAT
GATCTTGCTC AACAGGGATT TATTGCGATG GATGTTTGCA ATGAATTATC CAGCAGTAAG
CAAGACATAA ATTCAAGTAT GGACTTTGAC TTGGCTAATT TACATAGCAG AAAGTTAGGT
CAGGCATTAA GAAAAGAATG GCCTATCCAA AATAGTTCTT CTCACAAGGA ATTTCTGGAT
TGGTGTGCTG ATCAGTTTTG GCTTGAAGAT CATGTGATGT TTATGGAACT TCGTATTCAG
AACAATCAAC TTCCTTGGTG GGAATGGCCT GAGGACTTAG CTCTGCATAA CAAAAAAGAA
CTAAATAATT TTAAAGTTAA TTTTAAGGAG GCTCTTCTAG AACACTCTTT ATTGCAATGG
CATTTAGATC GTCAGTGGTC ATCAATAAGG TCTTTAGCAA ATGATTTGGG GATATTGATT
TTTGGAGATT TGCCTTTCTA TGTTTCAAGA GACAGTGCTG ATGTTTGGAG TAATAGGTCT
TTATTCTCAA TTCTTGCAAA TGGAGAAATG TATATGCAAA GTGGAGTTCC ACCTGACTAT
TTTTCAGAAA CAGGTCAGCT ATGGGGTACG CCTGTCTATC GTTGGCAAAG CAATAAAAGA
TCTCACTTTA GATGGTGGCG CAGAAGGCTT TCCCGACAGT GGAATCAATT TGACTTATTA
CGACTAGATC ATTTTCGAGC GCTTGATTCA TTCTGGGCTG TACCTGGTAA TGACAAAACT
GCTCAAGATG GTTCGTGGAT TCCATCCCCT GGACTTAAGT TACTTAAGCT TCTTAAAAAA
GATTATGGTC AAAAACTTCC ATTGATTGCT GAGGACCTTG GTGTTATTAC TCCTCGAGTT
GAGAAATTAA GAAATTATTT TGGATTGCCC GGGATGAAAA TTCTTCAATT CGCTTTTGAT
GGAAATCAAG AAAATCCCTA TTTGCCTGAA AACATTAGAG ATTATCGATC AATTGTCTAC
ACAGGCACTC ATGACAATGA AACTACTACT GGTTGGTGGG CAGAGGTTGG ACCTGAAATT
AAATCAAGGC TTAGAAAACA ATCCAATCAA GAAAATGATT CCCCTGCATG GCAATTAATA
GAGCTTGGCT TACAGACACA AGCATGTTTA TTCATAGCTC CTATGCAGGA CATACTTGGA
CTAGGCAATG AAGCCCGTTT TAATACTCCT GGCACTGTTA AAAGAAGTAA TTGGTCTTGG
AGGCTAGAGG CATTTAATGA CTCAGTGTTG GCAGGTGTTG AAAAATATGG ACAATTGTCG
AAATCATATG GTAGAAGCTT CGCAGAAGTT TCTACTTTAA TTGGATGA
 
Protein sequence
MIEVNSAKPR FSGVLLHPTS LPGKRTCGGF GQEARDWLEL LAKSGISVWQ VLPLSPPDST 
GSPYSSPSSF AFNPWLLDVN DLAQQGFIAM DVCNELSSSK QDINSSMDFD LANLHSRKLG
QALRKEWPIQ NSSSHKEFLD WCADQFWLED HVMFMELRIQ NNQLPWWEWP EDLALHNKKE
LNNFKVNFKE ALLEHSLLQW HLDRQWSSIR SLANDLGILI FGDLPFYVSR DSADVWSNRS
LFSILANGEM YMQSGVPPDY FSETGQLWGT PVYRWQSNKR SHFRWWRRRL SRQWNQFDLL
RLDHFRALDS FWAVPGNDKT AQDGSWIPSP GLKLLKLLKK DYGQKLPLIA EDLGVITPRV
EKLRNYFGLP GMKILQFAFD GNQENPYLPE NIRDYRSIVY TGTHDNETTT GWWAEVGPEI
KSRLRKQSNQ ENDSPAWQLI ELGLQTQACL FIAPMQDILG LGNEARFNTP GTVKRSNWSW
RLEAFNDSVL AGVEKYGQLS KSYGRSFAEV STLIG