Gene A9601_11871 details


Gene Information

Locus tag: A9601_11871
Symbol: malQ
ID: 4717901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1001419
End bp: 1002939
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 33%
IMG OID: 640078903
Product: 4-alpha-glucanotransferase
Protein accession: YP_001009578
Protein GI: 123968720
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
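
The gene and protein lengths in this record follow from the start and end coordinates. The Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1001419
END_BP = 1002939


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1521 506
```

Both values agree with the Gene Length and Protein Length fields listed above.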


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTATAG AGTCAATTAT TGCAAAAAAA TCATTAGGCG TACTTATGCA TCCAACATGT 
ATTCCGGGAG GAAATGTCTG TGGAACTTTT GGGAGAGGAG CTAAAGAGTG GATAAGAAAA
CTTCATAAGC ATGGAATTGA ATACTGGCAA TTTTTACCCC TTACACCTAC TGACTCTACA
GGTTCGCCAT ATAGTTCCCC ATCTAGCTTT GCACTAAACC CATGGTTTTT GGATATAGAT
GATTTAATCG AAAAAGGTTT TATTTACATC TCAAATAAAG AAAATCTGGG TCCAACAAAT
CAAAATAAAG ATCATTTTGA TTTTGATATT GCGGATGACC TAACAAAAAA ATTAGGTCTC
CTCCTTTTGC AAGGTTGGAG TTCACAATCT GAAAAAAGAA AAATTAATTT TAACAAATGG
GTCAGTAGGC ATTCTTGGGT TGAAGATTAT GCAACATTTG TTGTTATTAG AGAGGAATTT
AATATGTTGC CTTGGTGGGA ATGGCCTCAA GATTTTAAAA TAAAAAATAA CAAGTTCTTA
AAATCGTGGA TTAAGAAAAA AAGTGAAGAG ATACTTATTA AAAAATTAAT ACAGTGGCAT
CTTGATGAAC AATGGAGCGT CATTAAAAAC TTTGCAAAAT CAAAAAATAT TAAGCTCATA
GGAGATTTGC CTTTTTATGT CTCTAGAGAC AGCGCCGACG TATGGAGTAA TAAATCATTA
TTTTCAATTT TTAAAAATGG GGATTTAATC TTTCAAAGTG GTGTTCCGCC TGATTATTTT
TCATCAACAG GACAATTATG GGGTACCCCA ACTTACTTTT GGTCAAAGCA TAAAAGGACT
AATTTCAATT GGTGGAGAAA AAGATTTCAA AGACAATTTG AACTTGTGGA CATATTAAGA
TTTGATCATT TCAGGGGTTT AGCAGGTTAC TGGAGAGTTA ATGGCAATTC TAAAACGGCA
ATTATTGGAA AATGGATAAA TTCTCCAGGT AAAACACTAT TAAATAAAGT AAAAAAGGAT
CTAGGGGGTA ACTATCTACC TATTATTGCG GAGGATTTGG GAGTAATAAC TCCAGATGTA
GAGAAATTAA GGAAACGCTT CGAACTACCT GGCATGAAAA TATTACAATT TGCTTTTGAT
GGCAATAAAG ATAATCCTTA TTTACCAAAG AATATTGAAG GAGAAAATTG GGTTGTTTAT
ACAGGTACTC ACGACAACTC TACTTCTGTT TCATGGTGGG AAAATTTAGA TTATGAATCC
CAAAAAAGAA TAAAAGATGA GTATAAATTT TCAGAAAATC CTTCTTGGGA TTTAATAGAA
ATTGGCATGG AGACAAATGC TAATCTTTTT ATCGCTCCAT TACAAGATCT ATTATCTCTA
AACGATTCAA GTAGATTAAA CAAACCTGGG ACCACAAAAA ATAACTGGAA ATGGAAGTTA
AATTGTCCTT TAGAAGAAAT AGAAAATAAT ATAAAAATGT TTAGTGAGCT AGGAAATAAT
TTTGGGAGAA CTCTAATGTA G
 
Protein sequence
MPIESIIAKK SLGVLMHPTC IPGGNVCGTF GRGAKEWIRK LHKHGIEYWQ FLPLTPTDST 
GSPYSSPSSF ALNPWFLDID DLIEKGFIYI SNKENLGPTN QNKDHFDFDI ADDLTKKLGL
LLLQGWSSQS EKRKINFNKW VSRHSWVEDY ATFVVIREEF NMLPWWEWPQ DFKIKNNKFL
KSWIKKKSEE ILIKKLIQWH LDEQWSVIKN FAKSKNIKLI GDLPFYVSRD SADVWSNKSL
FSIFKNGDLI FQSGVPPDYF SSTGQLWGTP TYFWSKHKRT NFNWWRKRFQ RQFELVDILR
FDHFRGLAGY WRVNGNSKTA IIGKWINSPG KTLLNKVKKD LGGNYLPIIA EDLGVITPDV
EKLRKRFELP GMKILQFAFD GNKDNPYLPK NIEGENWVVY TGTHDNSTSV SWWENLDYES
QKRIKDEYKF SENPSWDLIE IGMETNANLF IAPLQDLLSL NDSSRLNKPG TTKNNWKWKL
NCPLEEIENN IKMFSELGNN FGRTLM
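
The protein sequence corresponds to the gene sequence translated with translation table 11. The Python sketch below shows one way to check this and to compute a GC fraction from the space-grouped sequence blocks. It is an illustration with stated assumptions, not the tool that generated this record. The helper names are hypothetical. The codon-to-residue string is the standard assignment, which table 11 shares, and the sketch ignores table 11's alternative start codons because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence above.
first_bases = "ATGCCTATAG AGTCAATTAT TGCAAAAAAA"
last_bases = "TTTGGGAGAA CTCTAATGTA G"
print(translate(first_bases))  # MPIESIIAKK
print(translate(last_bases))   # FGRTLM
```

The two fragments translate to the first ten and last six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.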