Gene A9601_14591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_14591 
Symbolglf 
ID4718180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1243082 
End bp1244266 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content33% 
IMG OID640079180 
ProductUDP-galactopyranose mutase 
Protein accessionYP_001009849 
Protein GI123968991 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0562] UDP-galactopyranose mutase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAGCATTAAT TATAGGTGGC GGGTTTGCAG GCTGCTCAGC TGCTCATCAA 
CTAGAACTAA TAGGGAACTG GGATGTCACT TTAATAGAAA AAGCAAATTA TCTTGGGGCA
GGTAATAAGA CCAGATGGTA TGGAGGTCAT CCTTATACAT TTGGCCCACG TCATTTTTTA
ACGCCTTATC AAGAAGTTTT TGATTATATA GATAAAATTA TTCCAATTAG AAAATGTCCT
GAACATGAAT TCTTAACATA TGTTGAGAGA GATAACGCTT TCTATGCATA TCCAATAAAT
ATGCAAGATG TAAGAGAAAT GCCCGATTAC GAAATAATCG ACTCAGAACT TAAATCAATC
AAAGAATCGC AATATAAAGG GGTAAAATTT GCAAGAAATC TTGAAGAATA TTGGATCGCA
AGTGTAGGTC AAACTTTATA TTCAAAAATG ATTGATAAAT ATAATAAGAA AATGTGGTTA
GTTGAAGATA ACAAATCAAT TGATACCTTC AATTGGTCTC CCAAAGGAGT TGCTTTAAAA
GATGGCCCAA GAGCAGCATG GGACTCAGCA ATTTCTGGAT ATCCCTATGC TAAAGATGGA
TATGATAAAT ACTTCCCTTT TGCTACAAAA AATACAAAAG TTCTTTTAAA TACAACTTGT
CAAATCGTAG ATATAAATAA GAAGAAAGTA CTTATAGAAG GTGAAGAATA TTATTTTGAC
CTTATCATTT CTAGTATTGC ACCAGATATA TTTTTAAATG AAATTTTTGG ACCTTTAAAA
TACATTGGCA GAGAGCTAAA ACTTATGGTT TTCCCTTCAG AATATATTTT TCCCGAGAAT
GTATATTTCC TTTACTACGC AAATGCTGAG CCTTTCACTA GACTTGTTGA ATACAAAAAA
TTTACTCACC ATAAATCAAA CACAACCTTG GTTGGGATGG AAATACCAGC ATTAAATGGA
GGATATGAAT ATCCTGTACC ATTTAAAGAA GAACAAAAAA AAGCAATGAA ATATTATGAA
GCAATGCCTG AAGGGGTTTA TTCAATCGGC AGAGCTGGAT CATATCTATA TGGAATTGAT
ATTGATGATT GCATTCGACA AACAATGATT ATTTCTGAAG AATTAAAAGA GGGAGGGCAA
GATAATACCG TGCCTGGGAA AGAATATCAA TTCCCAGAAT TATAA
 
Protein sequence
MNKKALIIGG GFAGCSAAHQ LELIGNWDVT LIEKANYLGA GNKTRWYGGH PYTFGPRHFL 
TPYQEVFDYI DKIIPIRKCP EHEFLTYVER DNAFYAYPIN MQDVREMPDY EIIDSELKSI
KESQYKGVKF ARNLEEYWIA SVGQTLYSKM IDKYNKKMWL VEDNKSIDTF NWSPKGVALK
DGPRAAWDSA ISGYPYAKDG YDKYFPFATK NTKVLLNTTC QIVDINKKKV LIEGEEYYFD
LIISSIAPDI FLNEIFGPLK YIGRELKLMV FPSEYIFPEN VYFLYYANAE PFTRLVEYKK
FTHHKSNTTL VGMEIPALNG GYEYPVPFKE EQKKAMKYYE AMPEGVYSIG RAGSYLYGID
IDDCIRQTMI ISEELKEGGQ DNTVPGKEYQ FPEL