Gene P9211_12441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_12441 
SymbolwecB 
ID5731240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1119301 
End bp1120482 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content33% 
IMG OID641285612 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001551129 
Protein GI159903785 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase
[TIGR03568] UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.706059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00693738 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATAAAC GCAAAAACAA AATTTGTTTT ATTACTGGAA CAAGAGCTGA ATACGGACTG 
TTGAAATGTT TAATGAAGGA AGTTGAACAA TCAAGAGATC TTTCGCTTCA GTTAGTTGTT
ACTGGAAGTC ACTTATCCAG AATACATGGG TATACAAAAG ATGAAATTAT TAAAGACGGT
TTTTTGATAG ATAGTGAAAT TGAAATAGAT TTAAAAGAAG ATACAAATTC ATCTACATGT
TTCTCTCTTG CAGAGATTAT AACTAAGGCT TCAGGCACAT TTGAGCGGAT GAAACCTGAT
TTAATCGTAT TATTAGGAGA TCGTTATGAA TTACTTGGAG CAGCTTCTGC AGCAATGGTT
CACAGAATTC CAATTGCACA TATTCATGGA GGAGAAATAA CTGAAGGATC ATTTGATGAT
AATATAAGGC ATTGTTTAAC TAAACTTTCC CATATTCACT TTGTAGCTAC AGAGCAATAT
CGTAAGCGTG TCATTCAATT AGGTGAGAAA CCTTCTAATG TGCATAATGT AGGGGGGTTA
GGCGTAGATG CAATTGACAA AATAAATCTC TTAAGTAGGG CAGATCTAGA GAAAGATATT
GGAATAAATT TTCTCAAAAG AAATCTTATA ATTACATACC ATCCCTTAAC TCTTTCTTCA
TCAGAGCAAA CAGAATCAGA GGTCGTTGAA CTAATAAAAG CATTATCTCG ACTGGAAAAT
ACTCTTCAGA TTTTCACTCT ACCTAATGCT GACCCTGGTA ATTTCAGGAT CACAGAAATA
ATAAATTCAT ATGTTAATGA AAATGATTCG GCTATTGCAT TTAAATCCCT TGGCCAATTA
CGGTATCTTT CCTGTCTGTC TCATGTCGAT GCAGTTATTG GAAACTCATC AAGTGGACTT
ATAGAGGCAC CCTCTTTCAA TATAGGTACA ATAAACATTG GAGAAAGGCA AAAAGGTAGA
TTGACAGCAA AAAGTGTAAT TAATGTGAGA GCCGATGCAG ATTTAATACA TAACTCAATA
TCTACTATAT ATACAAAAGA GTTTCAGGAA TTACTAAATG ATAATTCTAA TCCTTATGGA
GAAGGAGAAG CCGTACAAAA GATATTGTAC ATATTGACTA ATCTCAAAAT AGAAAAATTG
CTAAGAAAAA AATTTTTTGA TTTAGATTTT AATCTACGAT GA
 
Protein sequence
MNKRKNKICF ITGTRAEYGL LKCLMKEVEQ SRDLSLQLVV TGSHLSRIHG YTKDEIIKDG 
FLIDSEIEID LKEDTNSSTC FSLAEIITKA SGTFERMKPD LIVLLGDRYE LLGAASAAMV
HRIPIAHIHG GEITEGSFDD NIRHCLTKLS HIHFVATEQY RKRVIQLGEK PSNVHNVGGL
GVDAIDKINL LSRADLEKDI GINFLKRNLI ITYHPLTLSS SEQTESEVVE LIKALSRLEN
TLQIFTLPNA DPGNFRITEI INSYVNENDS AIAFKSLGQL RYLSCLSHVD AVIGNSSSGL
IEAPSFNIGT INIGERQKGR LTAKSVINVR ADADLIHNSI STIYTKEFQE LLNDNSNPYG
EGEAVQKILY ILTNLKIEKL LRKKFFDLDF NLR