Gene P9211_12401 details


Gene Information

Locus tag: P9211_12401
Symbol:
ID: 5731236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1115687
End bp: 1116688
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 38%
IMG OID: 641285608
Product: nucleoside-diphosphate-sugar epimerase
Protein accession: YP_001551125
Protein GI: 159903781
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
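
The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and is not part of the database. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1115687
END_BP = 1116688


def gene_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1002, matching "Gene Length"
print(protein_length(length_bp))  # 333, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields reported for this gene.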


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.509931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.00693738
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTAGGA AAGTACTGGT TACTGGTGCG GACGGGTTTA TAGGCTCTCA TCTGGTAGAG 
AGCTTGCTAG ACAATGGCTA TGAGGTTAAA CCTTTTTGTT TCTATAATTC AAGTGGGAGC
TGGGGATGGC TAGAAGAGCT ATGTGATGAA AAAAGCAAGG AGCTTGATGT CTTTTTAGGT
GATATAAGAG ACCCTGTTTG TGTCAAGGAA GCCATGAAAG GATGCGACAT GGTATTCCAC
CTCGCAGCAC TAATAGGAAT TCCTTATAGC TACATAGCCG CTAGAAGTTA TATAGAAACA
AACATTATTG GCACACTAAA TGTATTAGAG GCAGCAAAAG ATTTAGGGGT TTCGAAAATA
ATTCACACGT CTACGTCAGA AACATATGGT ACGGCACAAT CTGTTCCTAT AAATGAAAAG
CACCCACTCT CTGGCCAATC TCCATATTCT GCAAGTAAAA TCGGGGCTGA CCAAATTGCT
CTTAGCTTTT GGCATAGCTT CAACATTCCC GTAACTGTTA TACGTCCATT TAATACTTTT
GGCCCTCGCC AGAGTAATAG AGCTGTAATA CCTACGATTA TTAGTCAAAT TGCATCAGGT
GCAAAAAAAA TTGAACTGGG CTCGCTTTCG CCAACAAGAG ACTTTACTTA TGTGTTAGAT
ACATGCTCAG CCTATATAGC AATCGCCAAT AGCAATAAAG TCACTGGGAA GGTAATTAAT
GCTGCTAGTA ATTTTGAAAT ATCAATTGGT GATACAGCAA GCTTAATTGC ATCTTTAATG
CAATCTAAAG TAGATCTTTG TACTGATTCA AAGAGAATTA GGCCAATTAA TTCAGAGGTC
AACAGGTTAT ATGGAGACAA TAGTCTTATA AAAGACTTGA CAGATTGGCA GCCTAAATTC
TCTGGTAAAA ATGGATTTAA TAATGGCCTT AAAAAGACTA TAGAGTGGTT TCAAAAACCA
TATAACCTAA GTAAATATAA GCACAATATT TACTCAATAT AA
 
Protein sequence
MTRKVLVTGA DGFIGSHLVE SLLDNGYEVK PFCFYNSSGS WGWLEELCDE KSKELDVFLG 
DIRDPVCVKE AMKGCDMVFH LAALIGIPYS YIAARSYIET NIIGTLNVLE AAKDLGVSKI
IHTSTSETYG TAQSVPINEK HPLSGQSPYS ASKIGADQIA LSFWHSFNIP VTVIRPFNTF
GPRQSNRAVI PTIISQIASG AKKIELGSLS PTRDFTYVLD TCSAYIAIAN SNKVTGKVIN
AASNFEISIG DTASLIASLM QSKVDLCTDS KRIRPINSEV NRLYGDNSLI KDLTDWQPKF
SGKNGFNNGL KKTIEWFQKP YNLSKYKHNI YSI