Gene P9303_00341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_00341 
Symbol 
ID4775928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp36453 
End bp37373 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content54% 
IMG OID640085533 
Productnucleoside-diphosphate-sugar epimerase 
Protein accessionYP_001016056 
Protein GI124021749 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACAG AACTTGTCAA GCAGAGTCCG CCTCTCCCCA CCGGCTCAAA GTTGCTGGTT 
CTCGGTGGAG GATTCAGTGG CCAGCATGTA GTGGCTCTGG CGAGAGCACT TGGCAGTACT
GCTATCTGCA GCCGCAGAGA CATCAACAGC CCAGGCGCAG ACATGGCATT TGATAGTGCC
ACCAAGCTGT TGCCAACCAC GAAAGTTCTA GAAGGAGTCA CTCATTTACT CAGCTGCATA
CCGCCAGCAG CCGATGGGAA AGATCCAGTA CTGACATGCT TGGGCGACCA ACTCAAAGCG
TTGCCATTGC AATGGGTTGG CTACCTCTCC ACCACAGGGG TTTACGGAGA TCGCCAAGGA
CGCTGGGTCA CAGAAATCGA TCATCCTCAG CCTCAGCAAG CACGGAGCAA ACGAAGATTG
GCCTGTGAAG AGGCCTGGCA AGCTTCGGGA TTACCCCTGC AGATTCTGCG ACTGCCTGGC
ATCTACGGAC CAGGCCGCTC AGTGCTTAAA AGCGTCAACA CAGGTCAAAG CAGAATGATC
CACAAGCCCA ACCAGGTGTT TTCAAGAATT CATGTCGATG ACATCGCAGG AGCCATCCTG
CATCTAATCC AATGCGCTGC TGATGGACAG CGACCCATCG TGATCAACGT CACCGATGAC
ATGCCAACAG CTTATACAGA CGTACTCGGG TTCGCCGCCC AACTACTTGG AAAGTCCCTA
CCCGAAATTG AGCCGTTTGC AGTTGCCGCT GCACAGATGA ATCCCATGGC TCTCTCCTTC
TGGCAAGAGA ATCGCAGGGT CAGCAATCAG CTCCTATGCC GCGAGCTTGG CTATTCCCTG
ATGCATCCCA ACTATCACTC CGGCCTTAGA GACTGTTATC TGGCAGAAGG TTTCAAGGTC
TCACAGACGA ATTTTCCTTA G
 
Protein sequence
MLTELVKQSP PLPTGSKLLV LGGGFSGQHV VALARALGST AICSRRDINS PGADMAFDSA 
TKLLPTTKVL EGVTHLLSCI PPAADGKDPV LTCLGDQLKA LPLQWVGYLS TTGVYGDRQG
RWVTEIDHPQ PQQARSKRRL ACEEAWQASG LPLQILRLPG IYGPGRSVLK SVNTGQSRMI
HKPNQVFSRI HVDDIAGAIL HLIQCAADGQ RPIVINVTDD MPTAYTDVLG FAAQLLGKSL
PEIEPFAVAA AQMNPMALSF WQENRRVSNQ LLCRELGYSL MHPNYHSGLR DCYLAEGFKV
SQTNFP