Gene P9303_17471 details

Gene Information

Locus tag: P9303_17471
Symbol:
ID: 4778116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1527832
End bp: 1528857
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 53%
IMG OID: 640087254
Product: putative mRNA binding protein
Protein accession: YP_001017754
Protein GI: 124023447
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
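
The start and end coordinates, gene length, and protein length listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Illustrative helpers only; names are hypothetical, not a database API.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length_bp = gene_length(1527832, 1528857)
print(length_bp)                  # 1026
print(protein_length(length_bp))  # 341
```

Both values match the Gene Length and Protein Length fields of this record.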


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGGCA AAGAACGCTG CCTTAGACCT CCATCCTGCA GGGCGAAGGG CCTTCAGTTG 
CTGATCGAAC GCGCTTCAAT ATGCGCGTTT GATGATTCAG CTGTTTTGAA GATTCTGATC
ATGGGGGGGA CCCGATTTGT CGGCAAGCCT CTGGTTACTC GACTTCAGGC CCAAGGCCAT
GCGCTCACGT TGTTCACTCG TGGCCGCCAT TCTTTGCCAG ATGGTGTGGA ACATCTCAGT
GGTGATCGAA CCACCACTGA GGGGCTGAGT CGTCTTCAAG GCCGAAGCTT CGATGTCATC
GTCGACAGCT CAGGGCGCAA GCTTGAAGAC AGTCAAAGGG TGGTGGCCTG TACAGGAGAG
CCAAAGCATC GTTTCCTCTA TGTCAGTTCC GCTGGCGTCT ATGCGGATTC CGAACACTGG
CCACTGAATG AGGAGAGTGC CACCGACCCG AACAGTCGTC ATGCCGGCAA GGCTCAGACC
GAATCATGGC TGCTTCAGCA AGGAATTCCC TTTACCAGTT TCCGACCTAC TTATATCTAT
GGTCCTGGTA ATTACAACCC GATTGAACGT TGGTTTTTCG ATCGTATCGT CCATAACCGA
CCGGTTCCGT TGCCACGAGA TGGCACCACC ATCACCCAAT TGGGGCATGT TGTTGATCTG
GCTGATGCCA TGGTTCGTTC CCTTGAGGTG GAGACAGCGA CGAATCGCAT TTACAACTGT
TCCAGCAAGC GTGGTATCAC CTTCAGGGGC TTGATTGCAG CGGCAGCAAG GGCTTGTGGC
AAAGATCCAA ATACCGTTGA GCTTCGTTCT TTTGATCCTT CAGGCCTGAA TCCCAAAGCT
CGTAAGGCCT TCCCGCTGAG GCTGAGTCAT TTCCTTACCG ATATCACCAG GGTGGAGCGG
GAATTGGCCT GGCAACCACG CTTTGACCTT GAGACTGGCC TCGAAGATAG CTACTGCAAC
GACTACTCCT TGAAGCCAAC GGCTGAACCA GATTTCAGTG CCGATCAATC CTTGATCGGG
GTTTGA
 
Protein sequence
MIGKERCLRP PSCRAKGLQL LIERASICAF DDSAVLKILI MGGTRFVGKP LVTRLQAQGH 
ALTLFTRGRH SLPDGVEHLS GDRTTTEGLS RLQGRSFDVI VDSSGRKLED SQRVVACTGE
PKHRFLYVSS AGVYADSEHW PLNEESATDP NSRHAGKAQT ESWLLQQGIP FTSFRPTYIY
GPGNYNPIER WFFDRIVHNR PVPLPRDGTT ITQLGHVVDL ADAMVRSLEV ETATNRIYNC
SSKRGITFRG LIAAAARACG KDPNTVELRS FDPSGLNPKA RKAFPLRLSH FLTDITRVER
ELAWQPRFDL ETGLEDSYCN DYSLKPTAEP DFSADQSLIG V
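
The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before computing values such as length or GC content. A minimal sketch of that step follows; the helper names are hypothetical, and the demo uses only the first two display blocks of the gene sequence rather than the full record.

```python
# Illustrative helpers only; names are hypothetical.

def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used as a short demo.
demo = clean_sequence("ATGATTGGCA AAGAACGCTG")
print(len(demo))  # 20
print(gc_percent(demo))
```

Applying the same two functions to the full gene sequence gives values that can be compared with the Gene Length and GC content fields.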