Gene P9303_00501 details

Gene Information

Locus tag: P9303_00501
Symbol:
ID: 4778970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 50668
End bp: 51813
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 55%
IMG OID: 640085550
Product: SqdX
Protein accession: YP_001016072
Protein GI: 124021765
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
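
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but that reading reproduces the listed 1146 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(50668, 51813)
print(length)                     # 1146
print(protein_length_aa(length))  # 381
```

The length calculation does not depend on the strand, which is left blank in this record.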


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGATCG CCTTTTTCAC AGAGACCTTC CTCCCAAAAG TGGATGGAAT CGTGACACGA 
CTCACCAAAA CGGTGCAGCA CCTGGTTGAA GCCGGTGATG AAGTGATGGT GTTCTGCCCG
GAAGGCTGTC CTAGCGAATA CATGGGAGCC GAATTGATAG GAGTGCCAGC CATGCCACTG
CCGCTCTATC CGGAACTGAA GCTTGCCCTG CCCAGACCAG CCGTTGCCGA AGCCCTAGAA
ACCTTCGAGC CCGATCTCGT ACACGTTGTC AATCCAGCTG TACTAGGCCT TGGGGGAATT
TGGCTAGCCA AAACCAACGG AATCCCTCTA ATCGCTAGCT ACCACACCCA CCTCCCGAAA
TACCTAGAGC ATTACGGCAT GGGCATGCTG GAGCCCTTGC TCTGGGAACT TCTCAAGGCA
GCCCACAATC AAGCAATACT CAATCTCTGC ACCTCAACAG CAATGGTGTC TGAATTGAGC
GAGAAAGGGA TCCAAAACAC CGCCCTGTGG CAACGCGGCG TGGATACCGA ACTCTTTCGA
CCGGAACTGC GCAACGAAAC CATGCGCTTA CGCCTTTTAA ACACAAACGA CGATCAAGGT
GCCCTGCTTC TCTACGTAGG GCGGCTCTCC GCCGAAAAGC AAATCGAACG CATCAAACCT
GTTCTAGATC GCATACCCGA GGCACGATTG GCCCTAGTAG GCGATGGACC TCACCGCCAG
CAACTGGAAA AAGCATTTGA AGGCACTGCT ACAACATTTG TGGGCTATCT CGAAGGGGAA
GAACTAGCCA GTGCATATGC AAGCGGGGAT GCCTTTCTAT TCCCCTCAAG CACCGAAACC
CTTGGGCTCG TTTTACTGGA AGCAATGGCA GCAGGTTGTC CTGTAGTGGG AGCCAATCGT
GGCGGAATTC CAGACATCAT TACCGACGGA GTCAACGGCT GTCTCTACGA GCCGGATGGA
GTGGATGGAG GGTCCACCAG CCTCATCAAT GCGACCCGAC GACTGCTCGG CAACGATCTC
GAGCGTCAAG GTCTGCGCAA AGCAGCCCGT CAAGAAGCCG AACGCTGGGG ATGGGCCAGT
GCCACGCAAC AACTGCGGAG CTACTACAGA ACAATCCTCG GCCAACCCCT CAACCTGGCC
GCCTGA
 
Protein sequence
MKIAFFTETF LPKVDGIVTR LTKTVQHLVE AGDEVMVFCP EGCPSEYMGA ELIGVPAMPL 
PLYPELKLAL PRPAVAEALE TFEPDLVHVV NPAVLGLGGI WLAKTNGIPL IASYHTHLPK
YLEHYGMGML EPLLWELLKA AHNQAILNLC TSTAMVSELS EKGIQNTALW QRGVDTELFR
PELRNETMRL RLLNTNDDQG ALLLYVGRLS AEKQIERIKP VLDRIPEARL ALVGDGPHRQ
QLEKAFEGTA TTFVGYLEGE ELASAYASGD AFLFPSSTET LGLVLLEAMA AGCPVVGANR
GGIPDIITDG VNGCLYEPDG VDGGSTSLIN ATRRLLGNDL ERQGLRKAAR QEAERWGWAS
ATQQLRSYYR TILGQPLNLA A
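
The protein sequence can be related back to the gene sequence by translating it with translation table 11, as listed in the gene information. The sketch below is a minimal, dependency-free illustration rather than the method used to generate this record. It assumes the codon assignments of NCBI table 11 match the standard code and that the listed alternative initiation codons (including GTG, which opens this gene) are read as methionine at the first position. The helper names `clean_sequence`, `gc_percent` and `translate_cds` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(text: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean_sequence(text)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "GTGAAGATCG CCTTTTTCAC AGAGACCTTC CTCCCAAAAG TGGATGGAAT CGTGACACGA"
print(translate_cds(first_row))  # MKIAFFTETFLPKVDGIVTR
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the listed GC content of 55%.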