Gene P9303_25501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_25501 
Symbol 
ID4778922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2244152 
End bp2245195 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content58% 
IMG OID640088071 
Productputative polysialic acid capsule expression protein KpsF 
Protein accessionYP_001018546 
Protein GI124024239 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTGG TATGCAGGAG AATTACCGCA TCATCACGAA CTTACAGTTT GTCTGCCCTC 
ACCCGCTGCC TGCAGGAAGA GGCTGCCGCC ATCGCCGTCG CTGCCGAACG ACTGAGCAGC
AGCCAGGTGG AAAAGGCATT GGTCTTGCTT GAACGCTGTG GTGATCAACG CGCCAAATTG
GTGATCACCG GCGTAGGCAA AAGCGGCATC GTTGCACGCA AAATCGCTGC CACCTTTTCC
TCTATCGGCC TGATGGCCCT TTACCTAAAC CCCCTCGATG CAATGCATGG CGATCTGGGA
GTGGTCGCCC AAGAGGACGT CTGCCTACTG CTCTCCAACA GCGGGGAGAC TGCTGAACTG
TTAGAAGTGC TTCCCCACCT CAAACGACGT GGCACCGCTC GCATCGCTCT GGTTGGAAAG
CCCGACTCAT CGCTTGCTCG CGGTAGTGAC GTGGTGCTTG AAGCAAGTGT TGATCGCGAA
GTGTGTCCGT TGAATCTTGC CCCCACCGCC AGCACAGCGG TGGCCATGGC GATCGGTGAT
GCCCTTGCTG CCATATGGAT GGAACGTCGC AACATCTCAC CCGCAGACTT CGCGTTCAAC
CACCCAGCCG GTTCGCTCGG CAAACAGCTC ACCCTCACCG CTTCGGACCT AATGGTGCCA
GTTGCAAAGG TCCAGCCACT ACAACCCAAC ACCAGTCTGC AAGACGTGAT CTGCAAGCTG
ACCCAAGATG GTATTGGTAG TGGCTGGGTG GAAGACCCCT CCACCGCCGG ACTGCTGCTG
GGCCTCATTA CCGATGGTGA TCTACGCCGC GCCCTGCGTG ATCACAGTGC CGAGAATTGG
GCCAGCCTCA GTGCTGCAGA CCTGATGACA GCCGATCCAA TCACCGTGGA CGCTGATCTG
CTCGCAGTCG AAGCGATCAA GCAGATGGAA TGCAACCGTC GCAAACCCAT CTCAGTACTG
CCCGTCGTAG GCCCTGACAG CTCAGGCAAT CTGTTGCTCG GACTCCTACG GCTTCATGAT
CTGATTCAAG CAGGGCTGAC ATGA
 
Protein sequence
MAVVCRRITA SSRTYSLSAL TRCLQEEAAA IAVAAERLSS SQVEKALVLL ERCGDQRAKL 
VITGVGKSGI VARKIAATFS SIGLMALYLN PLDAMHGDLG VVAQEDVCLL LSNSGETAEL
LEVLPHLKRR GTARIALVGK PDSSLARGSD VVLEASVDRE VCPLNLAPTA STAVAMAIGD
ALAAIWMERR NISPADFAFN HPAGSLGKQL TLTASDLMVP VAKVQPLQPN TSLQDVICKL
TQDGIGSGWV EDPSTAGLLL GLITDGDLRR ALRDHSAENW ASLSAADLMT ADPITVDADL
LAVEAIKQME CNRRKPISVL PVVGPDSSGN LLLGLLRLHD LIQAGLT