Gene Information |
Locus tag | P9303_25501 |
Symbol | |
ID | 4778922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2244152 |
End bp | 2245195 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088071 |
Product | putative polysialic acid capsule expression protein KpsF |
Protein accession | YP_001018546 |
Protein GI | 124024239 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
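
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2244152, 2245195)
print(length)                     # 1044
print(protein_length_aa(length))  # 347
```

Both values match the Gene Length (1044 bp) and Protein Length (347 aa) rows of this record.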
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTGG TATGCAGGAG AATTACCGCA TCATCACGAA CTTACAGTTT GTCTGCCCTC ACCCGCTGCC TGCAGGAAGA GGCTGCCGCC ATCGCCGTCG CTGCCGAACG ACTGAGCAGC AGCCAGGTGG AAAAGGCATT GGTCTTGCTT GAACGCTGTG GTGATCAACG CGCCAAATTG GTGATCACCG GCGTAGGCAA AAGCGGCATC GTTGCACGCA AAATCGCTGC CACCTTTTCC TCTATCGGCC TGATGGCCCT TTACCTAAAC CCCCTCGATG CAATGCATGG CGATCTGGGA GTGGTCGCCC AAGAGGACGT CTGCCTACTG CTCTCCAACA GCGGGGAGAC TGCTGAACTG TTAGAAGTGC TTCCCCACCT CAAACGACGT GGCACCGCTC GCATCGCTCT GGTTGGAAAG CCCGACTCAT CGCTTGCTCG CGGTAGTGAC GTGGTGCTTG AAGCAAGTGT TGATCGCGAA GTGTGTCCGT TGAATCTTGC CCCCACCGCC AGCACAGCGG TGGCCATGGC GATCGGTGAT GCCCTTGCTG CCATATGGAT GGAACGTCGC AACATCTCAC CCGCAGACTT CGCGTTCAAC CACCCAGCCG GTTCGCTCGG CAAACAGCTC ACCCTCACCG CTTCGGACCT AATGGTGCCA GTTGCAAAGG TCCAGCCACT ACAACCCAAC ACCAGTCTGC AAGACGTGAT CTGCAAGCTG ACCCAAGATG GTATTGGTAG TGGCTGGGTG GAAGACCCCT CCACCGCCGG ACTGCTGCTG GGCCTCATTA CCGATGGTGA TCTACGCCGC GCCCTGCGTG ATCACAGTGC CGAGAATTGG GCCAGCCTCA GTGCTGCAGA CCTGATGACA GCCGATCCAA TCACCGTGGA CGCTGATCTG CTCGCAGTCG AAGCGATCAA GCAGATGGAA TGCAACCGTC GCAAACCCAT CTCAGTACTG CCCGTCGTAG GCCCTGACAG CTCAGGCAAT CTGTTGCTCG GACTCCTACG GCTTCATGAT CTGATTCAAG CAGGGCTGAC ATGA
|
Protein sequence | MAVVCRRITA SSRTYSLSAL TRCLQEEAAA IAVAAERLSS SQVEKALVLL ERCGDQRAKL VITGVGKSGI VARKIAATFS SIGLMALYLN PLDAMHGDLG VVAQEDVCLL LSNSGETAEL LEVLPHLKRR GTARIALVGK PDSSLARGSD VVLEASVDRE VCPLNLAPTA STAVAMAIGD ALAAIWMERR NISPADFAFN HPAGSLGKQL TLTASDLMVP VAKVQPLQPN TSLQDVICKL TQDGIGSGWV EDPSTAGLLL GLITDGDLRR ALRDHSAENW ASLSAADLMT ADPITVDADL LAVEAIKQME CNRRKPISVL PVVGPDSSGN LLLGLLRLHD LIQAGLT
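
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the following joins the blocks, computes GC content, and translates a nucleotide string. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for all non-initiating codons; table 11 differs mainly in the alternative start codons it allows, which does not matter here because this gene starts with ATG.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
head = clean_sequence("ATGGCGGTGG TATGCAGGAG AATTACCGCA")
print(translate(head))  # MAVVCRRITA
```

The printed residues match the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.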