Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_25071 |
Symbol | |
ID | 4778649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2202949 |
End bp | 2204796 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640088028 |
Product | hypothetical protein |
Protein accession | YP_001018503 |
Protein GI | 124024196 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAATC AGATCTCTCC AAACCCAAAC CGAAATGATA TACATGCAAA AGATGGTGAT TTTAATGATG ATATATTTGA AAACTACGGA AACATTTGGG TAAAAAATGT TATCCTGAAC AACACATACG AACTGCACAA CAACGACGGC GGCACGCTGC ACAACTTCAA GAGCGGCACG CTGAACAACA TCGACGGCGG CTTGCTGTTC AACTACGAGA ACAGCACGCT GAACAACAAC GGCACGCTGA ACAACAACAG CCGTCTGAAG AACTACAGCA TGCTGAACAA CAACAGCAGC GGCACGCTGA ACATCTACGG CAGCGGCACG CTGAACATCT ATCTGTGGAA CGGAGGCGAC AGCCTCAGCA GCGGCACGCT GAACAACAAC GGCACGCTGA ACAACATCGA CGGCGGCTTG CTGTTCAACT ACTCCTGGCT TGCTACACAA CTAAGCACGC TGAACAACAG AGGGTATCTG AGCAACAACG AGAGCAGCAC GCTTAGAAAC TATAGCGCCA CGCTTAACAA CAGCGGCAGG CTTGACAACT TGGGAATTCT GATCAACCAC AAGATAAGCA CGCTGGATCG GACACCACCA CCCTGCACGT TTAACAACAG CGGCAGGCTT GACAACAGCG GCAGGCTGAA CAACATCGAC GGCGGCGAGA TTAACAACAA CGACGGCGGC GAGATTAACA ACAACGGCAC GCTGAACAAC ATCGACGGCG GCACGCTGAA CAACAACAGC ACGCTGAACA ACTACGGCGG CGGCATGCTG AACAACAACA GCACGCTTAT TAACAACGTA GCCGGCAGGC TGCACAACAA CAGCTGGCTT ATTAACAACA GAGCCGGCAT GCTGAACAAC AAAGGCACGC TTATTAACAA CAGCGCCGGC AAGCTGCACA ACAGCGGCAC GCTGAACAAC AGCGGTATAA TTAAAAATTC TGACAATAGA CTAAAAGAAA AGGGGTTTAT TAATAACGGA ACCTATACAG GCGATGGCCA AATTAAAGGC AGTTGGACAG ACCATGGTCA CGTCAAGCCA GGGAGTTCCG CAGGCGGAAT GCTCGTTGAT GGCCATTATT ACAAGAAAGG TGGCTCTACA GAAATAGAAC TAGGTGGTAT AGACGATGGC GATGGAGATC GCACCGCTAC AGAACACGAT TGGATTGAAA TTACTGGCAA CCTAGAACTC GCAGGAGAAC TCAATGTTTC GCTGATTGAT GAATTCAAAC TCTCTGCTGG TGATTTCTTT GTGATCACCA AAGTTGGTGG AACTCTCACT GGTCAATATG AGGGCCTTGA TGAAGGAGAT TCAGTAGGCA GATTTGCCAG CGATAATGGA GGTACCCTAG AGCTCTTCAT TACCTACAAA GGTGGTGATA GCAATGATAT TGCGCTCTAC ACCCAATCAT TATCTGGTGT TCTTCCTGAG AGCTTGCGTG AACCAAGAAT CATTGGTTCT GATGCTGATG ATTCCTTAAC TGGAACCTCT GCAGATGAAG TGATCTTTGG TGGTAGTGGT GATGATGTTT TACTAGGAGG CGGTGGAGAT GATCAAGTGA CTGGAGGCAA TGGCGATGAT CGGCTATATG GTGGTTTCGG TGATGACATT CTCAAAGGTG ATCGAGGCGC TGATACCTAC AGGCTGAGTC GTGGTAATGA TGTGATCATC GCCTTTTCAT TCGCTGAAAA CGATCGCATC TCTGTTGCTA ATGGAGTGGA CCTTTCCTTT AAGCAAGTTG GTGATGATCT ATTGATCACA GCAGATGGCA TTCACACCAC CTTGAAGGAT GTTGATAAGG GTGAGTTTCT CGCTGCTGAT GTGATTGACT TTATCTAG
|
Protein sequence | MGNQISPNPN RNDIHAKDGD FNDDIFENYG NIWVKNVILN NTYELHNNDG GTLHNFKSGT LNNIDGGLLF NYENSTLNNN GTLNNNSRLK NYSMLNNNSS GTLNIYGSGT LNIYLWNGGD SLSSGTLNNN GTLNNIDGGL LFNYSWLATQ LSTLNNRGYL SNNESSTLRN YSATLNNSGR LDNLGILINH KISTLDRTPP PCTFNNSGRL DNSGRLNNID GGEINNNDGG EINNNGTLNN IDGGTLNNNS TLNNYGGGML NNNSTLINNV AGRLHNNSWL INNRAGMLNN KGTLINNSAG KLHNSGTLNN SGIIKNSDNR LKEKGFINNG TYTGDGQIKG SWTDHGHVKP GSSAGGMLVD GHYYKKGGST EIELGGIDDG DGDRTATEHD WIEITGNLEL AGELNVSLID EFKLSAGDFF VITKVGGTLT GQYEGLDEGD SVGRFASDNG GTLELFITYK GGDSNDIALY TQSLSGVLPE SLREPRIIGS DADDSLTGTS ADEVIFGGSG DDVLLGGGGD DQVTGGNGDD RLYGGFGDDI LKGDRGADTY RLSRGNDVII AFSFAENDRI SVANGVDLSF KQVGDDLLIT ADGIHTTLKD VDKGEFLAAD VIDFI
|
| |