Gene Information |
Locus tag | P9303_02101 |
Symbol | |
ID | 4777472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 227187 |
End bp | 228323 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640085709 |
Product | hypothetical protein |
Protein accession | YP_001016230 |
Protein GI | 124021923 |
COG category | [S] Function unknown |
COG ID | [COG3146] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
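
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 227187, End bp 228323
print(feature_lengths(227187, 228323))  # (1137, 378)
```

The result agrees with the listed Gene Length (1137 bp) and Protein Length (378 aa).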
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCATCACT GGGAAAACCT GGTTGGTGAG CAGGCGATTC CTTTCTTTCG CTGGCGGTGG CTTGCTGCTC TTGAAGACTC AAGCAGCATT TCCGCCAAAC ATGGTTGGCA GCCACTGCAC TTAGCCCTGT GGCGAGACGA CACGCCTGTA GCGGTTGCCC CTCTTTATCT CAAGGGGCAT AGCTATGGCG AATTTGTTTT TGATCAGGCC TTTGCTCGCC TGGCAGGTGA TCTTGGTTTG GGGTATTACC CAAAACTGCT GGGGATGAGC CCTGTGAGCC CTGTGCAGGG CTATCGCTTC TATGTGGCGC CAGGGGAGGA CGAGGCAGAG ATGACAGTTT TGATGCTGGA AACCATCGAT GCTTTTGCGC GTCGCAACCA GATTCTCAGC TGTAACTTTC TTTATGTTGA TCCGCATTGG CGGCCTTTGG CGGAAGCTGC GGGCTGTGCC ACTTGGTTGA ACCAGCAGAG CCTTTGGTCA GCAGATGGGC AGTCTGATTT CTCTGCCTAT CTCAATAGCT TCAATGCCAA TCAGCGACGC AATATCAAGC GTGAACGCAA GGCCGTCCAG CAGGCGGGGC TCACGGTTTC AGCGTTGACA GGAGCAGAAC TTGATGTGCA GCTGTTGAGG TGCATGTATG GCTTTTATGA GCAGCATTGC GCTCGTTGGG GACCTTGGGG AAGCAAGTAT CTCTCTGAAG CGTTTTTTGA GGCCTTGGCA GATTCGTCTC TCAGAGATCA GGTGGTGTTG TTTAGTGCCC ATCGTGAGAG TCCTAGAGAG CCTGTAGCGA TGTCTCTTTG TATACAGGAT GGACAAATGT TGTGGGGGCG TTATTGGGGT AGCAAGGAGG AGATCGATTG CCTTCATTTC GAGGTTTGTT ATTACGCGCC GATTGCCTGG GCGTTGGAAC ATGGTTTAGA GCATTTTGAT CCTGGCGCAG GCGGTCAACA CAAGCGCCGT AGGGGCTTTG TGGCGAAGCC CCATGCCAGC TTGCATCGTT GGTATGAACC GCGTATGGAT GCTTTGATCC GTGGATGGTT GAGGAAGGTC AATCCTCTAA TGCTCGAGGA GATTGAGTCG GTGAATGCTG ATTTGCCGTT TCGGGTTGAG CCTGCCCCTC AGTTAATTGT GGAATAA
Protein sequence | MHHWENLVGE QAIPFFRWRW LAALEDSSSI SAKHGWQPLH LALWRDDTPV AVAPLYLKGH SYGEFVFDQA FARLAGDLGL GYYPKLLGMS PVSPVQGYRF YVAPGEDEAE MTVLMLETID AFARRNQILS CNFLYVDPHW RPLAEAAGCA TWLNQQSLWS ADGQSDFSAY LNSFNANQRR NIKRERKAVQ QAGLTVSALT GAELDVQLLR CMYGFYEQHC ARWGPWGSKY LSEAFFEALA DSSLRDQVVL FSAHRESPRE PVAMSLCIQD GQMLWGRYWG SKEEIDCLHF EVCYYAPIAW ALEHGLEHFD PGAGGQHKRR RGFVAKPHAS LHRWYEPRMD ALIRGWLRKV NPLMLEEIES VNADLPFRVE PAPQLIVE
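
The protein sequence can be related to the gene sequence with a short translation sketch. This is an illustration only, not the pipeline that produced the record: it assumes translation table 11 uses the standard codon-to-amino-acid assignments, that the first codon is the initiator (table 11 allows alternative start codons such as TTG and GTG, which are read as M at the start), and that a single terminal stop codon is dropped. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        residues[0] = "M"  # assumption: first codon is the start codon
    protein = "".join(residues)
    # Drop one terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence in this record
print(translate_cds("TTGCATCACT GGGAAAACCT GGTTGGTGAG"))  # MHHWENLVGE
```

The output matches the first ten residues of the listed protein sequence; passing the full gene sequence should reproduce the whole protein under the same assumptions.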