Gene Information |
Locus tag | P9303_22351 |
Symbol | |
ID | 4778251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1979150 |
End bp | 1980097 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087752 |
Product | HEAT repeat-containing protein |
Protein accession | YP_001018235 |
Protein GI | 124023928 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
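The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Minimal consistency check for the coordinate and length fields of the record.
# Assumptions (not stated in the record): coordinates are 1-based and inclusive,
# and the last codon is a stop codon that is excluded from the protein length.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length in base pairs of an inclusive [start_bp, end_bp] interval."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of amino acids for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(1979150, 1980097)
print(length, "bp")                      # Gene Length field
print(protein_length_aa(length), "aa")   # Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (948 bp) and Protein Length (315 aa) fields.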
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATTCTC TTCTCAAGTC GACAACATTG ACACAGGACC AACCCTTCAG CATCAATACT GACGAGTCGT TGCTGAATGA AGACGAAGCC GCTGAACTGG CCGATGAATT AAAGGCTCTG CTCAGACGTG GAGACACACC GAAAGCCGAT GCAGAGCAGA TTCAACGCAT GGTGTCTGGT CTTGGCGATC ATCGAGGATT GATTCGACGT ACCTTTGCAG AAAGTCTCGG CGGCGTAGGC AAGGCAGCGG TGCCAGCACT TTGTGTGGCC CTACACAAGC ATTCCAGTGC CACGGTGAGG CGCGCCGCCG CAAAGACCCT CAAACTGGTT GGCGATCCAA ACACCTTGCC AAATCTCCTC GAGGCCCTTC TCAACGATCC AGACCCTGTC GTTCAGGGAT CAGCAGCAGG AGCCATGGCG ATTTTTGGTG CAGATGCAGT AGAGCTCCTG CTAGAGGTAA TCATGAATCC CAACAGCACA GCAATGCAAT GTGGATTCGC TAGCTGGGGG CTGGCCTTTG TGGGAGCTCA AGCACCTGAT GCTCTTCGCA ATGCAGCCCA ATCCGACCAT GCAGAAATTA GAGCGGCAGC CATCGCAGGC CTGGGAGAGC AAATACAAGC CTTGGGTGAT ACGGATGCTC GAGAGCTTCT CCTAGGAGCT TTAGTTGACC CTGCAAGCGA TGTGCGGGCT GAAGCCACCA TATTGCTTGG CAAGCTACAC GAACCATCCT GGGCACAGCC AATGTTATTG GCCAGACTCG ATGATTTACA CCCTCAAGTC CGAAAGAATG CAGCCATGTC TCTAATGAAA CTGAAGGCAA CGGGAACACT CAATGAACTG CTTGCAAGAA AGTCTGCAGA ACAAGATGAA AGCGTCAATA GGATTCTGCA GCTCGCAATT GATCAGCTTT CGAGCGAAGA TCTAAAAAGC CATAATGATG AAAGCTAG
|
Protein sequence | MYSLLKSTTL TQDQPFSINT DESLLNEDEA AELADELKAL LRRGDTPKAD AEQIQRMVSG LGDHRGLIRR TFAESLGGVG KAAVPALCVA LHKHSSATVR RAAAKTLKLV GDPNTLPNLL EALLNDPDPV VQGSAAGAMA IFGADAVELL LEVIMNPNST AMQCGFASWG LAFVGAQAPD ALRNAAQSDH AEIRAAAIAG LGEQIQALGD TDARELLLGA LVDPASDVRA EATILLGKLH EPSWAQPMLL ARLDDLHPQV RKNAAMSLMK LKATGTLNEL LARKSAEQDE SVNRILQLAI DQLSSEDLKS HNDES
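The protein sequence can be related to the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is illustrative only: it builds the codon table from the standard NCBI layout for table 11 (bases ordered T, C, A, G), strips the spaces that separate the 10-character blocks, and stops at the first stop codon. The names `translate` and `CODON_TABLE` are hypothetical, and start-codon special cases (alternative initiation codons read as Met) are deliberately not handled.

```python
from itertools import product

# Amino acids for NCBI translation table 11, listed in the conventional
# codon order TTT, TTC, TTA, TTG, TCT, ... GGG (bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTATTCTC TTCTCAAGTC GACAACATTG"))
```

Passing the full gene sequence from the record to `translate` in the same way gives a protein string that can be compared against the Protein sequence field once its spaces are removed.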