Gene Information |
Locus tag | P9515_12661 |
Symbol | |
ID | 4718872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 1111964 |
End bp | 1113532 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640080950 |
Product | hypothetical protein |
Protein accession | YP_001011580 |
Protein GI | 123966499 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
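The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1111964
end_bp = 1113532
gene_length_bp = 1569
protein_length_aa = 522

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for the values listed here.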
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAATAG ATGATCATCA CTATGACTTC ATAATAATTG GAAGCGGTGC TGGTGGAGCC ACTTTAGCCC GTCAATTATC AAGAAAGGGG AAATGGGTCC TAGTTCTAGA GAGGGGAGGT CAGCTTCCTT TAGAAGAACA GAATATTATA GGCACGGATT TATTTAGAAA AACAAGATAT CACCCTGAGG GGGAAAATTG GCTTGGTCCT GATGGGGATC CTTTTGCTCC TCAAACTGTA TATGCGCTTG GAGGGAACAC TAAAATCTGG GGTGCAGTTT TAGAAAGGAT GCGAACTGAA GATTTTGAAG ATCTTTCTTT ACAAGAGGGA ACTTCTCCAT CATGGCCCAT ATCATATGAG GAACTCGAAC CATTTTATAG AAAAGCAGAA GAAATTTATA ATGTAAAAGG GAGGCAAGGT ATTGATAAAA CAGAGCCTAA TAGATCGAGT GGTTATGATA ATCCCCCAAA ATCTATAGAC CCTTTATTCA AAGAGATACA AAATATTTTG ATAGAAGAGG GATTTAATCC CTATTATCTA CCAATTAGCT GGCCTGAAAG TTCTCAAGAT ATTGATTTTG ATAACTGTGC AATGTTTCAA AAAGGAGATG CCCAACTTTA TGGAATTTAT AATTCAAATC AAGATTTCCT GAGAATTAAA ACCAACGCTA AAGTTTTAAA ACTAGATGTA AACTCTTCTG GGAAATCTGT CAAAGGTGTT GAGGCAGAGA TAGATGGGGA TAAATGGCTT TTCTCCTCAG ATATTGTTAT TCTCTCAGCT GGAGCAATTA ATACTCCTAT AATTTTATTA AATTCAAAAT CTTCATCACA TCCAAATGGT TTAGCTAATA GTTCAAAAAT GGTTGGCAAA AATTTGATGA ATATTCAAAT GACTTGTATT CTTCAAAGAG CTAATAATCT CACCAGTGGA TATTTTTCAA AAACCTTAGG ATTAAATGAT TTTTACTTTG GGGATAAAAA CGTTAATTTC CCATTAGGTC ACATACAAAC TGGAGGAGGA GTACTTAGGG ATGCTTTTTT TGCGGAATCT CCGCCAGTTC TATCTTTAAT TACAAAATTA ATACCCGACT TTGGATTAAA GAATTTAGCA AAAAGGTCAA TATCATGGTG GGCAATGACG GAAGTCTTAC CAGATCCTGA AAACGCAGTA ACCATTCAAA ATAATAAGGT AAAAATTAAT TACATACATA ACAATCTTGA GGCACACGAT CGATTAGTTT ATAGATGGCT TGACACTTTA AAAGTTATTG AGAATAACCC CCTTTCTATA TCAATAACCA GAACGCCGGC ACACCCAAGA GGATTAGCTC CGCTAAGCAT AGTTGGTTAT TCCTGCGGGA CTTGCAAGAT GGGAAATGAC CCGAAAACCT CAGTAGTTAA TAAAAACGGG AAGTGTCATG ACCTTGATAA TCTTTATATT TCAGATGCGA GTATATTTCC AAGCTGCCCA AGTATTGGGC ATGGTTTAAC AGTTATTGCT ATGTCCCTCA AATTAGGAGA TTATCTTACT TATGGAAATC TTTATAAACC ATTAATATGC AGACCTTAA
Protein sequence | MIIDDHHYDF IIIGSGAGGA TLARQLSRKG KWVLVLERGG QLPLEEQNII GTDLFRKTRY HPEGENWLGP DGDPFAPQTV YALGGNTKIW GAVLERMRTE DFEDLSLQEG TSPSWPISYE ELEPFYRKAE EIYNVKGRQG IDKTEPNRSS GYDNPPKSID PLFKEIQNIL IEEGFNPYYL PISWPESSQD IDFDNCAMFQ KGDAQLYGIY NSNQDFLRIK TNAKVLKLDV NSSGKSVKGV EAEIDGDKWL FSSDIVILSA GAINTPIILL NSKSSSHPNG LANSSKMVGK NLMNIQMTCI LQRANNLTSG YFSKTLGLND FYFGDKNVNF PLGHIQTGGG VLRDAFFAES PPVLSLITKL IPDFGLKNLA KRSISWWAMT EVLPDPENAV TIQNNKVKIN YIHNNLEAHD RLVYRWLDTL KVIENNPLSI SITRTPAHPR GLAPLSIVGY SCGTCKMGND PKTSVVNKNG KCHDLDNLYI SDASIFPSCP SIGHGLTVIA MSLKLGDYLT YGNLYKPLIC RP