Gene Information |
Locus tag | P9303_13151 |
Symbol | |
ID | 4775933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1118763 |
End bp | 1120433 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640086823 |
Product | hypothetical protein |
Protein accession | YP_001017327 |
Protein GI | 124023020 |
COG category | [R] General function prediction only; [S] Function unknown |
COG ID | [COG0645] Predicted kinase; [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
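The coordinate and length fields in the Gene Information table agree with one another, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and assumes the final codon of the CDS is a stop codon; the record states neither convention explicitly. The function names `gene_length` and `protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming the last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length(1118763, 1120433)
print(length, protein_length(length))  # prints: 1671 556
```

Under these assumptions the result matches the listed Gene Length (1671 bp) and Protein Length (556 aa).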
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.309836 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACGA CCAAGCAAAA CCAACCCGCT TTGATTCAGG CATTAATGAA GCCAGTGGCT TATCCACATC CGGTGAAGGT TGTGGAGCTT GTCGAAACCC ACATTTCCTG GGTGCTTCTG ACCGGTTCCT ATGTCTACAA GCTCAAAAAA GCTGTGCATT TTGACTTTGT AGACGCCACC ACGCTGAAGC AACGCCTTCA TTTTTGCCAA GAGGAGCTGC GCCTAAATCG TCGCTTGGCT CCAGACCTTT ATCTAGGTGT GACCAGGATT CTCGAATCAG TTGATGGCGC CAAAGTTCTC GATGAAAACT TTGAATCGAA TGATCCATCC ACAAAGGGTG TGGTTGATGT TGCTGTGAAG ATGCGTCAGT TCCCTGTCTC TCGTCTTCTC AGTGTTTATC TGAGTGATGG GGTTTTGAAG ACTGAATCCC TAAAAAGACT TGCTTTCGAA CTTGCAGAGT TTCATCTCAG TGTCAAAACG GCTGTTGCAG ATGGAAATTT TGGTGGATTT GATGCGGTAA TCAATCCTGT TCATGCCAAT TTGCGTGTAT TGGATCAATT GACACTTCCT GAGCCATTAG AGCTTTGGCT TGAGCAGCAT CGAGCTTGGA TTAAGAGTAT TGAGCCAGAA CTGGCTTTTC GTTTTAAACA ACGACTCAAC GCAGGAGCCA TACGCGAATG CCATGGCGAT CTTCATGTCG GCAATATTTA TCTAAACAAT GATGACCGCC TGGAAGTCTT CGATGCCATT GATTTTAATC CAAGTCTTCG TTGGATCGAT CCGATTAGTG AAATGGCATT TCTAGTGATG GATTTTGAGG TACACGATCA TCAGGGTGAT GCCATGGTGA TACTCAACGA ATGGCTTGAG CAGACAGGCG ATTACAAGGC TCTTGACCTT TGGCCTTGGT ATTCGGCTTA CAGAGCGCTT GTGCGCGCCA AGGTGAGTGG CCTGCAATGG CAACAACTCA CCTCCCAGAG TCAAAATGAC TCTGTTGATC AGCAACACCT TCAACGGCTG CTTAAGGATC TCAACCTCTA CATTCAGCGG GCAAGAGAGG TCCAGCAAAC AAAATCAGCC GGTATTGTGC TGATGCATGG ACTGAGCGGT AGCGGCAAGT CTTATATAAG TGAGCAGCTC TATCAGCAGT TGCCAGCAGT ACGCTTGCGT TCAGATGTCG AACGTGAGCG TGCTTTCGGT CGCCGACCAT TACACAAGCT GCTTGGCTTT GAGAAAGGAT CGATGACAAG TGGTGGCATT ACACCAATTT TTCAAGGAGA CCCTTATCGA CCTGAGGTTA CCAGTTGGTT GTTTGATCAA TGCCTACCGG CATTGACTCA AAGTTGCTTG AGCAGTGGTT ACACCACAAT TGTTGATGCC ACCTTCTTAC GTGAACGAGA ACGACAGCGA ATGTTTGTAT TGGCGCGTCA ACAAGGATGC CCGATTGCCA TCGTGGCCTG TGAATGTAGC GATTTAACAG CTCAGGAGCG CATCGCTACA AGGATGGGGA TTGGAACTGA CCCCTCTGAG GCAGATTTAA GCGTGCGTGA ACTGCAGAAA GCGTGGATTG AACCTCTAAC AACCTTTGAG CAAGAGTTGA CGGTTAGGTT TACGGAAAAA ACTCCAATCA GCATTGGCCT AGAACGTTTG CGTGTTTTGC TGAATCCTTA G
|
Protein sequence | MTTTKQNQPA LIQALMKPVA YPHPVKVVEL VETHISWVLL TGSYVYKLKK AVHFDFVDAT TLKQRLHFCQ EELRLNRRLA PDLYLGVTRI LESVDGAKVL DENFESNDPS TKGVVDVAVK MRQFPVSRLL SVYLSDGVLK TESLKRLAFE LAEFHLSVKT AVADGNFGGF DAVINPVHAN LRVLDQLTLP EPLELWLEQH RAWIKSIEPE LAFRFKQRLN AGAIRECHGD LHVGNIYLNN DDRLEVFDAI DFNPSLRWID PISEMAFLVM DFEVHDHQGD AMVILNEWLE QTGDYKALDL WPWYSAYRAL VRAKVSGLQW QQLTSQSQND SVDQQHLQRL LKDLNLYIQR AREVQQTKSA GIVLMHGLSG SGKSYISEQL YQQLPAVRLR SDVERERAFG RRPLHKLLGF EKGSMTSGGI TPIFQGDPYR PEVTSWLFDQ CLPALTQSCL SSGYTTIVDA TFLRERERQR MFVLARQQGC PIAIVACECS DLTAQERIAT RMGIGTDPSE ADLSVRELQK AWIEPLTTFE QELTVRFTEK TPISIGLERL RVLLNP
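The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases and that the spaces between 10-base blocks are only layout. The function name `gc_content` is hypothetical, and the example runs on the first three blocks only, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGACCACGA CCAAGCAAAA CCAACCCGCT"
print(f"{gc_content(prefix):.0%}")
```

Passing the full Gene sequence string instead of `prefix` would give a value to compare against the listed GC content, bearing in mind that the record reports a rounded percentage.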