Gene P9303_13151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_13151 
Symbol 
ID4775933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1118763 
End bp1120433 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content47% 
IMG OID640086823 
Producthypothetical protein 
Protein accessionYP_001017327 
Protein GI124023020 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0645] Predicted kinase
[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.309836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACGA CCAAGCAAAA CCAACCCGCT TTGATTCAGG CATTAATGAA GCCAGTGGCT 
TATCCACATC CGGTGAAGGT TGTGGAGCTT GTCGAAACCC ACATTTCCTG GGTGCTTCTG
ACCGGTTCCT ATGTCTACAA GCTCAAAAAA GCTGTGCATT TTGACTTTGT AGACGCCACC
ACGCTGAAGC AACGCCTTCA TTTTTGCCAA GAGGAGCTGC GCCTAAATCG TCGCTTGGCT
CCAGACCTTT ATCTAGGTGT GACCAGGATT CTCGAATCAG TTGATGGCGC CAAAGTTCTC
GATGAAAACT TTGAATCGAA TGATCCATCC ACAAAGGGTG TGGTTGATGT TGCTGTGAAG
ATGCGTCAGT TCCCTGTCTC TCGTCTTCTC AGTGTTTATC TGAGTGATGG GGTTTTGAAG
ACTGAATCCC TAAAAAGACT TGCTTTCGAA CTTGCAGAGT TTCATCTCAG TGTCAAAACG
GCTGTTGCAG ATGGAAATTT TGGTGGATTT GATGCGGTAA TCAATCCTGT TCATGCCAAT
TTGCGTGTAT TGGATCAATT GACACTTCCT GAGCCATTAG AGCTTTGGCT TGAGCAGCAT
CGAGCTTGGA TTAAGAGTAT TGAGCCAGAA CTGGCTTTTC GTTTTAAACA ACGACTCAAC
GCAGGAGCCA TACGCGAATG CCATGGCGAT CTTCATGTCG GCAATATTTA TCTAAACAAT
GATGACCGCC TGGAAGTCTT CGATGCCATT GATTTTAATC CAAGTCTTCG TTGGATCGAT
CCGATTAGTG AAATGGCATT TCTAGTGATG GATTTTGAGG TACACGATCA TCAGGGTGAT
GCCATGGTGA TACTCAACGA ATGGCTTGAG CAGACAGGCG ATTACAAGGC TCTTGACCTT
TGGCCTTGGT ATTCGGCTTA CAGAGCGCTT GTGCGCGCCA AGGTGAGTGG CCTGCAATGG
CAACAACTCA CCTCCCAGAG TCAAAATGAC TCTGTTGATC AGCAACACCT TCAACGGCTG
CTTAAGGATC TCAACCTCTA CATTCAGCGG GCAAGAGAGG TCCAGCAAAC AAAATCAGCC
GGTATTGTGC TGATGCATGG ACTGAGCGGT AGCGGCAAGT CTTATATAAG TGAGCAGCTC
TATCAGCAGT TGCCAGCAGT ACGCTTGCGT TCAGATGTCG AACGTGAGCG TGCTTTCGGT
CGCCGACCAT TACACAAGCT GCTTGGCTTT GAGAAAGGAT CGATGACAAG TGGTGGCATT
ACACCAATTT TTCAAGGAGA CCCTTATCGA CCTGAGGTTA CCAGTTGGTT GTTTGATCAA
TGCCTACCGG CATTGACTCA AAGTTGCTTG AGCAGTGGTT ACACCACAAT TGTTGATGCC
ACCTTCTTAC GTGAACGAGA ACGACAGCGA ATGTTTGTAT TGGCGCGTCA ACAAGGATGC
CCGATTGCCA TCGTGGCCTG TGAATGTAGC GATTTAACAG CTCAGGAGCG CATCGCTACA
AGGATGGGGA TTGGAACTGA CCCCTCTGAG GCAGATTTAA GCGTGCGTGA ACTGCAGAAA
GCGTGGATTG AACCTCTAAC AACCTTTGAG CAAGAGTTGA CGGTTAGGTT TACGGAAAAA
ACTCCAATCA GCATTGGCCT AGAACGTTTG CGTGTTTTGC TGAATCCTTA G
 
Protein sequence
MTTTKQNQPA LIQALMKPVA YPHPVKVVEL VETHISWVLL TGSYVYKLKK AVHFDFVDAT 
TLKQRLHFCQ EELRLNRRLA PDLYLGVTRI LESVDGAKVL DENFESNDPS TKGVVDVAVK
MRQFPVSRLL SVYLSDGVLK TESLKRLAFE LAEFHLSVKT AVADGNFGGF DAVINPVHAN
LRVLDQLTLP EPLELWLEQH RAWIKSIEPE LAFRFKQRLN AGAIRECHGD LHVGNIYLNN
DDRLEVFDAI DFNPSLRWID PISEMAFLVM DFEVHDHQGD AMVILNEWLE QTGDYKALDL
WPWYSAYRAL VRAKVSGLQW QQLTSQSQND SVDQQHLQRL LKDLNLYIQR AREVQQTKSA
GIVLMHGLSG SGKSYISEQL YQQLPAVRLR SDVERERAFG RRPLHKLLGF EKGSMTSGGI
TPIFQGDPYR PEVTSWLFDQ CLPALTQSCL SSGYTTIVDA TFLRERERQR MFVLARQQGC
PIAIVACECS DLTAQERIAT RMGIGTDPSE ADLSVRELQK AWIEPLTTFE QELTVRFTEK
TPISIGLERL RVLLNP