Gene P9303_25071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_25071 
Symbol 
ID4778649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2202949 
End bp2204796 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content48% 
IMG OID640088028 
Producthypothetical protein 
Protein accessionYP_001018503 
Protein GI124024196 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAATC AGATCTCTCC AAACCCAAAC CGAAATGATA TACATGCAAA AGATGGTGAT 
TTTAATGATG ATATATTTGA AAACTACGGA AACATTTGGG TAAAAAATGT TATCCTGAAC
AACACATACG AACTGCACAA CAACGACGGC GGCACGCTGC ACAACTTCAA GAGCGGCACG
CTGAACAACA TCGACGGCGG CTTGCTGTTC AACTACGAGA ACAGCACGCT GAACAACAAC
GGCACGCTGA ACAACAACAG CCGTCTGAAG AACTACAGCA TGCTGAACAA CAACAGCAGC
GGCACGCTGA ACATCTACGG CAGCGGCACG CTGAACATCT ATCTGTGGAA CGGAGGCGAC
AGCCTCAGCA GCGGCACGCT GAACAACAAC GGCACGCTGA ACAACATCGA CGGCGGCTTG
CTGTTCAACT ACTCCTGGCT TGCTACACAA CTAAGCACGC TGAACAACAG AGGGTATCTG
AGCAACAACG AGAGCAGCAC GCTTAGAAAC TATAGCGCCA CGCTTAACAA CAGCGGCAGG
CTTGACAACT TGGGAATTCT GATCAACCAC AAGATAAGCA CGCTGGATCG GACACCACCA
CCCTGCACGT TTAACAACAG CGGCAGGCTT GACAACAGCG GCAGGCTGAA CAACATCGAC
GGCGGCGAGA TTAACAACAA CGACGGCGGC GAGATTAACA ACAACGGCAC GCTGAACAAC
ATCGACGGCG GCACGCTGAA CAACAACAGC ACGCTGAACA ACTACGGCGG CGGCATGCTG
AACAACAACA GCACGCTTAT TAACAACGTA GCCGGCAGGC TGCACAACAA CAGCTGGCTT
ATTAACAACA GAGCCGGCAT GCTGAACAAC AAAGGCACGC TTATTAACAA CAGCGCCGGC
AAGCTGCACA ACAGCGGCAC GCTGAACAAC AGCGGTATAA TTAAAAATTC TGACAATAGA
CTAAAAGAAA AGGGGTTTAT TAATAACGGA ACCTATACAG GCGATGGCCA AATTAAAGGC
AGTTGGACAG ACCATGGTCA CGTCAAGCCA GGGAGTTCCG CAGGCGGAAT GCTCGTTGAT
GGCCATTATT ACAAGAAAGG TGGCTCTACA GAAATAGAAC TAGGTGGTAT AGACGATGGC
GATGGAGATC GCACCGCTAC AGAACACGAT TGGATTGAAA TTACTGGCAA CCTAGAACTC
GCAGGAGAAC TCAATGTTTC GCTGATTGAT GAATTCAAAC TCTCTGCTGG TGATTTCTTT
GTGATCACCA AAGTTGGTGG AACTCTCACT GGTCAATATG AGGGCCTTGA TGAAGGAGAT
TCAGTAGGCA GATTTGCCAG CGATAATGGA GGTACCCTAG AGCTCTTCAT TACCTACAAA
GGTGGTGATA GCAATGATAT TGCGCTCTAC ACCCAATCAT TATCTGGTGT TCTTCCTGAG
AGCTTGCGTG AACCAAGAAT CATTGGTTCT GATGCTGATG ATTCCTTAAC TGGAACCTCT
GCAGATGAAG TGATCTTTGG TGGTAGTGGT GATGATGTTT TACTAGGAGG CGGTGGAGAT
GATCAAGTGA CTGGAGGCAA TGGCGATGAT CGGCTATATG GTGGTTTCGG TGATGACATT
CTCAAAGGTG ATCGAGGCGC TGATACCTAC AGGCTGAGTC GTGGTAATGA TGTGATCATC
GCCTTTTCAT TCGCTGAAAA CGATCGCATC TCTGTTGCTA ATGGAGTGGA CCTTTCCTTT
AAGCAAGTTG GTGATGATCT ATTGATCACA GCAGATGGCA TTCACACCAC CTTGAAGGAT
GTTGATAAGG GTGAGTTTCT CGCTGCTGAT GTGATTGACT TTATCTAG
 
Protein sequence
MGNQISPNPN RNDIHAKDGD FNDDIFENYG NIWVKNVILN NTYELHNNDG GTLHNFKSGT 
LNNIDGGLLF NYENSTLNNN GTLNNNSRLK NYSMLNNNSS GTLNIYGSGT LNIYLWNGGD
SLSSGTLNNN GTLNNIDGGL LFNYSWLATQ LSTLNNRGYL SNNESSTLRN YSATLNNSGR
LDNLGILINH KISTLDRTPP PCTFNNSGRL DNSGRLNNID GGEINNNDGG EINNNGTLNN
IDGGTLNNNS TLNNYGGGML NNNSTLINNV AGRLHNNSWL INNRAGMLNN KGTLINNSAG
KLHNSGTLNN SGIIKNSDNR LKEKGFINNG TYTGDGQIKG SWTDHGHVKP GSSAGGMLVD
GHYYKKGGST EIELGGIDDG DGDRTATEHD WIEITGNLEL AGELNVSLID EFKLSAGDFF
VITKVGGTLT GQYEGLDEGD SVGRFASDNG GTLELFITYK GGDSNDIALY TQSLSGVLPE
SLREPRIIGS DADDSLTGTS ADEVIFGGSG DDVLLGGGGD DQVTGGNGDD RLYGGFGDDI
LKGDRGADTY RLSRGNDVII AFSFAENDRI SVANGVDLSF KQVGDDLLIT ADGIHTTLKD
VDKGEFLAAD VIDFI