Gene P9303_24571 details


Gene Information

Locus tag: P9303_24571
Symbol:
ID: 4776123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2158503
End bp: 2159708
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 56%
IMG OID: 640087977
Product: hypothetical protein
Protein accession: YP_001018453
Protein GI: 124024146
COG category: [S] Function unknown
COG ID: [COG4372] Uncharacterized protein conserved in bacteria with the myosin-like domain
TIGRFAM ID:
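
The coordinate and length fields above are consistent with one another. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2158503, 2159708)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Both results match the Gene Length and Protein Length fields of this record.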


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCT GGCTGCTGAT CCTCTCCCTA CTCGTCCTCG GTGGTGTGCT CTCCACCCTC 
GGCGATCGAC TTGGCAGCCG TGTAGGCAAA GCAAGGCTGA GCCTGTTCAA TCTCAGACCG
CGAAGGACAG CCGTATTCAT CACCGTTCTC ACTGGCAGCC TGATTAGCGC CCTGTCCCTA
GGACTGATGC TGCTGGTCAG TCGACAACTG CGAGTAGGAC TTTTTGAACT GGATGACCTT
CAGGCCAAAC TCCAACAAAG CCGCGATGCC CTCAATTCAA GTCGGTCGGC CCAGTTCAAT
GCCGAGCTTG ATCTCAAGCA GGCCAGGTCG GACAGCAATC AGGTTCAGAA TGAACTGAAG
GAAGCCAAAA AGCGAGCTGC CGCCCTTCGC AACGAACTCG CACCACTACA AAAACAAAGA
CAACAGCTCG AGGCGGAACG AGCTCGTCTC AGTCGCGACA TTTCCAAAAA AGATGCCGAT
ATCCGCCGAA CGGAAGTTGA ACTAGCCAAT GTTCGCAGCA GAATTAGCTC GGCTGAAAAA
GAACTGAAGC AACTCGAGAC CAACCTGATT GCCTTACGTC GAGGGGATGT TGTGCTGAGC
AGTGGGCAGC AACTTGCCGC AGCCACCCTG CGGCTCGACA ACCCCAGCCA GGCGAAAGCC
GTCATCGACC GCCTACTTCA GGAAGCAAAC CTGGAGGCTT TTCGACGCGT ACGTCCCGGT
GAAGAAGCCA ATCGACAGAT CCTGCTCGTG CCACGCACCG ATATCAACCG CATCGAACAG
ATCATCCGAA AGCCAGGCAC CTGGGTCGTC TATGTGCGCT CTGCCGCAAA CGTGTTGCGT
GGGGAGAACG TGGTGTATGC CTTCCCGGAT GCTCGCCAAA ATATCAACAT CGTCCGACAA
GGCGAAGTCC TCGCACGAAC GACTCTTGAC CAAAACGAGA AGAGCAGCGA AACCGTGCGC
AACCGACTTA GCCTCCTGCT TGCATCAACT CTGGCAGAGG TAAAAAGACG CGGATCCCTC
AGTTCAGGAC TGCAGTTCGA TGGCAGTGAA ATGAACCGGC TCGGCAAGGC ATTACTGAAC
CGTTCTCAAG AGCGGATTGA GCTAGAAGCC GTGGCACTCC GCAACAGCGA TACAGCCGAT
CCGGTAGCAG TTGTCCTGCA GCCCGTGGGT GGTCCTTGGA CAAAGGTTCC CGAAGACAAA
CCATGA
 
Protein sequence
MTGWLLILSL LVLGGVLSTL GDRLGSRVGK ARLSLFNLRP RRTAVFITVL TGSLISALSL 
GLMLLVSRQL RVGLFELDDL QAKLQQSRDA LNSSRSAQFN AELDLKQARS DSNQVQNELK
EAKKRAAALR NELAPLQKQR QQLEAERARL SRDISKKDAD IRRTEVELAN VRSRISSAEK
ELKQLETNLI ALRRGDVVLS SGQQLAAATL RLDNPSQAKA VIDRLLQEAN LEAFRRVRPG
EEANRQILLV PRTDINRIEQ IIRKPGTWVV YVRSAANVLR GENVVYAFPD ARQNINIVRQ
GEVLARTTLD QNEKSSETVR NRLSLLLAST LAEVKRRGSL SSGLQFDGSE MNRLGKALLN
RSQERIELEA VALRNSDTAD PVAVVLQPVG GPWTKVPEDK P
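
The two sequences above can be spot-checked against each other with a small translation sketch in Python. It assumes that translation table 11 uses the standard codon-to-amino-acid assignments and allows alternative start codons such as GTG, which are read as methionine at the initiation position. The start-codon set below is a deliberately reduced subset, and all names (`translate`, `gc_content`, `START_CODONS`) are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a reduced set of bacterial start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG/TTG is read as methionine
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence (60 bases) and its final 15 bases.
first_60 = "GTGACCGGCT GGCTGCTGAT CCTCTCCCTA CTCGTCCTCG GTGGTGTGCT CTCCACCCTC"
last_15 = "GAAGACAAAC CATGA"

print(translate(first_60))  # MTGWLLILSLLVLGGVLSTL
print(translate(last_15))   # EDKP
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final four residues, with the closing TGA acting as the stop codon. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.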