Gene P9303_03421 details


Gene Information

Locus tag: P9303_03421
Symbol:
ID: 4777514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 350160
End bp: 351416
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 57%
IMG OID: 640085845
Product: hypothetical protein
Protein accession: YP_001016359
Protein GI: 124022052
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
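
The coordinate and length fields in this record agree with one another, and the arithmetic can be sketched as below. This is a minimal illustration, not part of the source record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(350160, 351416)
print(length_bp)                  # 1257
print(protein_length(length_bp))  # 418
```

Both values match the Gene Length and Protein Length fields reported for this gene.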


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.634587
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCA GCATGTTGAA AGCCCTCTCG CTCACCGTTG CTGTGGGAAC AGCCCTGCTC 
ATGAGTGGAG GCCCCAGCCA CAGCCAAACA TCACCTGAGC TCACCAAAAA GGTGCTGCAG
ATGGAGCGCG ATCGAGAACA AGAATTTGAA AACTACTTCG GTGAAGACCT GGCATCCGTC
AGCAAAACGG CTGATGAAAC TGCCGCCGAA CTCGAACGGC TGAGTGCAGA AACCGGAACA
CGCTCAGCGC TGCTGTATGT GATTCCGCGG AAAAGCCACC TGCACCTGGT GCTGATTCCG
CCCAGTGGCA CTCCGATCGT CAAAGACTTC TATGAAGTCA CCGACCCCGA GCTGTTCGCG
GTCTCGCGCC GCTTTCACAA GGGCATCCTG CGGATGGATA CAACCCAAAG TCAAAGCGCA
GGCCAGCAGT TGTACGACTG GATCATCAAG CCGTATGAGC AGGAGCTGGC GGATGCAGAG
ATTGACCTAC TTCTGTTCTG CCTGGGTGAT GGCGTGAAGG ATCTGGCTTT GCCAGCCCTG
TTCAACAACG GCTCCTACCT GATCGAGAGC TATGCGATGG CGCGGATCCC CGCGTTCAAC
CTGATCGAGA CGACCTACAA ACCCTTTAAA AGCGGTCAGC TGTTGGCCAT GGGAGCCAGC
CAGTTTCAAG ATCCATCGAT TCCAACCTTG CCAGGCACAG CACAAGAAAT CGCAGCCCTC
AGCCAAAGCC TTGGGTCTGC AGGGCAAAGC ACATGGGGGG TAACACGGTT GGAGAACAGG
GCCTTCACGC AGAAGCGGAT CAACCAGAAC CTCTCCAAGA AGCCTTACAC CACGCTGCAC
GTGAGCACCC ATGCCCAGTT TCAGCCTGGC CAGGTGGAGG AGTCCTACAT CCAACTCTGG
GATCAGAAGC TGAAGCTGAA CGCTCTCAAT GCAATCGACT GGGACCAGTC CAAGGCAGAT
CTGATCGTGC TCAGCGCCTG TCAGACCGCT CTGGGAGACA CCGATGCCGC CAATGGATTT
GCCGGACTCG CGCTCAAAGC CGGGGTGCCC TCAGCCATCG GCACCCTTTG GTCGGTCAAC
GATCAATCGA CCACGGAGTT GATGACATCG TTCTACGGCG CACTGCCGGA CAGCCGCACC
AAAGCTCAGG CCCTGCAAAC GGCACAGATC ACTGCGATCC GACAACCATC GTCGTCAACG
TCGAGCGCTG CTCCCTACTA CTGGGCTGGC TTCAGCCTGA TCAGCACACC TTGGTGA
 
Protein sequence
MMRSMLKALS LTVAVGTALL MSGGPSHSQT SPELTKKVLQ MERDREQEFE NYFGEDLASV 
SKTADETAAE LERLSAETGT RSALLYVIPR KSHLHLVLIP PSGTPIVKDF YEVTDPELFA
VSRRFHKGIL RMDTTQSQSA GQQLYDWIIK PYEQELADAE IDLLLFCLGD GVKDLALPAL
FNNGSYLIES YAMARIPAFN LIETTYKPFK SGQLLAMGAS QFQDPSIPTL PGTAQEIAAL
SQSLGSAGQS TWGVTRLENR AFTQKRINQN LSKKPYTTLH VSTHAQFQPG QVEESYIQLW
DQKLKLNALN AIDWDQSKAD LIVLSACQTA LGDTDAANGF AGLALKAGVP SAIGTLWSVN
DQSTTELMTS FYGALPDSRT KAQALQTAQI TAIRQPSSST SSAAPYYWAG FSLISTPW
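
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do this and to compute a GC percentage of the kind reported in the GC content field. It is an illustration only: the helper names `clean_sequence` and `gc_percent` are hypothetical, and how the source database rounds its GC value is not stated. The example runs on the first row of the gene sequence only; paste the full block to check the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATGCGCA GCATGTTGAA AGCCCTCTCG CTCACCGTTG CTGTGGGAAC AGCCCTGCTC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this row only
```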