Gene P9303_21071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21071 
Symbol 
ID4776903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1865532 
End bp1868738 
Gene Length3207 bp 
Protein Length1068 aa 
Translation table11 
GC content52% 
IMG OID640087615 
ProductMrsD-like protein 
Protein accessionYP_001018107 
Protein GI124023800 
COG category[V] Defense mechanisms 
COG ID[COG4403] Lantibiotic modifying enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.365739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAAGTC CAACATCTTG GAAAACCAGT TGGTTAGCAG CTATTGCTCC CGATGAGCCC 
CACAAATTCG ATCGGCGCCT TGAATGGGAT GAGCTATCGG AGGAAAACTT CTTTGCTGCG
CTGAACAGTG CGCCAACTTC CCTCGAGGAG GATGACCCTT GTTTTGATGA GGCTCTACAA
GATGCTCTTG AAGCTTTAAA GGCCGCCTGG GACTTACCCC TGTTACCAGT CGACAACACT
GTTAATAGAC CTTTTGTTGA TCTTTGGTGG CCGATTCGTT GCCACTCAGC AGAGAGTCTT
AGACAAATCT TCGTTTCTGA TAGCGCTGGC CTTGCTGATG AGATTTTTGA TCAATTGGCA
GACAGTCTGC TTGATCGACT CTGCGCGCTT GGCGATCAAG TGTTATGGGA AGCCTTCAAT
AAGGAACGCA CACCAGGCAC GATGCTGCTT GCACATCTGG GTGCTGCTGG AGATGGATCA
GGGCCACCAG TGCGTGAACA CTACGAGCGA TTCATCCAAT CCCATCGCCG CAATGGCTTA
GCACCATTGC TTAAAGAATT TCCTGTACTT GGGCGCCTAA TCGACACCGT TCTTTCGCTT
TGGTTTCAAG GCAGTGTGGA GATGTTGCAA CGCATCTGTG CCGATCGCAC AGTCCTTCAG
CAAGGCTTTG CGATTCCCTG CGGCCATCAT CTCAAAACCG TCAAGCAGGG CTTGAGTGAT
CCCCACCGCG GTGGTCGTGC AGTCGCTGTT TTGGAATTTG CTGATCCCAA TTCGACTGCC
AACTCCTCCA TGCATGTGGT CTACAAGCCC AAAGACATGG CTGTAGACGC GGCTTATCAG
GCCACCTTGG CTGATCTCAA TGCCCACAGT GATCTATCAC CGTTGCGCAC TCTTGCCATC
CACAACGGCG ACGGCTATGG CTATATGGAG CATGTCGTCC ATCACTTTTG CGCTAATGAC
AAAGAACTAA CCAATTTCTA TTTCAATGCC GGTCGACTAA CAGCCTTACT GCATCTACTC
GGTTGCACTG ACTGCCATCA CGAAAATCTA ATTGCCTGCG GTGATCAGTT GCTTCTGATC
GATACCGAAA CATTGTTGGA AGCTGATCTT CCTGATCACA TCAGCGACGC TTCCTCTACG
ACAGCTCAAC CCGAACCTTC CACTCTGCAA AGACAGTTTC AGCGTTCTGT TTTGCGTTCC
GGTTTACTCC CGCAATGGAT GTTTCTTGGA GAATCAAAAC AGGCAATTGA CATCAGCGCT
CTTGGCATTT CGCCACCGAA TAAACCCGAA AGAATTGCTG CTGGCTGGCT TGGGTTTAAC
AGTGACGGGA TGATGCCAGG TCGTGTTAGC CAGCCTGTTG AAATTCCAAC AAGTTTGCCA
GTGGGAATCG CTGATGTTAA TCCTTTTGAT CGTTTTCTTG AAGACTTCTG CGATGGCTTC
TCAATGCAAA GTAAAGCGCT CATCAAACTC CGCAATCGCT GGTTGGATTT AAATGGAATA
TTGGCTCACT TTGCAGGATT GCCACGCCGG ATTGTGTTGC GAGCCACGCG TGTGTATTTC
AACATTCAAC GTCAGCAGTT AGAGCCAACT GCTTTGCGTT CTTCGCTTGC ACAGGCGCTC
AAACTCGAGC AGCTAGCACG TAGTTTTCTG CTCGCCGAAT CCAAGCCTCT TCACTGGCCA
ATTTTTGCAG CAGAAGTAAA GCAGATGCAG CAATTGGATA TTCCTTTCTT CACTCATCTC
ATTGATGCTG ATGCCCTCCA GCTCGGCGGA TTGGAACAAG AATTACCGGG CTTTATCCAA
ACCAGCGGTT TAGCGGCGGC TTATGAGCGA TTGCGAAATC TAGATAGCAA TGAGATTGCT
TTTCAACTGC GCCTGATTCA TGGAGCCGTA GAAGCGCGTG AGCTGCACAC AACCCCCGAG
AGTTCACCAA CCCTTCACCC CCCCGCGACA CCAGAAGCGC TGATGTCAGC ATCGGCTGAG
ACGAGTCTGA AGGCAGCAAA GCGGATCGCC CATCGTCTTT TGGAATTGGC GATTCGCGAC
TCGCAGGGGC AGGTGGAGTG GCTGGGTATG GACCTGGGCG CTGATGGCGA GAGCTTTGCC
TTTGGGCCCG TTGGTCTATC GCTTTATGGT GGATCCATAG GAATTTCCCA CTTGCTGCAG
CGCTTGCAAG CACAGCAGGT TCCATTGATG GATGCCGATG CCATTCAGAC TGCAATCCTG
CAACCCCTGG TTGGTCTTGT TGCTCAACCC AGCGACGATG GCCGTCGGCG CTGGTGGCGT
GATCAGCCAC TTGGACTGAG TGGCTGTGGT GGAACCCTTT TGGCTCTTGC TCTGCAAGGC
GAGCAGGCGA TGGCCAATAG TCTGCTCGCG GCGGCTCTAC CGCGTTTCAT CGAGGCTGAT
CAGCAACTGG ATTTGATTGG TGGTTGCGCA GGCTTGATTG GATCGCTGGT GCAGCTGGGG
ACGGAATCAG CTCTGCAGTG GGCTTTGCGT GCTGGTGACC ATCTCATTGC TCAACAAAAT
GAAGAAGGTG CCTGGAGCTC ATCGTCAAGC CAGCCAGCAC TTTTGGGCTT CTCCCATGGT
GTAGCTGGTT ATGCGGCAGC TCTTGCTCAC CTACATGCTT TCTCCGGTGA CGAGCGCTAC
CGCATAGCTG CCGCAGCTGC ACTGGCTTAT GAGCGGGCTC GATTCAATAA AGACGCGGGC
AATTGGCCTG ATTACCGCAG TATTAGTAGA GACTCAGATT CTGATGAACC AAGCTTCATG
GCTAGCTGGT GTCATGGTGC CCCGGGTATT GCCCTTGGCC GAGCCTGTCT GTGGGGTACG
GCTCTTTGGG ATGAAGAATG CACCAAGGAG ATAGGGATTG GCTTGCAAAC CACAGCAGCT
GTCAGTGCTG TTTCGACGGA TCATCTTTGC TGCGGATCAC TGGGTTTGAT GGTCTTGCTA
GAGATGCTTT CTAAAGGCCC TTGGCCGATT GACAACCAGC TCAGATCTCA TTGCCAAGAC
GTGGCTTCTC AGTACCGCCT GCAAGCTTTA CAACGCTGCT CAGCTGAACC AATCAAGCTG
CGATGCTTCG GCACAAAAGA AGGATTACTT GTACTGCCTG GCTTCTTCAC AGGCTTAAGC
GGGATGGGAT TAGCGCTGCT CGAGGATGAT CCATCACGAG CTGTCGTATC TCAACTGATC
AGCGCTGGGC TGTGGCCTAC TGAATAA
 
Protein sequence
MTSPTSWKTS WLAAIAPDEP HKFDRRLEWD ELSEENFFAA LNSAPTSLEE DDPCFDEALQ 
DALEALKAAW DLPLLPVDNT VNRPFVDLWW PIRCHSAESL RQIFVSDSAG LADEIFDQLA
DSLLDRLCAL GDQVLWEAFN KERTPGTMLL AHLGAAGDGS GPPVREHYER FIQSHRRNGL
APLLKEFPVL GRLIDTVLSL WFQGSVEMLQ RICADRTVLQ QGFAIPCGHH LKTVKQGLSD
PHRGGRAVAV LEFADPNSTA NSSMHVVYKP KDMAVDAAYQ ATLADLNAHS DLSPLRTLAI
HNGDGYGYME HVVHHFCAND KELTNFYFNA GRLTALLHLL GCTDCHHENL IACGDQLLLI
DTETLLEADL PDHISDASST TAQPEPSTLQ RQFQRSVLRS GLLPQWMFLG ESKQAIDISA
LGISPPNKPE RIAAGWLGFN SDGMMPGRVS QPVEIPTSLP VGIADVNPFD RFLEDFCDGF
SMQSKALIKL RNRWLDLNGI LAHFAGLPRR IVLRATRVYF NIQRQQLEPT ALRSSLAQAL
KLEQLARSFL LAESKPLHWP IFAAEVKQMQ QLDIPFFTHL IDADALQLGG LEQELPGFIQ
TSGLAAAYER LRNLDSNEIA FQLRLIHGAV EARELHTTPE SSPTLHPPAT PEALMSASAE
TSLKAAKRIA HRLLELAIRD SQGQVEWLGM DLGADGESFA FGPVGLSLYG GSIGISHLLQ
RLQAQQVPLM DADAIQTAIL QPLVGLVAQP SDDGRRRWWR DQPLGLSGCG GTLLALALQG
EQAMANSLLA AALPRFIEAD QQLDLIGGCA GLIGSLVQLG TESALQWALR AGDHLIAQQN
EEGAWSSSSS QPALLGFSHG VAGYAAALAH LHAFSGDERY RIAAAAALAY ERARFNKDAG
NWPDYRSISR DSDSDEPSFM ASWCHGAPGI ALGRACLWGT ALWDEECTKE IGIGLQTTAA
VSAVSTDHLC CGSLGLMVLL EMLSKGPWPI DNQLRSHCQD VASQYRLQAL QRCSAEPIKL
RCFGTKEGLL VLPGFFTGLS GMGLALLEDD PSRAVVSQLI SAGLWPTE