Gene P9211_10181 details

Gene Information

Locus tag: P9211_10181
Symbol:
ID: 5731840
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 910844
End bp: 912094
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 38%
IMG OID: 641285385
Product: insulinase family protein
Protein accession: YP_001550903
Protein GI: 159903559
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
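
The coordinates and lengths in this record are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(910844, 912094)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1251 bp) and Protein Length (416 aa).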


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0999203
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAGC TAAGAATAGT TCTAGATCCC AGGGATACAG CGGGTATAAT GTCGGCAAAA 
TTATGGATTA AAGAAGGTAG TAGAGCAGAC CCTAAAAACA AGCAGGGTCT TCATTATCTT
TTAGGTTCAC TTCTTAGTAG GGGTTGTGGC CCCTACAACC GAATAGAGAT TGCTGATCTA
ATAGAGGGGT GCGGAGCTGC CTTACGTTGC GACACCTTTG AAGATGGTAT ATTGCTCAGT
CTTAAATGTA CAGAAAGAGA TCAATCAAAG CTATTACCAT TATTTGTGTG GATGGTTACA
GCGCCACATA TAGCATCAGA TCAAATGAGT TTAGAACGTG AGCTATCTAT ACAACTACTA
CAAAGACAAA AAGAAAGCGC CTTTCATGTA GCCTTTGATT GTTGGAGGAG AATTATTTAT
CGCAAGTCCC CTTATGAACA TGATCCATTA GGAACCCTTG CTGGTATTGC TGCTATTGGG
GAGGATGATC TAAAAAACTT ATCAACTAAA TTGTCTATAA GAGAACAGGT ATTAGTAATC
TCTGGTAGCT TCCCTAAGAA AATTGAATCA GATATTAAAA GCTTGTTCTC GTTTACTTCC
TCTAGCCTGA AAGATAGTGA CGTGAATATT ACTTCAGAAG GAAGTAAAGA ATTTAAGGTT
GACAATCATA AGCAAAGGCT GATACTTCGA CACCAAGAGA CAAGCCAGGT AATATTAATT
TTAGGGCAAA AGACTATTAG CCACTCTCAT CAAGATGACT TGGTCTTAAG ACTACTGAGT
TGTCATTTGG GTTCTGGTAT GTCCAGTCTA CTATTTAAAA AGTTTAGAGA GCAATATGCA
TTGGCATACG AAACTGGGGT TTACCATCCA ATTAGAGAAT ATGAAGCACC TTTTGCAATT
CATGTGGCAA CTACTCAAGA GAAGGCATTG CATTCTTTAC GTCTCCTCAA AAAATGCTGG
GAAATACAAT TGGAACAAAA GATATCAGAA GAAGAGTTAT TCTTAGCCAG AGCAAAGTTT
AAAGGTAATG TTGCCCATAA CTTGCAGACA GTTAGTCAAA GAGCCGAAAG GAAAGCCCAG
CTGTTAAGTT TTGGTATGAG CGATAACTAT GACAATGAAT GCTTCAAAAG AGTTGATACA
ATTTCTGCAG AAGAGATACA AACCACTGCA ATAAGATATC TCAGTAATCC ATTGTTAAGC
TTATGCGGAC CAAAGAAAAC ACTTGACATA TTGGCAAGTC ATTGGTGTTA G
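
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and recompute a GC fraction such as the GC content listed under Gene Information. The helper names are hypothetical, and the rounding used by the database is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record as an example input.
first_line = "ATGCAAAAGC TAAGAATAGT TCTAGATCCC AGGGATACAG CGGGTATAAT GTCGGCAAAA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_content(seq):.1%}")
```

To check the whole gene, paste all sequence lines into one string and pass it through `clean_sequence` first.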
 
Protein sequence
MQKLRIVLDP RDTAGIMSAK LWIKEGSRAD PKNKQGLHYL LGSLLSRGCG PYNRIEIADL 
IEGCGAALRC DTFEDGILLS LKCTERDQSK LLPLFVWMVT APHIASDQMS LERELSIQLL
QRQKESAFHV AFDCWRRIIY RKSPYEHDPL GTLAGIAAIG EDDLKNLSTK LSIREQVLVI
SGSFPKKIES DIKSLFSFTS SSLKDSDVNI TSEGSKEFKV DNHKQRLILR HQETSQVILI
LGQKTISHSH QDDLVLRLLS CHLGSGMSSL LFKKFREQYA LAYETGVYHP IREYEAPFAI
HVATTQEKAL HSLRLLKKCW EIQLEQKISE EELFLARAKF KGNVAHNLQT VSQRAERKAQ
LLSFGMSDNY DNECFKRVDT ISAEEIQTTA IRYLSNPLLS LCGPKKTLDI LASHWC
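
The protein sequence corresponds to the gene sequence translated with translation table 11, as listed under Gene Information. The sketch below is a minimal, dependency-free illustration of that translation. It assumes the sense-codon assignments of table 11 match the standard genetic code (the tables differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last twelve bases of the gene sequence above.
print(translate("ATGCAAAAGC TAAGAATAGT TCTAGATCCC"))
print(translate("CATTGGTGTTAG"))
```

The two example calls use the start and end of the gene sequence from this record, so their output can be compared against the first and last residues of the protein sequence.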