Gene P9303_16561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_16561 
Symbol 
ID4778526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1445036 
End bp1446403 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content53% 
IMG OID640087165 
Productinsulinase family protein 
Protein accessionYP_001017665 
Protein GI124023358 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG ACTCGACTTC TCGAGCAAAT GCTTCCATTG CTGCAACCAG AGCGCAGCTT 
CACCCTTGTT GCCAGACCGA TGGAAGCAGG CGAATGAATC CTCTTGATGT GGTTTTAGAT
CCAATCGCCG CACCGGGAGT TATTGCCGCC AAGCTCTGGG TTAGAGGCGG TAGTGGTGCT
GACCCAAAAG GGCAACGGGG AGTTCATCAA CTGCTCGGAG CCCTCTTGAC CAGGGGCTGT
GGACCTTATG ACCACCTTGC TCTAGCCGAT CTCGTTGAAG GCTGCGGGGC AGGTTTGCGC
TGCGATACCC ACGAAGACGG ATTGCTAATT AGCCTCAAAT GTGCAGATCG TGATGCCGAA
CGACTCCTTG ATTTACTTGG CTGGATGCTG ATCGATCCGC ATCTGGATTC AAGTCAAGTA
ACGCTGGAAA GGGATCTCAG TCTTCAGGCC TTGCAAAGAC AAAGAGAAGA CCCATTTCAC
TTGGCTTATG ACGGTTGGCG GCATATGGCT TATGGCAGTG GCCCCTACGG CCACGATCCC
CTTGGCCTTA GCGAGGACCT GAACCAACTT GGTCGTCAGC AATTAATTTC CTTAATCGAC
GGGCTAACAG CACAATCACC TGTGCTTGCC CTCGCTGGGA CCCTTCCAGA GGATCTTGAA
CAGCGGCTGG AGGCAATGGA ATCTTTCCAG CGCTGGCCCA ATCAGCCACC TCAGCAAGCG
AGAAAGTCTG AATCAAGCAA GATCTCAACA GAGAACATTC AGATCGAATC CAACATTTGT
CTTCAGCCTG AACCTACAAG TCAGGTGGTC ATGATGCTTG GACAGCCAAC CCTTGCTCAT
GGCCATGAAG ACGATCTGGC ACTGCGTCTA CTGAACTGCC ACCTGGGATT AGGCATGTCG
AGCTTGCTGT TCAGGCGTCT ACGAGAGCAA CACGGGGTGG CCTACGACGT AGGCACTCAT
CACCCGGTAC GTAAGTGTGC CGCTCCATTT GTATTACATG CTTCGACAAG CGAAGACAAG
GCAAAACTCA CCCTTCAGTT GCTTCTAGAC AGCTGGTGGG AACTCAGCCA GCAAGCGATA
TCAGAAGAAG ACATTGAACT GGCACGCGCA AAATTCCATG GTCAACTCGC CCATGGAGCT
CAAACCACTG GACAACGGGC AGAACGCCGA GCCCAATTAC GGGGACTAGG GCTGCCAGGC
AACTATGACG AGCACAGCTT GGAGACAATC AAAAATCTTG ATGGAAGCGC TCTGCAAAAG
GCAGCTCAAC GACATCTAAA AATGCCCTTG CTAAGTCTCT GTGGCCCAGA AACCAGCCTT
CAAATCCTTG CCAAGGACTG GCAACAGCAA GTGGTTCAAA GCTCTTAA
 
Protein sequence
MDADSTSRAN ASIAATRAQL HPCCQTDGSR RMNPLDVVLD PIAAPGVIAA KLWVRGGSGA 
DPKGQRGVHQ LLGALLTRGC GPYDHLALAD LVEGCGAGLR CDTHEDGLLI SLKCADRDAE
RLLDLLGWML IDPHLDSSQV TLERDLSLQA LQRQREDPFH LAYDGWRHMA YGSGPYGHDP
LGLSEDLNQL GRQQLISLID GLTAQSPVLA LAGTLPEDLE QRLEAMESFQ RWPNQPPQQA
RKSESSKIST ENIQIESNIC LQPEPTSQVV MMLGQPTLAH GHEDDLALRL LNCHLGLGMS
SLLFRRLREQ HGVAYDVGTH HPVRKCAAPF VLHASTSEDK AKLTLQLLLD SWWELSQQAI
SEEDIELARA KFHGQLAHGA QTTGQRAERR AQLRGLGLPG NYDEHSLETI KNLDGSALQK
AAQRHLKMPL LSLCGPETSL QILAKDWQQQ VVQSS