Gene P9301_18051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_18051 
Symbol 
ID4911555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1527958 
End bp1529136 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content36% 
IMG OID640161409 
Productzinc metallopeptidase 
Protein accessionYP_001092029 
Protein GI126697143 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.309423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAG ATCAGTTTCA TAAAAAAATT GATTCGTTTA CTGATGAATT AATTCATTTA 
AGAAGACATA TCCATGCACA TCCGGAATTA AGTGGACTTG AAAATCAAAC AGCAATTTTG
ATCAGTGGTT TTTTAAAAAA TATAGGTTGG AATGTTAGAG AATCTATAGG CAGGACTGGA
GTTATCGCTG ATTTTGGGCC CGTAGAAAAC GGTATTATAG GCTTAAGAGT GGATATGGAT
GCTTTGCCAA TATTTGAGGA AACTAAACTA AGTTTTTCTT CAAAAGTAGA TGGTGTTATG
CATGCCTGTG GTCACGATTT GCACATCTCG ATTGGATTGG GTGTGGCAAA AATTATTAAG
GATTTAAAAC TAAATTTTGG GACTCGGATA ATTTTTCAGC CAGCTGAAGA AATTGCAAGT
GGAGCTAGAT GGATGATTAA AGATGGTGCA ACTAATGGTT TAACCAATAT TTTGGGAGTT
CATGTCTACC CCGATTTATC TGTAGGGACT ATTGGCATCA AAGAGGGAAG TTTAACTGCA
GCTGCTGCAG AACTTAATGT TGAGATTAAA GGAAAATCAG GCCATGGTGC TCGACCTCAT
GAAGGGGTTG ATGCCATTTG GGTAGCCTCT AAAGTTGTCT CGGGAATTCA AGAATCAATA
ACACGGAAGT TAGACCCTTT AGATCCTATA GTAATAACTT TTGGGAAGAT AAATGGTGGT
AATGCATTCA ATGTTCTTGC AGAGAAGGTT AATTTAGTTG GTACAGTTAG ATGTACTAAT
CGTAAAGTAT TTACGAATAT TGGTAATTGG CTAAATGAAA ATATCACTTC TTTAGCCAAT
GGTTGCGGAG CTGAAGCAAA AGTAAGATTT AGAGAAATCA CTCCGGCAGT TAATAATAAT
TCTGAAATTA ATAGAGTCCT CAGAGATTCG GGAATTAAGG TTTTGGGTCA AGAAAATGTT
ATCGAATTAC AAAAACCATC ATTAGGAGCG GAGGATTTTG CTGAATTCTT AAATGATATT
CCAGGAGCCA TGTTTAGGCT TGGCGTTTCT AATTCAAATG GATGTGCTCC TCTTCATAGT
TCTAAATTTA ATCCGGATGA AAGAGCGATT GCTGTTGGAA TTAAAGTCAT AACAGAATCC
ATAGTAAAAT TAAACAATGA AAAAATTAAT ACTATTTGA
 
Protein sequence
MNRDQFHKKI DSFTDELIHL RRHIHAHPEL SGLENQTAIL ISGFLKNIGW NVRESIGRTG 
VIADFGPVEN GIIGLRVDMD ALPIFEETKL SFSSKVDGVM HACGHDLHIS IGLGVAKIIK
DLKLNFGTRI IFQPAEEIAS GARWMIKDGA TNGLTNILGV HVYPDLSVGT IGIKEGSLTA
AAAELNVEIK GKSGHGARPH EGVDAIWVAS KVVSGIQESI TRKLDPLDPI VITFGKINGG
NAFNVLAEKV NLVGTVRCTN RKVFTNIGNW LNENITSLAN GCGAEAKVRF REITPAVNNN
SEINRVLRDS GIKVLGQENV IELQKPSLGA EDFAEFLNDI PGAMFRLGVS NSNGCAPLHS
SKFNPDERAI AVGIKVITES IVKLNNEKIN TI