Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18051 |
Symbol | |
ID | 4911555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1527958 |
End bp | 1529136 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640161409 |
Product | zinc metallopeptidase |
Protein accession | YP_001092029 |
Protein GI | 126697143 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.309423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAG ATCAGTTTCA TAAAAAAATT GATTCGTTTA CTGATGAATT AATTCATTTA AGAAGACATA TCCATGCACA TCCGGAATTA AGTGGACTTG AAAATCAAAC AGCAATTTTG ATCAGTGGTT TTTTAAAAAA TATAGGTTGG AATGTTAGAG AATCTATAGG CAGGACTGGA GTTATCGCTG ATTTTGGGCC CGTAGAAAAC GGTATTATAG GCTTAAGAGT GGATATGGAT GCTTTGCCAA TATTTGAGGA AACTAAACTA AGTTTTTCTT CAAAAGTAGA TGGTGTTATG CATGCCTGTG GTCACGATTT GCACATCTCG ATTGGATTGG GTGTGGCAAA AATTATTAAG GATTTAAAAC TAAATTTTGG GACTCGGATA ATTTTTCAGC CAGCTGAAGA AATTGCAAGT GGAGCTAGAT GGATGATTAA AGATGGTGCA ACTAATGGTT TAACCAATAT TTTGGGAGTT CATGTCTACC CCGATTTATC TGTAGGGACT ATTGGCATCA AAGAGGGAAG TTTAACTGCA GCTGCTGCAG AACTTAATGT TGAGATTAAA GGAAAATCAG GCCATGGTGC TCGACCTCAT GAAGGGGTTG ATGCCATTTG GGTAGCCTCT AAAGTTGTCT CGGGAATTCA AGAATCAATA ACACGGAAGT TAGACCCTTT AGATCCTATA GTAATAACTT TTGGGAAGAT AAATGGTGGT AATGCATTCA ATGTTCTTGC AGAGAAGGTT AATTTAGTTG GTACAGTTAG ATGTACTAAT CGTAAAGTAT TTACGAATAT TGGTAATTGG CTAAATGAAA ATATCACTTC TTTAGCCAAT GGTTGCGGAG CTGAAGCAAA AGTAAGATTT AGAGAAATCA CTCCGGCAGT TAATAATAAT TCTGAAATTA ATAGAGTCCT CAGAGATTCG GGAATTAAGG TTTTGGGTCA AGAAAATGTT ATCGAATTAC AAAAACCATC ATTAGGAGCG GAGGATTTTG CTGAATTCTT AAATGATATT CCAGGAGCCA TGTTTAGGCT TGGCGTTTCT AATTCAAATG GATGTGCTCC TCTTCATAGT TCTAAATTTA ATCCGGATGA AAGAGCGATT GCTGTTGGAA TTAAAGTCAT AACAGAATCC ATAGTAAAAT TAAACAATGA AAAAATTAAT ACTATTTGA
|
Protein sequence | MNRDQFHKKI DSFTDELIHL RRHIHAHPEL SGLENQTAIL ISGFLKNIGW NVRESIGRTG VIADFGPVEN GIIGLRVDMD ALPIFEETKL SFSSKVDGVM HACGHDLHIS IGLGVAKIIK DLKLNFGTRI IFQPAEEIAS GARWMIKDGA TNGLTNILGV HVYPDLSVGT IGIKEGSLTA AAAELNVEIK GKSGHGARPH EGVDAIWVAS KVVSGIQESI TRKLDPLDPI VITFGKINGG NAFNVLAEKV NLVGTVRCTN RKVFTNIGNW LNENITSLAN GCGAEAKVRF REITPAVNNN SEINRVLRDS GIKVLGQENV IELQKPSLGA EDFAEFLNDI PGAMFRLGVS NSNGCAPLHS SKFNPDERAI AVGIKVITES IVKLNNEKIN TI
|
| |