Gene Information |
Locus tag | A9601_18231 |
Symbol | |
ID | 4718560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1555974 |
End bp | 1557158 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640079556 |
Product | zinc metallopeptidase |
Protein accession | YP_001010213 |
Protein GI | 123969355 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
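The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1555974
END_BP = 1557158


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields in the record.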
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.852941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGAG ATCAGTTACA TAAAAAAATT GATTCGTTTA ATGATGAACT AATTAACTTA AGAAGACATA TCCATGAACA TCCGGAATTA AGTGGACTTG AAAATCAAAC AGCGATCTTG ATCAGTGGTT TTTTAAAAGA TATTGGTTGG AATGTTAGAG AATCTATAGG TAGGACTGGA GTTATAGCTG ATTTTGGACC TCTAGATAAA GGTATTATAG GTTTAAGAGT GGATATGGAT GCATTGCCAA TATTTGAGGA AACTAAATTA AGTTTTTCTT CAAAAGTAGA TGGTGTTATG CATGCCTGTG GTCACGATTT GCATATCTCG ATTGGATTGG GTGTCGCAAA AATTATTAAG GATTTAAAAC TAAATTTTGG GACTCGGATA ATTTTTCAGC CTGCTGAAGA AATTGCAAGT GGAGCTAGAT GGATGATTAA GGATGGTGCA ACTAATGGTT TAACCCATAT TTTTGGAGTT CATGTCTACC CCGATTTATC CGTAGGGACT ATTGGCATTA AAGAGGGAAG TTTAACTGCA GCTGCTGGAG AACTTAATGT TGAGATTAAA GGAAAATCAG GCCATGGTGC TCGACCTCAT GAAGGAGTTG ATGCTATTTG GGCTGCCTCA AAAGTTATCT CGGGAATTCA AGAATCAATA ACACGGAAGT TAGACCCTCT AGATCCAGTA GTAATAACTT TTGGGAAAAT AAATGGTGGC AATGCATTCA ATGTTCTCGC AGAAAAGGTT AATTTAATTG GTACTGTTAG ATGCACTAAT CGTAAAGTAT TTATGAATAT TGGTAATTGG CTCAATGAAA ATATCACTTC TTTAGCTAAT AGTTGCGGAG CTGAAGCAAA AGTAATATTT AGAGAAATTA CTTCGGCAGT TAATAATAAT CCTGAAATGA ATAGAGTCCT CAGAGACTCG GGAATTAAGG TTTTGGGTCA AGAAAATGTA ATCGAATTAC AAAAACCATC ATTAGGAGCT GAGGATTTCG CTGAATTCTT AAATGAAATT CCTGGAGCTA TGTTTAGGCT TGGGGTTTCT AGTTCCGATG GATGTGCTCC TCTTCATAGT TCTAAATTTG ATCCGGATGA AAGAGCTATT GCTGTTGGAA TTAAAGTGAT AACAGAATCC ATAGTAAAAT TAAATAATGA AATAATTAAT ACTATTGGCA AATGA
Protein sequence | MNRDQLHKKI DSFNDELINL RRHIHEHPEL SGLENQTAIL ISGFLKDIGW NVRESIGRTG VIADFGPLDK GIIGLRVDMD ALPIFEETKL SFSSKVDGVM HACGHDLHIS IGLGVAKIIK DLKLNFGTRI IFQPAEEIAS GARWMIKDGA TNGLTHIFGV HVYPDLSVGT IGIKEGSLTA AAGELNVEIK GKSGHGARPH EGVDAIWAAS KVISGIQESI TRKLDPLDPV VITFGKINGG NAFNVLAEKV NLIGTVRCTN RKVFMNIGNW LNENITSLAN SCGAEAKVIF REITSAVNNN PEMNRVLRDS GIKVLGQENV IELQKPSLGA EDFAEFLNEI PGAMFRLGVS SSDGCAPLHS SKFDPDERAI AVGIKVITES IVKLNNEIIN TIGK
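The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence with the bacterial genetic code (translation table 11, as listed in the record). It assumes the gene sequence is already shown in coding orientation (it begins with ATG although the gene lies on the minus strand) and that the first codon is ATG, so no alternative-start handling is needed. The helper names `clean`, `gc_content`, and `translate` are illustrative, and only the first 30 bases of the gene are used as input here.

```python
from itertools import product

# Amino acids for translation table 11, with codons ordered over T, C, A, G
# at each position (the ordering used in the NCBI genetic code listing).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
GENE_PREFIX = "ATGAATAGAG ATCAGTTACA TAAAAAAATT"
print(translate(GENE_PREFIX))  # MNRDQLHKKI
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.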