Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_20651 |
Symbol | |
ID | 4779320 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1709997 |
End bp | 1711181 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085361 |
Product | zinc metallopeptidase |
Protein accession | YP_001015885 |
Protein GI | 124026770 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATT TAGGAAAAAA AATTGACGTT TTAACCAAGG ATATACTGCC TGATTTAATT CAATTACGTC GTCATCTCCA TGCTCATCCA GAGCTAAGTG GTCAGGAATA TCAAACTGCT GCTCTTGTTG CCGGGGAGCT TAGAAAATCA GGTTGGGAAG TAAAAGAAGC AGTTGGCAAA ACCGGAGTAG TAGCTGAAAT AGGTAAGAAA AGCGGACCTG TTGTTGGCTT ACGAGTTGAT ATGGATGCTT TGCCAATTGA GGAGAGAACA GGTTTAGAAT ATTCTTCTTC AATTCAAGGC CTGATGCATG CATGTGGCCA TGATCTCCAT ACTTGTATTG GTTTGGGGGT GGCTAAAGTA TTAGCAAAAA ACAAATTTAC AAATTCTCGA ATCCGAATCA TTTTTCAACC TGCTGAAGAG ATTGCTCAAG GTGCAAATTG GATGAGAGCT GAAAAGGTTC TTGAAGGTGT TCAAGCTCTC TTTGGTGTGC ATGTTTATCC AGATTTGTCA GTTGGCAAGA TTGGAATAAA AACTGGAACT TTTACAGCTG CCGCTGCTGA ATTGGAAATA GAGATTATTG GTGATGGAGG GCATGGAGCT AGACCACATG AAGGCATAGA TTCAATTTGG ATTTCTGCAA AAGTTATTAG TGGACTTCAA GAGGCTATTA GTAGACGTTT AGATGCGCTT AAGCCTGTAG TTATTAGCTT TGGGAAGATT TCAGGAGGTA ATGCTTTCAA TGTAATTGCT GAGAGGGTTA AGCTTCTAGG TACAGTAAGG TGTCTTGATA ACAACCTTTA TGAAAAATTG CCTCAATGGA TTGAGAAAAT AGTACAAAAT ATAGCCTCTA CTCACGGAGG TAAGGCGAAC ATAAAATTTA AGTCGATCGC GCCCCCAGTT TATAACGATC CAGAGTTGAC TAGTTTGTTA TCTACCTGTG CGAAGAATTT TATGGATGAA GAAAATATTG TTTTTTTAGA AAATCCGTCA TTAGGAGCTG AAGATTTTGC TTTCTTCTTG CAAGATGTTC CAGGCACGAT GTTTAGATTA GGAGTGGCTG GCAATCAAGG TTGTGCTCCA TTGCACAGTG GAAACTTTTC TTTGGATGAA AGAAGCCTAG AATTAGGAAT AAAAATTTTG TCTCAAACGT TAGTCATGGC ATCTAAAACC CTTCAAGACA TTTAG
|
Protein sequence | MKDLGKKIDV LTKDILPDLI QLRRHLHAHP ELSGQEYQTA ALVAGELRKS GWEVKEAVGK TGVVAEIGKK SGPVVGLRVD MDALPIEERT GLEYSSSIQG LMHACGHDLH TCIGLGVAKV LAKNKFTNSR IRIIFQPAEE IAQGANWMRA EKVLEGVQAL FGVHVYPDLS VGKIGIKTGT FTAAAAELEI EIIGDGGHGA RPHEGIDSIW ISAKVISGLQ EAISRRLDAL KPVVISFGKI SGGNAFNVIA ERVKLLGTVR CLDNNLYEKL PQWIEKIVQN IASTHGGKAN IKFKSIAPPV YNDPELTSLL STCAKNFMDE ENIVFLENPS LGAEDFAFFL QDVPGTMFRL GVAGNQGCAP LHSGNFSLDE RSLELGIKIL SQTLVMASKT LQDI
|
| |