Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_05501 |
Symbol | |
ID | 4780324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 497581 |
End bp | 499110 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640083827 |
Product | carboxypeptidase Taq (M32) metallopeptidase |
Protein accession | YP_001014377 |
Protein GI | 124025261 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.30292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCAAAGT CTGCTTGGCA GCTTTTGGGT GATTACCTAA AAGATACGCA GTTGTTGGGA TCTATACAAA GCACTCTCTA CTGGGATCAA AATACATCTA TGCCTATTGC TGGTTCCAAT TGGAGAGGAG AGCAATTAAG TCTTTTAGCT AAGCAACTTC ATGCAAGACA AAGTTCTGAA CAGTTCGAAA TTTTAATAAA AGAAGCTAAA TCTGAACTTC AAAAATCAAA AGAAAAAGAT GATTTTGAAT CACAACTCAT CACAGATAGA TTTAGAAATA TTGATTTGCT TGAGCAAGAT TTCAATAGAC AGAAAAGTTT GGATCCTCAA TTGGTCGTTG AGCTCGCAAC AGCAAAGTCT GAAGGGTATA TGTGTTGGCA GGAAGCTAGG AAAAATAATG ATTTCAAAAG CTTTTCTCCA GCTCTTAAGA AATTAATTGC ATTACGAACA GAACAATCCA ATCAGCTCTG TGAAGAAAGA AGTTGCTGGG AGACACTTGC CCAGCCTTTT GAACCGAATT TAACGATTGA TCGTGTAAGC GAACTATTTG AACCTTTACA AAAGAGATTG CCAGAATTGA TTCAGAAGGC TGAGACAATT ACCAATAAAA AGAGTGAAAA ATGGGATTTA GCAATTAGTG ATCAAGAAAA ACTCTGTCAA ATACTTTTAA ATGATTGGTC TAGGGATCCT GCTAATACAG CGATAGCTAA GTCCCCTCAT CCATTCTCTA TAACTTTAGG TCCGGATGAT TATCGAATTA CGACTCGAAT AGTTAAAGGT CAGCCCCTTT CTTGCTTATT AGCTACTGCC CATGAGTGGG GTCATTCTCT TTATGAACAA GGTTTGCCTT CTAAAAGTCA CCAATGGTTT GCATGGCCGT TAGGTCAAGC AACCTCTATG GCTGTTCATG AGAGTCAATC TCTATTTTGG GAAAATAGGA TTGCTAGGAG CTTTTCATTT GCAAAGTCTT TTTGGCATCA TTTTGAGAAT GCAGGTGCTC CAATTCACTC TGGAGATGAT TTATGGATCA ATCTAAATCC ATTTACTCCG GGATTGAACC GAGTAGAGGC TGATGAACTC AGTTATGGCT TGCACATAAT GATTAGGACT GAATTGGAAA TTGATCTTCT CGAGAGAGGC CTTTCTGTGG AAGATCTGCC TAATGAATGG AATAAAAGGT ATTTGAACCT TTTAGGTGTG TCGCCTAAAA ATGATACTGA AGGATGTTTG CAAGATGTGC ACTGGAGTGA GGGGATGTTT GGTTATTTCC CTTCTTATTT GCTTGGTCAT CTTATTAGCG CTCAGTTGAC AAAAACTCTT GAAGAAGATT TAGGGAAAAT TGAAAATCTT ATTGAATCTA CGGAAATCAG TAAAATATTG GGTTGGCTTC GCAAAAATGT TCATCATTAT GGGAGAAGTT TAGATTCTGA GGAACTTGTA AGGAAGGTCT CTGGAGCAAA ATTATCACCA ACTTATTTTC TTGAATACTT AGATAATAAA CTTGAAAAGC TGTCTACAAT CTCTAAGTAA
|
Protein sequence | MSKSAWQLLG DYLKDTQLLG SIQSTLYWDQ NTSMPIAGSN WRGEQLSLLA KQLHARQSSE QFEILIKEAK SELQKSKEKD DFESQLITDR FRNIDLLEQD FNRQKSLDPQ LVVELATAKS EGYMCWQEAR KNNDFKSFSP ALKKLIALRT EQSNQLCEER SCWETLAQPF EPNLTIDRVS ELFEPLQKRL PELIQKAETI TNKKSEKWDL AISDQEKLCQ ILLNDWSRDP ANTAIAKSPH PFSITLGPDD YRITTRIVKG QPLSCLLATA HEWGHSLYEQ GLPSKSHQWF AWPLGQATSM AVHESQSLFW ENRIARSFSF AKSFWHHFEN AGAPIHSGDD LWINLNPFTP GLNRVEADEL SYGLHIMIRT ELEIDLLERG LSVEDLPNEW NKRYLNLLGV SPKNDTEGCL QDVHWSEGMF GYFPSYLLGH LISAQLTKTL EEDLGKIENL IESTEISKIL GWLRKNVHHY GRSLDSEELV RKVSGAKLSP TYFLEYLDNK LEKLSTISK
|
| |