Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_05931 |
Symbol | |
ID | 4779892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 538982 |
End bp | 540199 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640083870 |
Product | Zn-dependent proteases |
Protein accession | YP_001014420 |
Protein GI | 124025304 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.111882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA GAGGTATCCC TTTGAGGATC CATCCAAGTT GGTTTTTGGT TTACTTGTAT TTTACTTTGT CATCTAAAGA TCAGTTCGAG ACGCTTTTGA ATGGTCAAGC AACTATTTGG AATGGATGGG TGATTGGTGC TTTTACCTCT TCTCTTTTGT TTTTATCTGT TTTATTGCAT GAATTGGCTC ATTCTTTTGT AGCAATTGGA GAGGGTCTAA AAGTTAGAGA CATAACACTT TTTTTTCTTG GAGGTATGGC AAGTCTTGAA AAGGAATGTC CGACTTCAAA AGGAAGTTTA AAAATTGCCA TTTCAGGTCC TGTTGTTAGT CTTTTATTAG CTTTTTTAAT GATTTTATTA AGTAATAATT TATCAGTATC GAATTTTATT CTCTCTAATT TATTTAAGCA GGTTGGAAGC CTCAACCTTT TAATAGGTGT ATTTAATTTA CTTCCGATAA TTCCTCTAGA TGGTGGCGTA ATATTAAAAT CTTTAATTTG GTACTTTACA GGGAGTAAAA GAGCAGGGAT TAAAGTTGCT ATTGGCTCTG CAAGATTAAT TTCTTTTCTT GCTATTTTTA TTGGCTTTTT AAGTTTGGTT AGGGGTAACT TATATCTTGC CATTTGCTTT TCTATTATTG GTTTATTTGT TTTTTCTTCA TCTAAATCAC AGAGCCAAAT TATTCAAATA CAAAAGATAT TATCTGAATC ATATGTTAAT CAGGTTTGTA GTCGTTCATT TAGGGTTCTA GAGGATGATT TGCCTGTGAA AGTTTTATCT AAATATAGTT CATTTAATAA AGATAATTTT TTCAATGAAG TATGGATCCT TTTGTGTAGA GAAGGGAGAT GGGTCGGTTA TGTGAATGAA AAAATCTTGA AGAATATTTC TGTACAAAAC TGGGATAAAA AATTTCTTTA TGAATTCTCA CAACCAATAA ATGAATTGCC ATCTATTAGT GAAAAAGAAT CATTATGGAA AGCAATATTA AAAATAGAAA AAACAAAAGA TGGAAGGCTA CTTGTACTAT CATTTTCTGG TCTTCCTCTT GGAACTTTAG ATAGAGTAGA TATAGGTAAA GCAGTACTTA AAAAAATCGG ATTAAACCTT CCAGACCAAT TAATTAAAAT TGCAAGAAAA GATAATATTT ATCCACTAGG ATTAAATCTA CTTAATATTG CACAATCAAT GGATTCAAGT GACTTGCTAG AGGACTAA
|
Protein sequence | MKIRGIPLRI HPSWFLVYLY FTLSSKDQFE TLLNGQATIW NGWVIGAFTS SLLFLSVLLH ELAHSFVAIG EGLKVRDITL FFLGGMASLE KECPTSKGSL KIAISGPVVS LLLAFLMILL SNNLSVSNFI LSNLFKQVGS LNLLIGVFNL LPIIPLDGGV ILKSLIWYFT GSKRAGIKVA IGSARLISFL AIFIGFLSLV RGNLYLAICF SIIGLFVFSS SKSQSQIIQI QKILSESYVN QVCSRSFRVL EDDLPVKVLS KYSSFNKDNF FNEVWILLCR EGRWVGYVNE KILKNISVQN WDKKFLYEFS QPINELPSIS EKESLWKAIL KIEKTKDGRL LVLSFSGLPL GTLDRVDIGK AVLKKIGLNL PDQLIKIARK DNIYPLGLNL LNIAQSMDSS DLLED
|
| |