Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03541 |
Symbol | |
ID | 4780061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 326238 |
End bp | 327506 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083621 |
Product | hemolysin-like protein |
Protein accession | YP_001014183 |
Protein GI | 124025067 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTAT TACTTCTTTG TATATTGCTT GTAATACCAG CATTTTTCAA TGCAGGTCAA TTTGCCATAT TGCGACTTCG TTCAACTAAG GTTCAAAGAC TTGTAGAAGA TGGACTACCT GGTTCCAATT CCATAATTCG TCTGCAGAAA AGGCTGAGAA GAACCCTTTT AATAGCTGAG TTAGGAATAA CTATTTCATT GATTTCAATT GGTTGGATCT GCAAAAATTT TGCAAGTCAA TGGTGGGGAA ATAATGCTTC AATTAACTAT TTGTTAGATC TAGGTCTTTT TATTACTGTT GTACTTTTAG CTACTCTGAT ATCTGGTCTG CTCCCAAAAG CACTTGTTCT TAATCAACCA GAGACTTCTG CCCTGAAATT ATCGCCTCTA ATTGAAGCAG CGATAAAATG GATGTCTCCA TTCCTTTCTT TGCTTGAAGG ACTTGCTTTA TTAATTCTTA GATTAGTTGG ACTTAATACT CAATCAGAAA GTCTTACTAC AGCTGCATTC TCTGCAGGGG AATTAGAAAA ATTAATTGAA ACTGGTGGCG TAACTGGATT AAAACCTGAT GAAAGAAACA TACTTGAAGG TGTTTTTGCT TTAAGAGATA CACAAGTCAG AGAAGTAATG GTTCCCAGGT CTGGAATGGT TACTCTTCCA AGAGAAGTTT CCTTCACTCA AATGATGGAA GAAGTTCATA AAACTCGTCA TGCCAGATAC TTAGTAATTG ATGATTCACT TGATAATGTT CTTGGAGTTT TAGATTTAAG GCAACTTGCT GATCCAATAG CTAAAGGGGC AATGCAAGCT AACTCCTCTT TGGAGCCATA TATAAAGCCA GTTGTTCGTG TTTTAGAAAC TTCTACTTTG GCCGAATTGC TACCCCTAAT AAAAAATGGG AATCCACTAC TTTTGGTCGT TGATGAATAT GGAGGAACTG AAGGATTAAT AACATCAGCA GACTTAACAG GTGAAATTGT TGGTGACGAG ATTCAATTTG ATAATAAAGA ATCTGAGTTA AGATCTCTTG ATGACTTAAA AAAAATCTGG CTTACTTCTG GCGAAATAGA AGTAATAGAA CTAAATAGAG AGCTTAACTT AAAATTGCCA GAAGCTGATG ATCATTACAC ACTTGCTGGA TTTGTTTTAG AAAAACTCCA AGAAATTCCA AGCTCAGGAG AAACGTTCAT TCATAATGAA ATTGTATTTG AAATTATCTC CATGAAAGGT CCAAGAATCA ACAAAGTAAA AATAATCCTT CCTAAGTAA
|
Protein sequence | MSLLLLCILL VIPAFFNAGQ FAILRLRSTK VQRLVEDGLP GSNSIIRLQK RLRRTLLIAE LGITISLISI GWICKNFASQ WWGNNASINY LLDLGLFITV VLLATLISGL LPKALVLNQP ETSALKLSPL IEAAIKWMSP FLSLLEGLAL LILRLVGLNT QSESLTTAAF SAGELEKLIE TGGVTGLKPD ERNILEGVFA LRDTQVREVM VPRSGMVTLP REVSFTQMME EVHKTRHARY LVIDDSLDNV LGVLDLRQLA DPIAKGAMQA NSSLEPYIKP VVRVLETSTL AELLPLIKNG NPLLLVVDEY GGTEGLITSA DLTGEIVGDE IQFDNKESEL RSLDDLKKIW LTSGEIEVIE LNRELNLKLP EADDHYTLAG FVLEKLQEIP SSGETFIHNE IVFEIISMKG PRINKVKIIL PK
|
| |