Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03601 |
Symbol | |
ID | 4779660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 332724 |
End bp | 334217 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083628 |
Product | retinal pigment epithelial membrane protein |
Protein accession | YP_001014189 |
Protein GI | 124025073 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTTA GTTATTTAAA AAGAGAAACA TCACCAAAGC AAACAATCTT CAACAAAGAA GATTGGTCCA GTGCATATTG CAATGTTGAA AAAGAATTAG ATCACGTTCA ACTCAAGCTT GTAAAAGGAT CTATTCCTGA ACAAATTTCT GGTACCTTTT ATCGAAACGG GCCAGGTAGA TTAGAAAGAG GGGGAAGATG GGTCCATCAT CCATTTGATG GAGATGGCAT GATTGCTGCC TTCAAATTTG ACAATGGAAA AATAAACCTG ACGAATCGTT TTGTTCGCAC AAAGGAATGG ACAGAAGAAG AAAAATCCCA AAAATTTCTA TATAGAGGTG TATTTGGAAC TCAAAAAGAA GGAGGAGTGT TAGCTAATGC TTTTGATGTA AGGCTAAAAA ATATTGCCAA TACACACGTA ATAAAACTTG GAGATGATCT ACTAGCTTTA TGGGAAGCAT CTAGTCCATA TTCACTTAAT CCAAATACCC TTGAGACCAA AGGTTTATCA AATTTAAAAG GAGTTTTAAA AAAAGGTGAA GCATTTAGTG CTCATCCACG ATTTGACCCT GGCCATCATC AAAGTCAAAG AATGGTCACT TTTGGGGTAT CTACAGGTCC TAAAAGCACA ATAAGATTGA TGGAATTTTC CACAAAGGGA GAAAATATTG GTTCTCTTTT AAGTGATAGA AAAGATTCTT TTAATGGATT TGCGTTCTTG CATGATTTTG CCATAACTCC AAACTGGGCA ATATTTCTGC AAAATGCTAT TAGTTTTAAT CCTCTTCCTT TTCTTCTTGG ACAAAAAGGA GCCGCACAAT GTTTAGCCTC TAAAAGTGAT GGAACTCCAA AATTTTTATT AATTCCAAGG GACTCTGGCA AGTTTGCTGG TCAACCTCCA AAATCAGTTG ATGCTCCAAA GGGTTTTGTT TTTCATCATC TAAATGCATG GGAAGATAAT GAAAAAATCA ATATTGAAAG TATTTTTTAT GATGATTTTC CGAGCATTGG ACCCGAAGAT AATTTTAGAG AAATTGATTT TGATCTTTTA CCAGAAGGAA TTTTGAAAAG AAGTGAAATC AATCCCATAG AAAATACATT TACCTGCTCA ACAATAAGCA ATCAATGTTG TGAATTTGCA ATGGTTAATC CTCATTTTGA AGGATTAAAG GCCCGCTTTA GTTGGATGGC AACTGCAGAA GAAAAAGAGG GGAATGGGCC ACTTCAAGCC ATAAAAAAAA TCGATTTATC TAATAATAAA GAGATAAGTT GGAGTGCGGC TCCAAGAGGT TTTGTAAGTG AACCTATATT TATTCCATCT CAAGAATCAA AGTCTGAAGA AGACAATGGA TGGGTTGTTG CATTGGTTTG GAATAGTATT AGATCGGGAA CTGATTTAAT AATTCTTGAT TCTAAAGATC TGACTGAAAA AGCTATTCTT GAAGTTCCAA TATCAATTCC ACATGGATTA CACGGAAGCT GGGTTGAAAA TTAA
|
Protein sequence | MAVSYLKRET SPKQTIFNKE DWSSAYCNVE KELDHVQLKL VKGSIPEQIS GTFYRNGPGR LERGGRWVHH PFDGDGMIAA FKFDNGKINL TNRFVRTKEW TEEEKSQKFL YRGVFGTQKE GGVLANAFDV RLKNIANTHV IKLGDDLLAL WEASSPYSLN PNTLETKGLS NLKGVLKKGE AFSAHPRFDP GHHQSQRMVT FGVSTGPKST IRLMEFSTKG ENIGSLLSDR KDSFNGFAFL HDFAITPNWA IFLQNAISFN PLPFLLGQKG AAQCLASKSD GTPKFLLIPR DSGKFAGQPP KSVDAPKGFV FHHLNAWEDN EKINIESIFY DDFPSIGPED NFREIDFDLL PEGILKRSEI NPIENTFTCS TISNQCCEFA MVNPHFEGLK ARFSWMATAE EKEGNGPLQA IKKIDLSNNK EISWSAAPRG FVSEPIFIPS QESKSEEDNG WVVALVWNSI RSGTDLIILD SKDLTEKAIL EVPISIPHGL HGSWVEN
|
| |