Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15171 |
Symbol | |
ID | 4779092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1230968 |
End bp | 1232947 |
Gene Length | 1980 bp |
Protein Length | 659 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640084799 |
Product | HD superfamily hydrolase |
Protein accession | YP_001015339 |
Protein GI | 124026223 |
COG category | [R] General function prediction only |
COG ID | [COG1480] Predicted membrane-associated HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.450871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCTGTC TATTAGTAGC AATATTTTCA AGTTATAAGT TACTTGCTGT TCCGGATCTC AAACCTGGAG ATATTGCTCA AGTCAATGTA ATAGCCCCTA GAGATGCAAA GGTAATAGAC ACAACGGATT TAAAAGAAAA GAAACAAGGC TTAAAAGAAA GTTTTGTACA ATCAATAGAC AAGAATAAAT CAAGTGATTT AGAAAAGACT GTTTCCAAGC AAATCAATAT ACTTCGTACT CAAAAATTCA ACAATTTTGG AATCGATTTC AATGAATTCA ACATAACAAC TCTAGAGAAA GATTGGATAT TAAATGTTAA AGACAATGAA TGGGAAGAGT GGAAAAAAGA GATCAAAAAT GTTTCAAAAA AAATGCTTTC TCAGGGAATT ATTAATACAC TTGCACTTGA TCAACTTAAT GAAGCTTCTT CACTTCAATT AATAGATTTA GGTGAGAAAG ATTCTCCAAA TAGATCATTA GGTGCAAAAA TATTATCAAA TAGTTTTCAT CAAAAAAGTA ATTTAAAAAT TGATAAACTA AAAACTAATA TATTACTCGA AAATCTAATC AATCAAGAAG GTATAAACAC AATAAATGTT AAAGAGGGCA GTATAATATC AAGGAAAGGT AAGCCAATAA CTTCACAAGA GTTTGATATC CTAGAACATT TTAATAAAGT AAGTCGAAGT CCTAGGCCCT TAAAGTGGTT AATTACCTTC TCAGAATCCA TGGGAAGTTG TGGATTACTT CTTATGATCA TGAGAAGAGA AAAGCCTAGA CTTCAGGCCA GGCACGGATT ACTATCCTTA ACTTTATTAC TTGTAGTTCA ACTAACAAAA GATTGGCTTG GGCCTATAGC AAGTCCCATG CAATTAATAT TACCTCCCAC ATTGCTTCTT TCTCAAGGGA TAGGCACCAT AACATCATTG GCTTGGATGG CAGCTGCTAG TCTGATTTGG CCATCATCTC TTGGTGAATC AATTGAAGTC AGACTAATAA TTGCTTGTAT CGCTGGTTCA TTTATTGCAT TTTTAGGCAG AAGGATGAGA AGCCGAGCAC AGGTTCTTCA AATAGCTGTA TTTATTCCTT TTGGTGCATT ATTAGGGCAA TGGTTTATTC TTAATCAGGT AATTAAAGAA AATAATATAG AATTTAACAA TCTATCTATT GACCCTAATT CTCTTTTTAA TGAGACCATT ATTATTAGTT CAATATTAAT GGTAACAATA TTAATTATTC CAATTCTAGA AAATACATTT GGATTACTTA CTAGAGCAAG ATTAATGGAA CTTGCTGATC AAGAACGTCC TTTACTTCGT AGATTATCTA GAGAAGCTCC AGGGACGTTT GAACATACTT TAACAATTTT AAGCCTCGCC GAGGAAGGAG CAAGAGTTAT TGGAGCTGAT GTTGACTTAA TAAGAACTGG GGCTTTATAC CATGATGTAG GGAAATTGCA TGCTCCAAAT TGGTTTATTG AAAATCAGAA AGATGGAATA AACCCACACG ATGAAATAAA AAACCCTTAT AAAAGTGCCG ATATTCTTCA AGCCCATGTC GATGAAGGAT TGAAGCTTGC GAGGAAATAT CGACTTCCAT CTCCTATTGC TGATTTCATC CCAGAGCATC AAGGTACTTT AAAGATGGGA TATTTTCTTC ATAAAGCTAG AGAAAGTGAT CCTTCAGCCT CTGAAAAACG TTTTAGATAT AAAGGACCTA TTCCTCATTC AAAAGAAACT GGAATACTCA TGCTTGCAGA TGGCTGCGAA GCAGCATTAA GAGCTCTTGA CTCTTCTTCT TCAGATAAAG ATGCATGCAA AACAGTTAGA AAGATCATCC AATCTCGTCA GGTTGACGGT CAATTAAAAG AAAGTAGTTT AACCAGAGCA GAAATAGAAA TAATTCTTAG AGCCTTTGTC TCTGTATGGA GGAGAATGCG TCACAGACGC TTAAAATACC CAAGCTTCAA TCCAAGGTGA
|
Protein sequence | MVCLLVAIFS SYKLLAVPDL KPGDIAQVNV IAPRDAKVID TTDLKEKKQG LKESFVQSID KNKSSDLEKT VSKQINILRT QKFNNFGIDF NEFNITTLEK DWILNVKDNE WEEWKKEIKN VSKKMLSQGI INTLALDQLN EASSLQLIDL GEKDSPNRSL GAKILSNSFH QKSNLKIDKL KTNILLENLI NQEGINTINV KEGSIISRKG KPITSQEFDI LEHFNKVSRS PRPLKWLITF SESMGSCGLL LMIMRREKPR LQARHGLLSL TLLLVVQLTK DWLGPIASPM QLILPPTLLL SQGIGTITSL AWMAAASLIW PSSLGESIEV RLIIACIAGS FIAFLGRRMR SRAQVLQIAV FIPFGALLGQ WFILNQVIKE NNIEFNNLSI DPNSLFNETI IISSILMVTI LIIPILENTF GLLTRARLME LADQERPLLR RLSREAPGTF EHTLTILSLA EEGARVIGAD VDLIRTGALY HDVGKLHAPN WFIENQKDGI NPHDEIKNPY KSADILQAHV DEGLKLARKY RLPSPIADFI PEHQGTLKMG YFLHKARESD PSASEKRFRY KGPIPHSKET GILMLADGCE AALRALDSSS SDKDACKTVR KIIQSRQVDG QLKESSLTRA EIEIILRAFV SVWRRMRHRR LKYPSFNPR
|
| |