Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08961 |
Symbol | |
ID | 4717602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 770120 |
End bp | 771346 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640078608 |
Product | hypothetical protein |
Protein accession | YP_001009287 |
Protein GI | 123968429 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.359171 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTTC AACAAGGTTT AATCACAACA ATACATGAAT ATGGAGTTAC AGGAAATTTA CTTAAAGAAT TAAACAAAAG TCTTAAAAGA AGATCAACTA GCATTTTAAT ACCTTGCTTA TATGAGGAAT TTGAGCGTCC AGCATTAAAA GATATAAGAG AAGTTTTAAA AAACCTTACA GGCTTAAATG AATTAGTAAT TGCTCTCTCT GCAAAAACTG TTGAGCAAGT TAAGGCAGCA AAATCATTTT TTGACTCAAT GCCATTCCCA GTTCACGTTC AATGGACTAA TTCTCCCTCT GTAATTGAGC TATTGAAAAG CCAAGAAAAA AATGGATTAG AACTTTTAGG AACTCCAGGT AAAGGATGGG CTGTATGGCA AGGTATAGGA GTTGCGACTA GAAAATCAGA AGTTGTTGCT CTTTTTGATG CTGATATAAG AACTTTTAGT CCTTTATATC CTTCAAGAAT GATACTTCCA CTTCTGGATG AATCATATGG AATATCATAT GTAAAAGCTT TTTACAGTAG ATTATCTCTA GAAACAAATC AATTGCAAGG AAGAGCAACA AGATTATTCG TGGGTCCTTT ATTAGCAAGT CTGGAGCAAT TAGTTGGTAA GGGTCCTTTT TTACAATATC TTCAATCATT TAGATATCCA TTAGCAGGTG AGTTTGCTTT TACTAAAGAC CTTGCTATGA ATTTAAGAAT ACCTTGTGAC TGGGGTTTAG AGATAGGTTT ATTATCAGAG GTTTATAGAA ACGTAAGGAC CTCCAAAATA GCCCAGGTTG ACCTAGGTTT ATTTGACCAT AAACATAAGA ACATTGGAGA TTCTTCTAAA GAAGGATTGC AAAAAATGTG TACAGAAATA CTATCAAGTG TTTTGAGAGG TCTCATGGAG CATCAAGCCG AGACCTTAAC TAGCACTCAA CTAGCAACTT TAGAAGTTCT CTACAAAAGA GTTGGAGAAG ATCGGGTAAA ACAATTTGGA TTAGATTCAG CAGTTAATCA ACTTCCATAC GATAGGCACG AAGAAGAATT ATCAGTACAA AAGTTTGCGA AACTATTAAG ACCTGCTACA GAAGATTATC TGGCTTGTCC TACAACACTT CAATTACCAA GTTGGTCAAG GGTTCTATCT TGTGAGAACA AACTTCAAGA AGATTTAGCG ATTGCGGGGT CAAAAGACAT AAAGATAAGT GAAAAAGAAT TAATTAAAAA CTTCTAA
|
Protein sequence | MDFQQGLITT IHEYGVTGNL LKELNKSLKR RSTSILIPCL YEEFERPALK DIREVLKNLT GLNELVIALS AKTVEQVKAA KSFFDSMPFP VHVQWTNSPS VIELLKSQEK NGLELLGTPG KGWAVWQGIG VATRKSEVVA LFDADIRTFS PLYPSRMILP LLDESYGISY VKAFYSRLSL ETNQLQGRAT RLFVGPLLAS LEQLVGKGPF LQYLQSFRYP LAGEFAFTKD LAMNLRIPCD WGLEIGLLSE VYRNVRTSKI AQVDLGLFDH KHKNIGDSSK EGLQKMCTEI LSSVLRGLME HQAETLTSTQ LATLEVLYKR VGEDRVKQFG LDSAVNQLPY DRHEEELSVQ KFAKLLRPAT EDYLACPTTL QLPSWSRVLS CENKLQEDLA IAGSKDIKIS EKELIKNF
|
| |